Chapter 5
Multiple Regression with Two Predictor Variables

Multiple regression is an extension of simple linear regression in which more than one independent variable (X) is used to predict a single dependent variable (Y). The predicted value of Y is a linear transformation of the X variables such that the sum of squared deviations between the observed and predicted values of Y is minimized. The computations are more complex, however, because the interrelationships among all the variables must be taken into account in the weights assigned to the variables. The interpretation of the results of a multiple regression analysis is also more complex for the same reason.

With two independent variables the prediction of Y is expressed by the following equation:

Y'i = b0 + b1X1i + b2X2i

Note that this transformation is similar to the linear transformation of two variables discussed in the previous chapter except that the w's have been replaced with b's and the X'i has been replaced with a Y'i.

The "b" values are called regression weights and are computed in a way that minimizes the sum of squared deviations between the observed and predicted values of Y, in the same manner as in simple linear regression. The difference is that in simple linear regression only two weights, the intercept (b0) and slope (b1), were estimated, while in this case three weights (b0, b1, and b2) are estimated.

Example Data

The data used to illustrate the inner workings of multiple regression are presented below:

Example Homework Assignment
Y1 Y2 X1 X2 X3 X4
125 113 13 18 25 11
158 115 39 18 59 30
207 126 52 50 62 53
182 119 29 43 50 29
196 107 50 37 65 56
175 135 64 19 79 49
145 111 11 27 17 14
144 130 22 23 31 17
160 122 30 18 34 22
175 114 51 11 58 40
151 121 27 15 29 31
161 105 41 22 53 39
200 131 51 52 75 36
173 123 37 36 44 27
175 121 23 48 27 20
162 120 43 15 65 36
155 109 38 19 62 37
230 130 62 56 75 50
162 134 28 30 36 20
153 124 30 25 41 33

The example data can be obtained as a text file and as an SPSS data file.

If a student desires a more concrete description of this data file, the variables could be given the following meanings:

Y1 - A measure of success in graduate school.

X1 - A measure of intellectual ability.

X2 - A measure of "work ethic."

X3 - A second measure of intellectual ability.

X4 - A measure of spatial ability.

Y2 - Score on a major review paper.
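
The analyses below are carried out with SPSS, but readers who want to follow the computations in another environment can enter the table directly. The following is a minimal sketch in Python using numpy; the array names (Y1, X1, and so on) simply mirror the column labels of the table and are reused in the later sketches in this chapter.

import numpy as np

# Example data transcribed from the table above, one array per column
Y1 = np.array([125, 158, 207, 182, 196, 175, 145, 144, 160, 175,
               151, 161, 200, 173, 175, 162, 155, 230, 162, 153])
Y2 = np.array([113, 115, 126, 119, 107, 135, 111, 130, 122, 114,
               121, 105, 131, 123, 121, 120, 109, 130, 134, 124])
X1 = np.array([13, 39, 52, 29, 50, 64, 11, 22, 30, 51,
               27, 41, 51, 37, 23, 43, 38, 62, 28, 30])
X2 = np.array([18, 18, 50, 43, 37, 19, 27, 23, 18, 11,
               15, 22, 52, 36, 48, 15, 19, 56, 30, 25])
X3 = np.array([25, 59, 62, 50, 65, 79, 17, 31, 34, 58,
               29, 53, 75, 44, 27, 65, 62, 75, 36, 41])
X4 = np.array([11, 30, 53, 29, 56, 49, 14, 17, 22, 40,
               31, 39, 36, 27, 20, 36, 37, 50, 20, 33])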

Univariate Analysis

The first step in the analysis of multivariate data is a table of means and standard deviations. It is also recommended to examine histograms of all variables, looking for outliers, or scores that fall outside the range of the majority of scores. In a multiple regression analysis, these scores may have a large "influence" on the results of the analysis and are a cause for concern. In the case of the example data, the following means and standard deviations were computed using SPSS by clicking Analyze/Summarize/Descriptives.

 Descriptive statistics from the SPSS regression program
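
The same table of means and standard deviations can be reproduced outside SPSS. A short sketch, continuing from the arrays entered above (ddof=1 requests the sample standard deviation, which is what SPSS reports):

for name, v in [("Y1", Y1), ("Y2", Y2), ("X1", X1),
                ("X2", X2), ("X3", X3), ("X4", X4)]:
    # Mean and sample standard deviation of each variable
    print(name, round(v.mean(), 2), round(v.std(ddof=1), 2))   # e.g., Y1 has a mean of 169.45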

The Correlation Matrix

The second step is an analysis of bivariate relationships between variables. This can be done using a correlation matrix, generated using the Analyze/Correlate/Bivariate commands in SPSS.

 Correlation matrix from the SPSS regression program

In the case of the example data, it is noted that all X variables correlate significantly with Y1, while none correlate significantly with Y2. In addition, X1 is significantly correlated with X3 and X4, but not with X2. Interpreting the variables using the suggested meanings, success in graduate school could be predicted individually with measures of intellectual ability, spatial ability, and work ethic. The measures of intellectual ability were correlated with one another. Measures of intellectual ability and work ethic were not highly correlated. The score on the review paper could not be accurately predicted with any of the other variables.
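
The same matrix of Pearson correlations can be reproduced with numpy, continuing the sketch above (np.corrcoef returns the correlations only; the significance tests shown by SPSS would require an additional step, for example scipy.stats.pearsonr):

data = np.column_stack([Y1, Y2, X1, X2, X3, X4])
labels = ["Y1", "Y2", "X1", "X2", "X3", "X4"]
r = np.corrcoef(data, rowvar=False)   # 6 x 6 matrix of Pearson correlations
print(labels)
print(np.round(r, 3))   # e.g., r(X1, Y1) is about .76 and r(X1, X3) about .94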

A visual presentation of the scatter plots underlying the correlation matrix can be generated using the Graphs/Scatter/Matrix commands in SPSS.

These graphs may be examined for multivariate outliers that might not be found in the univariate view.

Three-dimensional scatter plots also permit a graphical representation of the same information as the multiple scatter plots. Using the Graphs/Scatter/3-D commands in SPSS results in the following two graphs.

 Predicting Y1 from X1 and X3

 Predicting Y1 from X1 and X2.

 

The Regression Weights

The formulas to compute the regression weights with two independent variables are available from various sources (Pedhazur, 1997). They are messy and do not provide a great deal of insight into the mathematical "meanings" of the terms. For that reason, computational procedures will be done entirely with a statistical package.

The multiple regression is done in SPSS by selecting Analyze/Regression/Linear. The interface should appear as follows:

 SPSS user interface for Regression.

In the first analysis, Y1 is the dependent variable and two independent variables are entered in the first block, X1 and X2. In addition, under the "Save..." option, both unstandardized predicted values and unstandardized residuals were selected.

The output consists of a number of tables. The "Coefficients" table presents the optimal weights in the regression model, as seen in the following.

 Coefficients table of SPSS Regression program.

Recalling the prediction equation, Y'i = b0 + b1X1i + b2X2i, the values for the weights can now be found by observing the "B" column under "Unstandardized Coefficients." They are b0 = 101.222, b1 = 1.000, and b2 = 1.071, and the regression equation appears as:

Y'i = 101.222 + 1.000X1i + 1.071X2i
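
The same weights can be obtained outside SPSS by ordinary least squares. A minimal sketch, continuing from the data arrays entered earlier and using numpy's least-squares solver (X_design and b are illustrative names, not SPSS output):

# Design matrix: a column of ones (for b0) followed by X1 and X2
X_design = np.column_stack([np.ones(len(Y1)), X1, X2])
b, *_ = np.linalg.lstsq(X_design, Y1, rcond=None)   # least-squares regression weights
print(np.round(b, 3))   # approximately [101.222, 1.000, 1.071], matching the Coefficients table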

The "Beta" column under "Standardized Coefficients" gives similar information, except all values of X and Y have been standardized (set to mean of zero and standard deviation of one) before the weights are computed. In this case the value of b0 is always 0 and not included in the regression equation. The equation and weights for the example data appear below.

ZY' = β1 ZX1 + β2 ZX2

ZY' = .608 ZX1 + .614 ZX2

The standardization of all variables allows a better comparison of regression weights, as the unstandardized weights are a function of the variance of both the Y and the X variables.
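
The standardized (Beta) weights can be reproduced by z-scoring every variable before fitting. A sketch continuing from the arrays above (the helper z is illustrative):

def z(v):
    # Standardize a variable to mean 0 and sample standard deviation 1
    return (v - v.mean()) / v.std(ddof=1)

Z_design = np.column_stack([z(X1), z(X2)])   # no column of ones: b0 is 0 for standardized data
beta, *_ = np.linalg.lstsq(Z_design, z(Y1), rcond=None)
print(np.round(beta, 3))   # approximately [.608, .614]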

   

Predicted and Residual Values

The values of Y1i can now be predicted using the following linear transformation.

Y'1i = 101.222 + 1.000X1i + 1.071X2i

Thus, the value of Y1i where X1i = 13 and X2i = 18 for the first student could be predicted as follows.

Y'11 = 101.222 + 1.000X11 + 1.071X21

Y'11 = 101.222 + 1.000 * 13 + 1.071 * 18

Y'11 = 101.222 + 13.000 + 19.278

Y'11 = 133.50

The scores for all students are presented below, as computed in the data file of SPSS. Note that the predicted Y score for the first student is 133.50. The predicted Y and residual values are automatically added to the data file when the unstandardized predicted values and unstandardized residuals are selected using the "Save" option.

 Viewing the saved results of the SPSS Regression program.

The difference between the observed and predicted score, Y - Y', is called a residual. This column has been computed, as has the column of squared residuals. The squared residuals, (Y - Y')2, may be computed in SPSS by squaring the residuals using the Transform/Compute commands.
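
Outside SPSS, the predicted scores, residuals, and squared residuals can be computed directly from the design matrix and weights obtained in the earlier sketch:

Y1_pred = X_design @ b        # predicted Y for every student
resid = Y1 - Y1_pred          # residuals, Y - Y'
sq_resid = resid ** 2         # squared residuals, (Y - Y')^2
print(round(Y1_pred[0], 2))   # about 133.50 for the first student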

The analysis of residuals can be informative. The larger the residual for a given observation, the larger the difference between the observed and predicted value of Y and the greater the error in prediction. In the example data, the regression under-predicted the Y value for observation 10 by a value of 10.98, and over-predicted the value of Y for observation 6 by a value of 10.60. In some cases the analysis of errors of prediction in a given model can direct the search for additional independent variables that might prove valuable in more complete models.

The residuals are assumed to be normally distributed when hypotheses are tested using analysis of variance (R2 change). Although analysis of variance is fairly robust with respect to this assumption, it is a good idea to examine the distribution of residuals, especially with respect to outliers. The distribution of residuals for the example data is presented below.

 Histogram of standardized residuals.

         

The Multiple Correlation Coefficient

The multiple correlation coefficient, R, is the correlation coefficient between the observed values of Y and the predicted values of Y. For this reason, the value of R will always be positive and will take on a value between zero and one. The direction of the multivariate relationship between the independent and dependent variables can be observed in the sign, positive or negative, of the regression weights. The interpretation of R is similar to the interpretation of the correlation coefficient, the closer the value of R to one, the greater the linear relationship between the independent variables and the dependent variable.

The value of R can be found in the "Model Summary" table of the SPSS output. In the case of the example data, the value for the multiple R when predicting Y1 from X1 and X2 is .968, a very high value.

 Model summary table of SPSS Regression output.

The multiple correlation coefficient squared ( R2 ) is also called the coefficient of determination. It may be found in the SPSS output alongside the value for R. The interpretation of R2 is similar to the interpretation of r2, namely the proportion of variance in Y that may be predicted by knowing the value of the X variables. The value for R squared will always be less than the value for R. In general the value of multiple R is to be preferred over R squared as a measure of relationship because R squared is measured in units of measurement squared while R is in terms of units of measurement.

The adjustment in the "Adjusted R Square" value in the output tables is a correction for the number of X variables included in the prediction model. In general, the smaller the N and the larger the number of variables, the greater the adjustment. In the example data, the results could be reported as "92.9% of the variance in the measure of success in graduate school can be predicted by measures of intellectual ability and work ethic."

                 

The Standard Error of Estimate

The standard error of estimate is a measure of error of prediction. The definitional formula for the standard error of estimate is an extension of the definitional formula in simple linear regression and is presented below.

 The definitional formula for the standard error of estimate.
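
In symbols, using the notation of this text, the standard error of estimate (call it s_est) is

s_est = sqrt[ Σ(Y - Y')² / (N - k) ]

where the numerator is the sum of squared residuals.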

The difference between this formula and the formula presented in an earlier chapter is in the denominator of the equation. In both cases the denominator is N - k, where N is the number of observations and k is the number of parameters which are estimated to find the predicted value of Y. In the case of simple linear regression, the number of parameters needed to be estimated was two, the intercept and the slope, while in the case of the example with two independent variables, the number was three, b0, b1, and b2.

The computation of the standard error of estimate using the definitional formula for the example data is presented below. The numerator, or sum of squared residuals, is found by summing the (Y-Y')2 column.


 Computing the standard error of estimate using the definitional formula.

Note that the value for the standard error of estimate agrees with the value given in the output table of SPSS.
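
The same value can be verified in the running Python sketch, using the squared residuals computed earlier:

N, k = len(Y1), 3                           # 20 observations, 3 estimated weights (b0, b1, b2)
s_est = np.sqrt(sq_resid.sum() / (N - k))   # standard error of estimate
print(round(s_est, 2))                      # about 6.54, the square root of the mean square residual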

     

The ANOVA Table

The ANOVA table output when both X1 and X2 are entered in the first block when predicting Y1 appears as follows.

 The ANOVA table from the SPSS Regression program.

Because the exact significance level is less than alpha, in this case assumed to be .05, the model with variables X1 and X2 significantly predicted Y1. As described in the chapter on testing hypotheses using regression, the Sum of Squares for the residual, 727.29, is the sum of the squared residuals (see the standard error of estimate above). The mean square residual, 42.78, is the squared standard error of estimate. The total sum of squares, 11420.95, is the sum of the squared differences between the observed values of Y and the mean of Y. The regression sum of squares, 10693.66, is the sum of squared differences between the predicted values from the full model, Y'i = b0 + b1X1i + b2X2i, and the predicted values from the minimal model, Y'i = b0. The regression sum of squares is also the difference between the total sum of squares and the residual sum of squares, 11420.95 - 727.29 = 10693.66. The regression mean square, 5346.83, is computed by dividing the regression sum of squares by its degrees of freedom. In this case the regression mean square is based on two degrees of freedom because two additional parameters, b1 and b2, were computed.

The following table illustrates the computation of the various sum of squares in the example data.

 Computing the Sums of Squares from a data table.

Note that this table is identical in principle to the table presented in the chapter on testing hypotheses in regression.
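
The same sums of squares can be reproduced from the residuals in the running sketch:

ss_total = ((Y1 - Y1.mean()) ** 2).sum()   # total sum of squares, about 11420.95
ss_resid = sq_resid.sum()                  # residual sum of squares, about 727.29
ss_reg = ss_total - ss_resid               # regression sum of squares, about 10693.66
ms_reg = ss_reg / 2                        # regression mean square (2 df), about 5346.83
ms_resid = ss_resid / (len(Y1) - 3)        # mean square residual, about 42.78
print(round(ms_reg / ms_resid, 2))         # the F value reported in the ANOVA table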

 

Changes in the Regression Weights

When more terms are added to the regression model, the regression weights change as a function of the relationships among the independent variables and between the independent variables and the dependent variable. This can be illustrated using the example data.

A minimal model, predicting Y1 from the mean of Y1, results in the following.

Y'i = b0

Y'i = 169.45

A partial model, predicting Y1 from X1, results in the following model.

Y'i = b0 + b1X1i

Y'i = 122.835 + 1.258 X1i

A second partial model, predicting Y1 from X2, is the following.

Y'i = b0 + b2X2i

Y'i = 130.425 + 1.341 X2i

As established earlier, the full regression model when predicting Y1 from X1 and X2 is

Y'i = b0 + b1X1i + b2X2i

Y'i = 101.222 + 1.000X1i + 1.071X2i

As can be observed, the values of both b1 and b2 change when both X1 and X2 are included in the regression model. The size and effect of these changes are the foundation for the significance testing of sequential models in regression.

R2 Change

The unadjusted R2 value will increase with the addition of terms to the regression model. The amount of change in R2 is a measure of the increase in predictive power of the independent variable or variables, given the independent variable or variables already in the model. For example, the effect of work ethic (X2) on success in graduate school (Y1) could be assessed given one already has a measure of intellectual ability (X1). The following table presents the results for the example data.

R2 and R2 change in successive models.
Variables in Equation R2 Increase in R2
None 0.00 -
X1 .584 .584
X1, X2 .936 .352
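
These R2 values can be reproduced with a small helper function; the helper below (r_squared) is purely illustrative, not part of any library, and continues from the data arrays entered earlier. The same helper can be reused for the X1, X3 comparison described next.

def r_squared(y, *predictors):
    # R-squared for an ordinary least-squares model with an intercept
    X = np.column_stack([np.ones(len(y))] + list(predictors))
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ w
    return 1 - (e @ e) / (((y - y.mean()) ** 2).sum())

r_x1 = r_squared(Y1, X1)          # about .584
r_x1x2 = r_squared(Y1, X1, X2)    # about .936
print(round(r_x1x2 - r_x1, 3))    # increase in R-squared when X2 is added, about .352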

A similar table can be constructed to evaluate the increase in predictive power of X3 given X1 is already in the model.

As can be seen, although both X2 and X3 individually correlate significantly with Y1, X2 contributes a fairly large increase in predictive power in combination with X1, while X3 does not. Because X1 and X3 are highly correlated with each other, knowledge of one necessarily implies knowledge of the other. In regression analysis terms, X2 in combination with X1 predicts unique variance in Y1, while X3 in combination with X1 predicts shared variance.

It is possible to do significance testing to determine whether the addition of another independent variable to the regression model significantly increases the value of R2. This significance test is the topic of the next section.

   

Sequential Significance Testing

In order to test whether a variable adds significant predictive power to a regression model, it is necessary to construct the regression model in stages or blocks. This is accomplished in SPSS by entering the independent variables in different blocks. For example, if the increase in predictive power of X2 after X1 has been entered in the model was desired, then X1 would be entered in the first block and X2 in the second block. The following demonstrates how to construct these sequential models. The figure below illustrates how X1 is entered in the model first.

 The SPSS user interface for the Regression program showing sequential model testing.

The next figure illustrates how X2 is entered in the second block.

 The second SPSS user interface for the Regression program showing sequential model testing.

In order to obtain the desired hypothesis test, click on the "Statistics..." button and then select the "R squared change" option, as presented below.

 Selecting the R squared change option in the SPSS Regression command.

The additional output obtained by selecting this option includes a model summary,

 The model summary table with R squared change option of SPSS Regression output.

an ANOVA table,

 The ANOVA tables in the SPSS Regression program showing sequential hypothesis testing.

and a table of coefficients.

 The coefficients tables in the SPSS Regression program showing sequential hypothesis testing.

The only new information presented in these tables is in the model summary and the "Change Statistics" entries. The critical new entry is the test of the significance of R2 change for model 2. In this case the change is statistically significant. It could be said that X2 adds significant predictive power in predicting Y1 after X1 has been entered into the regression model.
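
For readers who want to check this test by hand, the "Sig. F Change" entry can be reproduced with the standard F test for a change in R2. A sketch, continuing from the r_squared helper above and using scipy only for the p-value:

from scipy import stats

n = len(Y1)
r2_reduced = r_squared(Y1, X1)            # model with X1 only
r2_full = r_squared(Y1, X1, X2)           # model with X1 and X2
df1, df2 = 1, n - 3                       # one added predictor; n minus three estimated weights
F_change = ((r2_full - r2_reduced) / df1) / ((1 - r2_full) / df2)
p_value = stats.f.sf(F_change, df1, df2)  # the "Sig. F Change" for adding X2 after X1
print(round(F_change, 2), p_value)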

Conducting a similar hypothesis test for the increase in predictive power of X3 when X1 is already in the model produces the following model summary table.

 The model summary table with R squared change option of SPSS Regression output.

Note that in this case the change is not significant. The table of coefficients also presents some interesting relationships.

Note that the "Sig." level for the X3 variable in model 2 (.562) is the same as the "Sig. F Change" in the preceding table. The interpretation of the "Sig." level for the "Coefficients" is now apparent: it is the significance of the addition of that variable given all the other independent variables are already in the regression equation. Note also that the "Sig." value for X1 in Model 2 is .039, still significant, but less than the significance of X1 alone (Model 1, with a value of .000). Thus a variable may become "less significant" in combination with another variable than by itself.

       

Visual Representation of Multiple Regression

The regression equation, Y'i = b0 + b1X1i + b2X2i, defines a plane in a three-dimensional space. If Y' were computed for all possible combinations of X1 and X2, all of the resulting points would fall on a two-dimensional surface. This surface can be found by computing Y' for three arbitrarily chosen (X1, X2) pairs, plotting these points in a three-dimensional space, and then fitting a plane through the points. The plane is represented in the three-dimensional rotating scatter plot as a yellow surface.

The residuals can be represented as the distance from the points to the plane parallel to the Y-axis. Residuals are represented in the rotating scatter plot as red lines.

Graphically, multiple regression with two independent variables fits a plane to a three-dimensional scatter plot such that the sum of squared residuals is minimized. The multiple regression plane is represented below for Y1 predicted by X1 and X2.

A similar relationship is presented below for Y1 predicted by X1 and X3.


While humans have difficulty visualizing data with more than three dimensions, mathematicians have no such problem thinking about them mathematically. When dealing with more than three dimensions, mathematicians talk about fitting a hyperplane in hyperspace.

 

Variations of relationships

With three variables involved, X1, X2, and Y, many varieties of relationships between variables are possible. It will prove instructive to explore three such relationships.

Unrelated Independent Variables

In this example, both X1 and X2 are correlated with Y, and X1 and X2 are uncorrelated with each other. In the example data, X1 and X2 are correlated with Y1 with values of .764 and .769 respectively. The independent variables, X1 and X2, are correlated with a value of .255, not exactly zero, but close enough. In this case X1 and X2 contribute independently to predict the variability in Y. It doesn't matter much which variable is entered into the regression equation first and which variable is entered second.

The following table of R square change predicts Y1 with X1 and then with both X1 and X2.

 The model summary table with R squared change option of SPSS Regression output

The next table of R square change predicts Y1 with X2 and then with both X1 and X2.

The R square change attributable to X1 is .584 when it is entered first (Model 1 in the first table) and .345 when it is entered second (Model 2 in the second table). The two values are not identical, but they are fairly close. If the correlation between X1 and X2 had been 0.0 instead of .255, the R square change values would have been identical.
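
This near-symmetry can be checked with the r_squared helper defined earlier; a brief sketch:

first = r_squared(Y1, X1)                            # R-squared change for X1 entered first, about .584
second = r_squared(Y1, X1, X2) - r_squared(Y1, X2)   # R-squared change for X1 entered second, about .345
print(round(first, 3), round(second, 3))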

Because of the structure of the relationships between the variables, slight changes in the regression weights would rather dramatically increase the errors in the fit of the plane to the points.

Related Predictor Variables

In this case, both X1 and X2 are correlated with Y, and X1 and X2 are correlated with each other. In the example data, X1 and X3 are correlated with Y1 with values of .764 and .687 respectively. The independent variables, X1 and X3, are correlated with a value of .940. In this situation it makes a great deal of difference which variable is entered into the regression equation first and which is entered second.

Entering X1 first and X3 second results in the following R square change table.

 The model summary table with R squared change option of SPSS Regression output.

Entering X3 first and X1 second results in the following R square change table.

 The model summary table with R squared change option of SPSS Regression output.

As before, both tables end up at the same place, in this case with an R2 of .592. In this case, however, it makes a great deal of difference whether a variable is entered into the equation first or second. Variable X3, for example, if entered first has an R square change of .561. If entered second after X1, it has an R square change of .008. In the first case it is statistically significant, while in the second it is not.
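
The same comparison for the highly correlated pair X1 and X3 shows how strongly the order matters; a brief sketch using the same helper:

x3_first = r_squared(Y1, X3)                            # R-squared change for X3 entered first, about .561
x3_second = r_squared(Y1, X1, X3) - r_squared(Y1, X1)   # R-squared change for X3 entered second, about .008
print(round(x3_first, 3), round(x3_second, 3))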

As two independent variables become more highly correlated, the solution to the optimal regression weights becomes unstable. This can be seen in the rotating scatter plots of X1, X3, and Y1. The plane that models the relationship could be modified by rotating around an axis in the middle of the points without greatly changing the degree of fit. The solution to the regression weights becomes unstable. That is, there are any number of solutions to the regression weights which will give only a small difference in sum of squared residuals. This is called the problem of multicollinearity in mathematical vernacular.

Suppressor Variables

One of the many varieties of relationships occurs when neither X1 nor X2 individually correlates with Y, X1 correlates with X2, but X1 and X2 together correlate highly with Y. This phenomenon may be observed in the relationships of Y2, X1, and X4. In the example data neither X1 nor X4 is highly correlated with Y2, with correlation coefficients of .251 and .018 respectively. Variables X1 and X4 are correlated with a value of .847. Fitting X1 followed by X4 results in the following tables.

 The model summary table with R squared change option of SPSS Regression output.

The coefficients tables in the SPSS Regression program showing sequential hypothesis testing with a suppressor variable

In this case, the regression weights of both X1 and X4 are significant when entered together, but insignificant when entered individually. It is also noted that the regression weight for X1 is positive (.769) and the regression weight for X4 is negative (-.783). In this case the variance in X1 that does not account for variance in Y2 is cancelled or suppressed by knowledge of X4. Variable X4 is called a suppressor variable.

In terms of the descriptions of the variables, if X1 is a measure of intellectual ability and X4 is a measure of spatial ability, it might be reasonably assumed that X1 is composed of both verbal ability and spatial ability. If the score on a major review paper is correlated with verbal ability and not spatial ability, then subtracting spatial ability from general intellectual ability would leave verbal ability. This explains the high multiple R when spatial ability is, in effect, subtracted from general intellectual ability. It is for this reason that X1 and X4, while not correlated individually with Y2, in combination correlate fairly highly with Y2.
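
The suppressor pattern can be verified with the same r_squared helper: individually neither X1 nor X4 accounts for much of the variance in Y2, but together they do. A brief sketch:

print(round(r_squared(Y2, X1), 3))       # X1 alone: small (r = .251, so R-squared about .06)
print(round(r_squared(Y2, X4), 3))       # X4 alone: essentially zero (r = .018)
print(round(r_squared(Y2, X1, X4), 3))   # X1 and X4 together: substantially larger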

Summary

Multiple regression predicting a single dependent variable with two independent variables is conceptually similar to simple linear regression, predicting a single dependent variable with a single independent variable, except that more weights are estimated and, rather than fitting a line to a two-dimensional scatter plot, a plane is fitted to describe a three-dimensional scatter plot. Interpretation of the results is complicated by both the relationship between the two independent variables and their relationships with the dependent variable.

A variety of relationships and interactions between the variables were then explored. The relationships discussed barely scratch the surface of the possibilities. Suffice it to say that the more variables are included in an analysis, the greater the complexity of the analysis. Multiple regression is usually done with more than two independent variables. The next chapter will discuss issues related to more complex regression models.