20.01.2015 Views

Answers to selected problems in chapters 18, 19

Answers to selected problems in chapters 18, 19

Answers to selected problems in chapters 18, 19

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Simple and Multiple Regression<br />

Chapter <strong>18</strong> Simple L<strong>in</strong>ear Regression<br />

(Data file lowbwt.sav (SPSS format) is <strong>in</strong> the CD that came with the book)<br />

9.<br />

a)<br />

90<br />

80<br />

70<br />

RESPONSE OF SYSTOLIC BLOOD<br />

PRESSURE VERSUS GESTATIONAL AGE<br />

sys<strong>to</strong>lic blood pressure<br />

60<br />

50<br />

40<br />

30<br />

20<br />

10<br />

22<br />

24<br />

26<br />

28<br />

30<br />

32<br />

34<br />

36<br />

gestational age<br />

b) If us<strong>in</strong>g SPSS the steps would be Analyze -> Regression -> L<strong>in</strong>ear and select the sbp<br />

(sys<strong>to</strong>lic blood pressure) as the dependent variable and gestage (gestational age) as the<br />

<strong>in</strong>dependent variable. Click on save but<strong>to</strong>n and check on save the unstandardized predicted<br />

values and unstandardized residuals, and also check the prediction <strong>in</strong>tervals for mean and<br />

<strong>in</strong>dividual, and click Cont<strong>in</strong>ue and click OK. You should see a list of output tables <strong>in</strong> the<br />

SPSS output w<strong>in</strong>dow.<br />

Coefficients a<br />

Model<br />

1<br />

(Constant)<br />

gestational age<br />

Unstandardized<br />

Coefficients<br />

B<br />

Std. Error<br />

Standardiz<br />

ed<br />

Coefficient<br />

s<br />

Beta<br />

10.552 12.651 .834 .406<br />

1.264 .436 .281 2.898 .005<br />

t<br />

Sig.<br />

a. Dependent Variable: sys<strong>to</strong>lic blood pressure<br />

Slope: 1.264<br />

y-<strong>in</strong>tercept of l<strong>in</strong>e: 10.552<br />

sbp = 10.552 + 1.264*gestage<br />

The rate of <strong>in</strong>creas<strong>in</strong>g <strong>in</strong> sys<strong>to</strong>lic blood pressure is 1.264 for per unit (week) <strong>in</strong>creases <strong>in</strong><br />

gestational age.<br />

1


Simple and Multiple Regression<br />

c) Ho: β = 0<br />

H1: β ≠ 0<br />

p-value is equal <strong>to</strong> .005, which is greater than .05 significance level, we reject the null<br />

hypothesis. There is significant l<strong>in</strong>ear relation between the response (sbp) and the explana<strong>to</strong>ry<br />

variables (gestational age).<br />

d) sbp = 10.552 + 1.264*gestage,<br />

gestage = 31 => sbp = 10.552 + 1.264*31 = 49.736<br />

e) The 95% prediction <strong>in</strong>terval for the mean sys<strong>to</strong>lic blood pressure is:<br />

(46.90159, 52.59409) This can be found <strong>in</strong> the data edi<strong>to</strong>r and <strong>in</strong> the case that has<br />

gestational age of 31.<br />

f) The predicted sys<strong>to</strong>lic blood pressure for the child is =10.552 + 1.264*31= 49.736.<br />

g) The 95% prediction <strong>in</strong>terval for the new value of sys<strong>to</strong>lic blood pressure is:<br />

(27.73488, 71.76080) This can be found <strong>in</strong> the data edi<strong>to</strong>r and <strong>in</strong> the case that has<br />

gestational age of 31.<br />

h) The coefficient of determ<strong>in</strong>ation is .079 that means about 8 percent of variability <strong>in</strong> the<br />

variation of sys<strong>to</strong>lic blood pressure can be expla<strong>in</strong>ed by gestational age. The variability of<br />

error seems pretty uniformly spread throughout the range of the predicted values. Therefore<br />

the regression model seems <strong>to</strong> fit the data.<br />

There is no evidence that the assumption of homoscedasticity has been violated or that a<br />

transformation of either the response or the explana<strong>to</strong>ry variable is necessary.<br />

Model Summary b<br />

Model<br />

1<br />

R<br />

R Square<br />

Adjusted R<br />

Square<br />

Std. Error of<br />

the Estimate<br />

.281 a .079 .070 11.00<br />

a. Predic<strong>to</strong>rs: (Constant), gestational age<br />

b. Dependent Variable: sys<strong>to</strong>lic blood pressure<br />

50<br />

40<br />

30<br />

20<br />

10<br />

Unstandardized Residual<br />

0<br />

-10<br />

-20<br />

-30<br />

38<br />

40<br />

42<br />

44<br />

46<br />

48<br />

50<br />

52<br />

54<br />

56<br />

Unstandardized Predicted Value<br />

2


Simple and Multiple Regression<br />

Chapter <strong>19</strong><br />

Multiple Regression<br />

8.<br />

a)<br />

90<br />

80<br />

response of sys<strong>to</strong>lic blood<br />

pressure versus five-m<strong>in</strong>ute apgar<br />

70<br />

sys<strong>to</strong>lic blood pressure<br />

60<br />

50<br />

40<br />

30<br />

20<br />

10<br />

-2<br />

0<br />

2<br />

4<br />

6<br />

8<br />

10<br />

apgar score at five m<strong>in</strong>utes<br />

It seems like there is l<strong>in</strong>ear but weak relationship between these two variables.<br />

b) apgar5 <strong>in</strong> SPSS file represents apgar score at five m<strong>in</strong>utes. Do the same th<strong>in</strong>g SPSS as the<br />

exercise <strong>in</strong> chapter <strong>18</strong>, except <strong>to</strong> <strong>in</strong>clude one more variable apgar5 <strong>in</strong> the <strong>in</strong>dependent variable<br />

list.<br />

Model<br />

1<br />

(Constant)<br />

gestational age<br />

apgar score at five m<strong>in</strong>utes<br />

a. Dependent Variable: sys<strong>to</strong>lic blood pressure<br />

Sbp =9.803+1.<strong>18</strong>5*gestage + .448*apgar5<br />

Coefficients a<br />

Unstandardized<br />

Coefficients<br />

B<br />

Std. Error<br />

Standardiz<br />

ed<br />

Coefficient<br />

s<br />

Beta<br />

9.803 12.663 .774 .441<br />

1.<strong>18</strong>5 .442 .263 2.678 .009<br />

.488 .461 .104 1.057 .293<br />

The 1.<strong>18</strong>5 means that rate of <strong>in</strong>creas<strong>in</strong>g <strong>in</strong> sys<strong>to</strong>lic blood pressure is 1.<strong>18</strong>5 for per unit (week)<br />

<strong>in</strong>creases <strong>in</strong> gestational age.<br />

The .448 means that rate of <strong>in</strong>creas<strong>in</strong>g <strong>in</strong> sys<strong>to</strong>lic blood pressure is .448 for per unit <strong>in</strong>creases<br />

<strong>in</strong> apgar score.<br />

t<br />

Sig.<br />

3


Simple and Multiple Regression<br />

c) Estimated mean sys<strong>to</strong>lic blood pressure is<br />

= 9.803+1.<strong>18</strong>5*31 + .448*7<br />

= 49.674<br />

d) The 95% prediction <strong>in</strong>terval for the sys<strong>to</strong>lic blood pressure at gestage = 31 and apgar5 = 7 is:<br />

(47.07655, 52.81469) See data edi<strong>to</strong>r for a case that gestage = 31 and apgar5 = 7.<br />

e) H0: β<br />

2<br />

= 0<br />

H1: β2<br />

≠ 0<br />

Refer <strong>to</strong> the SPSS table <strong>in</strong> question b, p-value is equal <strong>to</strong> .293 which is greater than the<br />

significance level .05, aga<strong>in</strong> we fail <strong>to</strong> reject the null hypothesis. This means the apgar score<br />

is not a significant fac<strong>to</strong>r <strong>in</strong> predict<strong>in</strong>g the sys<strong>to</strong>lic blood pressure.<br />

f)<br />

Model<br />

1<br />

R<br />

Model Summary b<br />

R Square<br />

Adjusted R<br />

Square<br />

Std. Error of<br />

the Estimate<br />

.299 a .089 .071 10.99<br />

a. Predic<strong>to</strong>rs: (Constant), apgar score at five m<strong>in</strong>utes,<br />

gestational age<br />

b. Dependent Variable: sys<strong>to</strong>lic blood pressure<br />

R square is <strong>in</strong>creased from .079 (model without gestage - table <strong>in</strong> f) <strong>to</strong> .089(model with<br />

gestage -table <strong>in</strong> d)<br />

g) The graph shows that there is no evidence for <strong>to</strong> say that the assumption of homoscedasticity<br />

has been violated or that a transformation of either the response or the explana<strong>to</strong>ry variable is<br />

necessary.<br />

40<br />

30<br />

20<br />

10<br />

Unstandardized Residual<br />

0<br />

-10<br />

-20<br />

-30<br />

30<br />

40<br />

50<br />

60<br />

Unstandardized Predicted Value<br />

4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!