
Statistical inference for exploratory data analysis ... - Hadley Wickham


Statistical inference for graphics 4375

[Plot: 20 panels, numbered 1 to 20, each showing residuals (vertical axis, −20 to 20) against order (horizontal axis, 100 to 500).]

Figure 6. Residuals of the Boston Housing Data plotted against order in the data. What does the structure in the real plot indicate?

In supervised classification problems, especially when the size of the sample is small in relation to the number of variables, the PDA index is suitable. This index is designed to overcome the variance estimation problems that arise in the LDA index. When the sample size is small and the number of variables is large, the LDA index produces unreliable results, which express themselves as

Phil. Trans. R. Soc. A (2009)

Downloaded from rsta.royalsocietypublishing.org on January 7, 2010
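The variance estimation problem described for the LDA index can be illustrated numerically. The following is a minimal sketch on synthetic Gaussian data, not the paper's PDA index: the helper names `within_class_scatter` and `ridge_shrink`, the penalty weight `lam`, and the particular form of the penalty are all assumptions made for illustration. It shows that when the sample size is small relative to the number of variables, the within-class scatter matrix is rank deficient, and that shrinking it toward a scaled identity restores full rank.

```python
import numpy as np


def within_class_scatter(X, y):
    """Pooled within-class sum-of-squares matrix W (p x p)."""
    p = X.shape[1]
    W = np.zeros((p, p))
    for label in np.unique(y):
        Xg = X[y == label]
        centered = Xg - Xg.mean(axis=0)  # remove the class mean
        W += centered.T @ centered
    return W


def ridge_shrink(W, lam, n):
    """Hypothetical penalty (an assumption, not the paper's formula):
    shrink W toward a scaled identity matrix."""
    p = W.shape[0]
    return (1.0 - lam) * W + lam * n * np.eye(p)


# Synthetic example: few observations (n), many variables (p), g classes.
rng = np.random.default_rng(0)
n, p, g = 12, 40, 3
X = rng.normal(size=(n, p))
y = np.repeat(np.arange(g), n // g)

W = within_class_scatter(X, y)
rank_W = np.linalg.matrix_rank(W)        # cannot exceed n - g, far below p

W_pen = ridge_shrink(W, lam=0.5, n=n)
rank_pen = np.linalg.matrix_rank(W_pen)  # penalized matrix is full rank

print("rank of W:", rank_W, "of", p)
print("rank of penalized W:", rank_pen, "of", p)
```

Because `W` has rank at most `n - g`, its determinant is zero whenever `n - g < p`, so any index built on determinants or inverses of `W` is unstable in this regime. The penalized matrix has all eigenvalues bounded below by `lam * n`, which is the kind of stabilization a penalized index relies on.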
