24.06.2015 Views

Notes on Principal Component Analysis

Notes on Principal Component Analysis

Notes on Principal Component Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

variables or factors, and the estimati<strong>on</strong> of the factor structure). We<br />

will get sloppy ourselves later, but some people really get exercised<br />

about these things.<br />

We will begin working with the populati<strong>on</strong> (but everything translates<br />

more-or-less directly for a sample):<br />

Suppose [X 1 , X 2 , . . . , X p ] = X ′ is a set of p random variables, with<br />

mean vector µ and variance-covariance matrix Σ. I want to define p<br />

linear combinati<strong>on</strong>s of X ′ that represent the informati<strong>on</strong> in X ′ more<br />

parsim<strong>on</strong>iously. Specifically, find a 1 , . . . , a p such that a ′ 1X, . . . , a ′ pX<br />

gives the same informati<strong>on</strong> as X ′ , but the new random variables,<br />

a ′ 1X, . . . , a ′ pX, are “nicer”.<br />

Let λ 1 ≥ λ 2 ≥ · · · ≥ λ p ≥ 0 be the p roots (eigenvalues) of the<br />

matrix Σ, and let a 1 , . . . , a p be the corresp<strong>on</strong>ding eigenvectors. If<br />

some roots are not distinct, I can still pick corresp<strong>on</strong>ding eigenvectors<br />

to be orthog<strong>on</strong>al. Choose an eigenvector a i so a ′ ia i = 1, i.e., a<br />

normalized eigenvector. Then, a ′ iX is the i th principal comp<strong>on</strong>ent of<br />

the random variables in X ′ .<br />

Properties:<br />

1) Var(a ′ iX) = a ′ iΣa i = λ i<br />

We know Σa i = λ i a i , because a i is the eigenvector for λ i ; thus,<br />

a ′ iΣa i = a ′ iλ i a i = λ i . In words, the variance of the i th principal<br />

comp<strong>on</strong>ent is λ i , the root.<br />

Also, for all vectors b i such that b i is orthog<strong>on</strong>al to a 1 , . . . , a i−1 ,<br />

and b ′ ib i = 1, Var(b ′ iX) is the greatest it can be (i.e., λ i ) when<br />

b i = a i .<br />

2

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!