24.06.2015 Views

Notes on Principal Component Analysis

Notes on Principal Component Analysis

Notes on Principal Component Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

2) a i and a j are orthog<strong>on</strong>al, i.e., a ′ ia j = 0<br />

3) Cov(a ′ iX, a ′ jX) = a ′ iΣa j = a ′ iλ j a j = λ j a ′ ia j = 0<br />

4) Tr(Σ) = λ 1 + · · · + λ p = sum of variances for all p principal<br />

comp<strong>on</strong>ents, and for X 1 , . . . , X p . The importance of the i th principal<br />

comp<strong>on</strong>ent is<br />

λ i /Tr(Σ) ,<br />

which is equal to the variance of the i th principal comp<strong>on</strong>ent divided<br />

by the total variance in the system of p random variables,<br />

X 1 , . . . , X p ; it is the proporti<strong>on</strong> of the total variance explained by<br />

the i th comp<strong>on</strong>ent.<br />

If the first few principal comp<strong>on</strong>ents account for most of the variati<strong>on</strong>,<br />

then we might interpret these comp<strong>on</strong>ents as “factors” underlying<br />

the whole set X 1 , . . . , X p . This is the basis of principal factor<br />

analysis.<br />

The questi<strong>on</strong> of how many comp<strong>on</strong>ents (or factors, or clusters,<br />

or dimensi<strong>on</strong>s) usually has no definitive answer. Some attempt has<br />

been made to do what are called Scree plots, and graphically see how<br />

many comp<strong>on</strong>ents to retain. A plot is c<strong>on</strong>structed of the value of the<br />

eigenvalue <strong>on</strong> the y-axis and the number of the eigenvalue (e.g., 1, 2,<br />

3, and so <strong>on</strong>) <strong>on</strong> the x-axis, and you look for an “elbow” to see where<br />

to stop. Scree or talus is the pile of rock debris (detritus) at the foot<br />

of a cliff, i.e., the sloping mass of debris at the bottom of the cliff. I,<br />

unfortunately, can never see an “elbow”!<br />

If we let a populati<strong>on</strong> correlati<strong>on</strong> matrix corresp<strong>on</strong>ding to Σ be<br />

denoted as P, then Tr(P) = p, and we might use <strong>on</strong>ly those principal<br />

3

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!