24.06.2015 Views

Notes on Principal Component Analysis

Notes on Principal Component Analysis

Notes on Principal Component Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

comp<strong>on</strong>ents that have variance of λ i ≥ 1 — otherwise, the comp<strong>on</strong>ent<br />

would “explain” less variance than would a single variable.<br />

A major rub — if I do principal comp<strong>on</strong>ents <strong>on</strong> the correlati<strong>on</strong><br />

matrix, P, and <strong>on</strong> the original variance-covariance matrix, Σ, the<br />

structures obtained are generally different. This is <strong>on</strong>e reas<strong>on</strong> the<br />

“true believers” might prefer a factor analysis model over a PCA because<br />

the former holds out some hope for an invariance (to scaling).<br />

Generally, it seems more reas<strong>on</strong>able to always use the populati<strong>on</strong><br />

correlati<strong>on</strong> matrix, P; the units of the original variables become irrelevant,<br />

and it is much easier to interpret the principal comp<strong>on</strong>ents<br />

through their coefficients.<br />

The j th principal comp<strong>on</strong>ent is a ′ jX:<br />

Cov(a ′ jX, X i ) = Cov(a ′ jX, b ′ X), where b ′ = [0 · · · 0 1 0 · · · 0],<br />

with the 1 in the i th positi<strong>on</strong>, = a ′ jΣb = b ′ Σa j = b ′ λ j a j = λ j<br />

times the i th comp<strong>on</strong>ent of a j = λ j a ij . Thus, Cor(a ′ jX, X i ) =<br />

λ j a ij<br />

√<br />

λj σ i<br />

=<br />

√<br />

λj a ij<br />

σ i<br />

,<br />

where σ i is the standard deviati<strong>on</strong> of X i . This correlati<strong>on</strong> is called<br />

the loading of X i <strong>on</strong> the j th comp<strong>on</strong>ent. Generally, these correlati<strong>on</strong>s<br />

can be used to see the c<strong>on</strong>tributi<strong>on</strong> of each variable to each of the<br />

principal comp<strong>on</strong>ents.<br />

If the populati<strong>on</strong> covariance matrix, Σ, is replaced by the sample<br />

covariance matrix, S, we obtain sample principal comp<strong>on</strong>ents; if the<br />

populati<strong>on</strong> correlati<strong>on</strong> matrix, P, is replaced by the sample correlati<strong>on</strong><br />

matrix, R, we again obtain sample principal comp<strong>on</strong>ents. These<br />

structures are generally different.<br />

4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!