20.01.2015 Views

njit-etd2003-081 - New Jersey Institute of Technology

njit-etd2003-081 - New Jersey Institute of Technology

njit-etd2003-081 - New Jersey Institute of Technology

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

109<br />

3.14.1 Basic Concepts <strong>of</strong> Principal Components Analysis<br />

Suppose that there is a random sample <strong>of</strong> N observations on X 1 and X2 . For ease <strong>of</strong><br />

interpretation the sample mean is subtracted from each observation, thus obtaining<br />

Note that this technique makes the means <strong>of</strong> x 1 and x 2 equal to zero but does not<br />

alter the sample variances Si and S 2 or the correlation r. The basic idea is to create two<br />

new variables, C 1 and C2 called the principal components. These new variables are<br />

linear functions <strong>of</strong> x 1 and x 2 and can therefore be written as<br />

It should be noted that for any set <strong>of</strong> values <strong>of</strong> the coefficients a11 , a12 , a 21 , a 22 ,<br />

one can introduce the N observed .,j and x 2 and obtain N values <strong>of</strong> CI and C 2 . The<br />

means and variances <strong>of</strong> the N values <strong>of</strong> C 1 and C2 are<br />

where S,2 =VarXi. Equation 3.98 is true because the means <strong>of</strong> A and x 2 are zero.<br />

The coefficients are chosen to satisfy three requirements:<br />

1. The VarC1 is as large as possible.<br />

2. The N values <strong>of</strong> C 1 and C2 are uncorrelated.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!