MACHINE LEARNING TECHNIQUES - LASA
MACHINE LEARNING TECHNIQUES - LASA
MACHINE LEARNING TECHNIQUES - LASA
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
17<br />
2.1 Principal Component Analysis<br />
Principal component analysis (PCA) performs a linear transformation of the coordinate system, so<br />
as to maximize the variance of the data along the first principal axis of the new coordinate<br />
system, e.g. the first axis of the ellipse as illustrated in Figure 2-1.<br />
It is important that the data on which PCA is performed are correlated. If the data are<br />
independent, nothing can be achieved with PCA.<br />
Formalism:<br />
Consider a data set of<br />
i=<br />
1...<br />
{ j} j=<br />
1,...<br />
M<br />
MN -dimensional data points<br />
i i N<br />
X= x and x ∈ ° , i = 1,..., M :<br />
N<br />
PCA aims at finding a linear map A , such that:<br />
N A q<br />
A: ° ⎯⎯⎯⎯→ ° , with q ≤ N<br />
1<br />
{ }<br />
X ⎯⎯⎯⎯→ Y = AX , with Y = y ,...., y and each y ∈ °<br />
A M i q<br />
Algorithm:<br />
Classical batch algorithm for PCA goes as follows:<br />
The mean of the dataset is denoted by<br />
⎛⎛x<br />
+ .. + x<br />
⎝⎝ M<br />
+ .. + x<br />
M<br />
1 M 1<br />
M<br />
1 1 N N<br />
= ( 1, 1,..., N ) = E( X)<br />
=⎜⎜ ,..., ⎟⎟<br />
µ µ µ µ<br />
and the covariance matrix of the same data set is<br />
1 1<br />
⎛⎛x1 −µ 1,.........,<br />
xN<br />
−µ<br />
⎞⎞<br />
T<br />
N<br />
T BB ⎜⎜<br />
⎟⎟<br />
C = E{ X '( X ')<br />
} = , B = ......................................<br />
M ⎜⎜ ⎟⎟<br />
⎜⎜ M<br />
M<br />
x1 µ<br />
1,.........,<br />
x1<br />
µ ⎟⎟<br />
⎝⎝ −<br />
−<br />
N ⎠⎠<br />
whereby X ' = X- µ , i.e. X' is zero mean X.<br />
x<br />
⎞⎞<br />
⎠⎠<br />
(2.1)<br />
(2.2)<br />
The component cii<br />
is the variance of the vector of components i of all data points, denoted X .<br />
i<br />
The variance of a component indicates the spread of the component values around its mean<br />
value. The components of C, denoted by<br />
1<br />
c ( x )( x )<br />
M<br />
k<br />
k<br />
ij<br />
= ∑ i<br />
−µ i j<br />
−µ<br />
j<br />
M k = 1<br />
, are a measure of the<br />
© A.G.Billard 2004 – Last Update March 2011