01.11.2014 Views

MACHINE LEARNING TECHNIQUES - LASA

MACHINE LEARNING TECHNIQUES - LASA

MACHINE LEARNING TECHNIQUES - LASA

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

17<br />

2.1 Principal Component Analysis<br />

Principal component analysis (PCA) performs a linear transformation of the coordinate system, so<br />

as to maximize the variance of the data along the first principal axis of the new coordinate<br />

system, e.g. the first axis of the ellipse as illustrated in Figure 2-1.<br />

It is important that the data on which PCA is performed are correlated. If the data are<br />

independent, nothing can be achieved with PCA.<br />

Formalism:<br />

Consider a data set of<br />

i=<br />

1...<br />

{ j} j=<br />

1,...<br />

M<br />

MN -dimensional data points<br />

i i N<br />

X= x and x ∈ ° , i = 1,..., M :<br />

N<br />

PCA aims at finding a linear map A , such that:<br />

N A q<br />

A: ° ⎯⎯⎯⎯→ ° , with q ≤ N<br />

1<br />

{ }<br />

X ⎯⎯⎯⎯→ Y = AX , with Y = y ,...., y and each y ∈ °<br />

A M i q<br />

Algorithm:<br />

Classical batch algorithm for PCA goes as follows:<br />

The mean of the dataset is denoted by<br />

⎛⎛x<br />

+ .. + x<br />

⎝⎝ M<br />

+ .. + x<br />

M<br />

1 M 1<br />

M<br />

1 1 N N<br />

= ( 1, 1,..., N ) = E( X)<br />

=⎜⎜ ,..., ⎟⎟<br />

µ µ µ µ<br />

and the covariance matrix of the same data set is<br />

1 1<br />

⎛⎛x1 −µ 1,.........,<br />

xN<br />

−µ<br />

⎞⎞<br />

T<br />

N<br />

T BB ⎜⎜<br />

⎟⎟<br />

C = E{ X '( X ')<br />

} = , B = ......................................<br />

M ⎜⎜ ⎟⎟<br />

⎜⎜ M<br />

M<br />

x1 µ<br />

1,.........,<br />

x1<br />

µ ⎟⎟<br />

⎝⎝ −<br />

−<br />

N ⎠⎠<br />

whereby X ' = X- µ , i.e. X' is zero mean X.<br />

x<br />

⎞⎞<br />

⎠⎠<br />

(2.1)<br />

(2.2)<br />

The component cii<br />

is the variance of the vector of components i of all data points, denoted X .<br />

i<br />

The variance of a component indicates the spread of the component values around its mean<br />

value. The components of C, denoted by<br />

1<br />

c ( x )( x )<br />

M<br />

k<br />

k<br />

ij<br />

= ∑ i<br />

−µ i j<br />

−µ<br />

j<br />

M k = 1<br />

, are a measure of the<br />

© A.G.Billard 2004 – Last Update March 2011

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!