30.12.2012 Views

European Journal of Scientific Research - EuroJournals

European Journal of Scientific Research - EuroJournals

European Journal of Scientific Research - EuroJournals

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Feature Selection Based on Statistical Analysis 429<br />

KG-rule retains only those PCs whose variances, i.e. eigenvalues that are ≥ 1. Nevertheless, for<br />

large variable spaces p, the KG-rule usually retains too many PCs [4], [8][10].<br />

c) Cumulative Variance utters the criterion for choosing m is to select a cumulative variance<br />

threshold, t where t is at certain percentage <strong>of</strong> the total variance that the first m PCs should account<br />

for. The required number <strong>of</strong> PCs is then the smallest value <strong>of</strong> m for which the chosen percentage is<br />

exceeded [7][10]. From PCA theory, the variance <strong>of</strong> the i-th PC (eigenvector) is equal to its<br />

corresponding eigenvalues λi [6].<br />

Three different methods have been elucidated on how to retain the optimal number <strong>of</strong> PCs. In<br />

this study, we investigated the required number <strong>of</strong> PCs to be retained using these three methods.<br />

Statistical Analysis<br />

ANOVA is a standard technique for measuring the statistical significance <strong>of</strong> a set <strong>of</strong> independent<br />

variables. It takes a single feature and the class associated with the data samples and measures the<br />

significance <strong>of</strong> the class variables in predicting the means <strong>of</strong> the feature. The measure that ANOVA<br />

produces is the p-value for the feature set. On the other hand, Multiple Comparison Procedure (MCP)<br />

test is one that can be used to determine which means amongst a set <strong>of</strong> means differ from the rest. The<br />

ANOVA leads to a conclusion that there is evidence that the group means differ but the main goal is to<br />

determine which <strong>of</strong> the means are different. Therefore, the MCP is applied.<br />

The MCP test compares the difference between pair <strong>of</strong> means with appropriate adjustment for<br />

the multiple testing. The results <strong>of</strong> MCP present the p-value or confidence interval for each pair. In this<br />

study, after performing ANOVA on the eigenpostures, MCP is applied to assess which ‘eigenpostures’<br />

means are significantly different. Further, multiple range test is realized for testing homogeneous<br />

subsets <strong>of</strong> groups based on their group means. In doing so, the groups that differ significantly are<br />

revealed. Finally, we will determine the optimized number <strong>of</strong> eigenpostures that will act as inputs to<br />

the ANN for classification <strong>of</strong> the four main postures.<br />

Artificial Neural Network (ANN) as Classifier<br />

Artificial Neural Networks (ANN) are known for their ability to express highly nonlinear decision and<br />

makes them appropriate for recognition <strong>of</strong> complex pattern and the ability to maintain accuracy even<br />

when some input data are inapt. In this study, the multilayer perceptron is chosen for recognition<br />

purpose. A multilayer perceptron is a feed forward network structure in which neurons are connected<br />

only between two succeeding layers [9][12]. A feed forward neural net that performs the recognition<br />

part consist <strong>of</strong> one input layer with one neuron per feature, two hidden layers with four and three nodes<br />

in each layer and a single output.<br />

Experiments and Results<br />

A collection <strong>of</strong> 400 images <strong>of</strong> various human postures constitutes the database to generate the<br />

eigenpostures for this study. The various postures include all four main postures namely standing,<br />

sitting, bending and lying position for both gender with the subjects are either facing front or facing<br />

either side and no restriction impose on the type <strong>of</strong> clothing being worn. Initially, each image has m x n<br />

pixels, but eventually reshaped to a column vector <strong>of</strong> x mn. Then, the eigenvectors and eigenvalues are<br />

computed according to [3][11]. Implementing the three rules mentioned previously, we select the most<br />

suitable eigenpostures required as inputs to the classification system. In other words, we select the<br />

most relevant eigenvalues or PCs to be retained in this study.<br />

Figure 3 illustrates the results <strong>of</strong> the Scree test. The decrease in magnitude for successive<br />

eigenvalues implies that the first few principal components can approximate a large part <strong>of</strong> the original<br />

data’s variance. In this case, decision to retain the first thirty-five PCs is appropriate and they<br />

reasonably represent good approximation <strong>of</strong> the original data set. Next, from the PCA results, applying

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!