20.01.2015 Views

njit-etd2003-081 - New Jersey Institute of Technology

njit-etd2003-081 - New Jersey Institute of Technology

njit-etd2003-081 - New Jersey Institute of Technology

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

148<br />

4.2.11 Cluster Analysis<br />

The same data used for PCA in the previous section were also used for cluster analysis<br />

in separating the normal and COPD subjects and severity classification. The data set<br />

was entered into the Matlab clustering program. The program executed according to the<br />

following syntax:<br />

T = clusterdata(X,cut<strong>of</strong>f)<br />

where:<br />

X was a matrix <strong>of</strong> size m by n, interpreted as m observations <strong>of</strong> n variables.<br />

cut<strong>of</strong>f was a threshold value that determines how the cluster function creates clusters.<br />

The value <strong>of</strong> cut<strong>of</strong>f determined how clusterdata interpreted it.<br />

If 0 < cut<strong>of</strong>f < 1, cut<strong>of</strong>f was interpreted as the threshold for the inconsistency<br />

coefficient. The inconsistency coefficient quantified the degree <strong>of</strong> difference between<br />

objects in the hierarchical cluster tree. If the inconsistency coefficient <strong>of</strong> a link was<br />

greater than the threshold, the cluster function would use the link as a boundary for a<br />

cluster grouping.<br />

If cut<strong>of</strong>f >= 1 cut<strong>of</strong>f was interpreted as the maximum number <strong>of</strong> clusters to retain in the<br />

hierarchical tree.<br />

The output, T, was a vector <strong>of</strong> size m that identifies, by number, the cluster in<br />

which each object was grouped. The program used the clusterdata function to compute<br />

the distances between items in the dataset and create a hierarchical cluster tree from the<br />

dataset. Finally, the clusterdata function grouped the items in the dataset into the<br />

number <strong>of</strong> cluster specified by the value "cut<strong>of</strong>f'.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!