27.03.2013 Views

SPSS® 12.0 Command Syntax Reference

SPSS® 12.0 Command Syntax Reference

SPSS® 12.0 Command Syntax Reference

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

CLUSTER 229<br />

• The variable list is required except when matrix input is used. It must be specified before<br />

the optional subcommands.<br />

• If matrix input is used, the variable list can be omitted. The names for the items in the matrix<br />

are used to compute similarities or distances.<br />

• You can specify a variable list to override the names for the items in the matrix. This<br />

allows you to read in a subset of cases for analysis. Specifying a variable that does not<br />

exist in the matrix results in an error.<br />

MEASURE Subcommand<br />

MEASURE specifies the distance or similarity measure used to cluster cases.<br />

• If the MEASURE subcommand is omitted or included without specifications, squared<br />

Euclidean distances are used.<br />

• Only one measure can be specified.<br />

Measures for Interval Data<br />

For interval data, use any one of the following keywords on MEASURE:<br />

SEUCLID Squared Euclidean distance. The distance between two items, x and y, is the<br />

sum of the squared differences between the values for the items. SEUCLID is<br />

the measure commonly used with centroid, median, and Ward’s methods of<br />

clustering. SEUCLID is the default and can also be requested with keyword<br />

DEFAULT.<br />

SEUCLID( xy , ) Σi( xi – yi) 2<br />

=<br />

EUCLID Euclidean distance. This is the default specification for MEASURE. The distance<br />

between two items, x and y, is the square root of the sum of the squared<br />

differences between the values for the items.<br />

EUCLID( x, y)<br />

Σi( xi – yi) 2<br />

=<br />

CORRELATION Correlation between vectors of values. This is a pattern similarity measure.<br />

CORRELATION( xy , )<br />

Σi( ZxiZyi) = -------------------------<br />

N – 1<br />

where Z xi is the Z-score (standardized) value of x for the ith case or variable,<br />

and N is the number of cases or variables.<br />

COSINE Cosine of vectors of values. This is a pattern similarity measure.<br />

COSINE( x, y)<br />

Σi( xiy i)<br />

=<br />

----------------------------------<br />

( Σix 2<br />

i ) ( Σiy2 i )

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!