12.07.2015 Views

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

14.2. Weights, Metrics, Transformations and Centerings 385PC framework above unless the w ij can be written as products w ij =ω i φ j ,i=1, 2,...,n; j =1, 2,...,p, although this method involves similarideas. The examples given by Gabriel and Zamir (1979) can be expressed ascontingency tables, so that correspondence analysis rather than PCA maybe more appropriate, and Greenacre (1984), too, develops generalized PCAas an offshoot of correspondence analysis (he shows that another specialcase of the generalized SVD (14.2.2) produces correspondence analysis, aresult which was discussed further in Section 13.1). The idea of weightingcould, however, be used in PCA for any type of data, provided that suitableweights can be defined.Gabriel and Zamir (1979) suggest a number of ways in which special casesof their weighted analysis may be used. As noted in Section 13.6, it canaccommodate missing data by giving zero weight to missing elements of X.Alternatively, the analysis can be used to look for ‘outlying cells’ in a datamatrix. This can be achieved by using similar ideas to those introducedin Section 6.1.5 in the context of choosing how many PCs to retain. Anyparticular element x ij of X is estimated by least squares based on a subsetof the data that does not include x ij . This (rank m) estimate mˆx ij isreadily found by equating to zero a subset of weights in (14.2.5), includingw ij , The difference between x ij and mˆx ij provides a better measure of the‘outlyingness’ of x ij compared to the remaining elements of X, than doesthe difference between x ij and a rank m estimate, m˜x ij , based on the SVDfor the entire matrix X. This result follows because mˆx ij is not affected byx ij , whereas x ij contributes to the estimate m˜x ij .Commandeur et al. (1999) describe how to introduce weights for bothvariables and observations into Meulman’s (1986) distance approach tononlinear multivariate data analysis (see Section 14.1.1).In the standard atmospheric science set-up, in which variables correspondto spatial locations, weights may be introduced to take account of unevenspacing between the locations where measurements are taken. The weightsreflect the size of the area for which a particular location (variable) isthe closest point. This type of weighting may also be necessary when thelocations are regularly spaced on a latitude/longitude grid. The areas of thecorresponding grid cells decrease towards the poles, and allowance shouldbe made for this if the latitudinal spread of the data is moderate or large. Anobvious strategy is to assign to the grid cells weights that are proportionalto their areas. However, if there is a strong positive correlation within cells,it can be argued that doubling the area, for example, does not double theamount of independent information and that weights should reflect this.Folland (1988) implies that weights should be proportional to (Area) c ,where c is between 1 2and 1. Hannachi and O’Neill (2001) weight their databy the cosine of latitude.Buell (1978) and North et al. (1982) derive weights for irregularly spacedatmospheric data by approximating a continuous version of PCA, based onan equation similar to (12.3.1).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!