Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s) Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
11.3. Simplified Approximations to Principal Components 295are given that show how the optimum threshold changes as sample sizeincreases (it decreases) for unrotated, orthogonally rotated and obliquelyrotated PCs. The precision of the results is indicated by boxplots. Thepaper provides useful guidance on choosing a threshold in the spatial atmosphericscience setting if truncation is to be done. However, it largelyignores the fact stressed above that neither the size of loadings nor thatof correlations are necessarily reliable indicators of how to interpret a PC,and hence that truncation according to a fixed threshold should be usedwith extreme caution.Interpretation can play a rôle in variable selection. Section 6.3 defines anumber of criteria for choosing a subset of variables; these criteria are basedon how well the subset represents the full data set in one sense or another.Often there will be several, perhaps many for large p, subsets that do almostas well as the best in terms of the chosen criterion. To decide between themit may be desirable to select those variables that are most easily measuredor, alternatively, those that can be most easily interpreted. Taking this trainof thought a little further, if PCs are calculated from a subset of variables,it is preferable that the chosen subset gives PCs that are easy to interpret.Al-Kandari and Jolliffe (2001) take this consideration into account in astudy that compares the merits of a number of variable selection methodsand also compares criteria for assessing the value of subsets. A much fullerdiscussion is given by Al-Kandari (1998).11.3.1 Principal Components with Homogeneous, Contrastand Sparsity ConstraintsChipman and Gu (2001) present a number of related ideas that lead toresults which appear similar in form to those produced by the techniquesin Section 11.2. However, their reasoning is closely related to truncationso they are included here. Components with homogeneous constraints havetheir coefficients restricted to −1, 0 and +1 as with Hausmann (1982), butinstead of solving a different optimization problem we start with the PCsand approximate them. For a threshold τ H a vector of PC coefficients a kis replaced by a vector a H kwith coefficientsa H kj ={ sign(akj ) if a kj ≥ τ H0 if a kj 0and ∑ pj=1 aC kj = 0. Again a threshold τ C determines
296 11. Rotation and Interpretation of Principal Componentsthe coefficients. We have⎧⎪⎨ −c 1 if a kj ≤−τ Ca C kj = 0 if |a kj |
- Page 276 and 277: 10.1. Detection of Outliers Using P
- Page 278 and 279: 10.1. Detection of Outliers Using P
- Page 280 and 281: 10.2. Influential Observations in a
- Page 282 and 283: 10.2. Influential Observations in a
- Page 284 and 285: 10.2. Influential Observations in a
- Page 286 and 287: 10.2. Influential Observations in a
- Page 288 and 289: 10.2. Influential Observations in a
- Page 290 and 291: 10.3. Sensitivity and Stability 259
- Page 292 and 293: 10.3. Sensitivity and Stability 261
- Page 294 and 295: 10.4. Robust Estimation of Principa
- Page 296 and 297: 10.4. Robust Estimation of Principa
- Page 298 and 299: 10.4. Robust Estimation of Principa
- Page 300 and 301: 11Rotation and Interpretation ofPri
- Page 302 and 303: 11.1. Rotation of Principal Compone
- Page 304 and 305: oot of the corresponding eigenvalue
- Page 306 and 307: 11.1. Rotation of Principal Compone
- Page 308 and 309: 11.1. Rotation of Principal Compone
- Page 310 and 311: 11.2. Alternatives to Rotation 279w
- Page 312 and 313: 11.2. Alternatives to Rotation 281F
- Page 314 and 315: 11.2. Alternatives to Rotation 283F
- Page 316 and 317: 11.2. Alternatives to Rotation 285T
- Page 318 and 319: 11.2. Alternatives to Rotation 287T
- Page 320 and 321: 11.2. Alternatives to Rotation 289A
- Page 322 and 323: 11.2. Alternatives to Rotation 291
- Page 324 and 325: 11.3. Simplified Approximations to
- Page 328 and 329: 11.4. Physical Interpretation of Pr
- Page 330 and 331: 12Principal Component Analysis forT
- Page 332 and 333: 12.1. Introduction 301series is alm
- Page 334 and 335: 12.2. PCA and Atmospheric Time Seri
- Page 336 and 337: 12.2. PCA and Atmospheric Time Seri
- Page 338 and 339: and a typical row of the matrix is1
- Page 340 and 341: 12.2. PCA and Atmospheric Time Seri
- Page 342 and 343: 12.2. PCA and Atmospheric Time Seri
- Page 344 and 345: 12.2. PCA and Atmospheric Time Seri
- Page 346 and 347: 12.2. PCA and Atmospheric Time Seri
- Page 348 and 349: 12.3. Functional PCA 317A key refer
- Page 350 and 351: 12.3. Functional PCA 319The sample
- Page 352 and 353: 12.3. Functional PCA 321speed (mete
- Page 354 and 355: 12.3. Functional PCA 323of the data
- Page 356 and 357: 12.3. Functional PCA 325subject to
- Page 358 and 359: 12.3. Functional PCA 327series than
- Page 360 and 361: 12.4. PCA and Non-Independent Data
- Page 362 and 363: 12.4. PCA and Non-Independent Data
- Page 364 and 365: 12.4. PCA and Non-Independent Data
- Page 366 and 367: 12.4. PCA and Non-Independent Data
- Page 368 and 369: 12.4. PCA and Non-Independent Data
- Page 370 and 371: 13.1. Principal Component Analysis
- Page 372 and 373: 13.1. Principal Component Analysis
- Page 374 and 375: 13.2. Analysis of Size and Shape 34
11.3. Simplified Approximations to <strong>Principal</strong> <strong>Component</strong>s 295are given that show how the optimum threshold changes as sample sizeincreases (it decreases) for unrotated, orthogonally rotated and obliquelyrotated PCs. The precision of the results is indicated by boxplots. Thepaper provides useful guidance on choosing a threshold in the spatial atmosphericscience setting if truncation is to be done. However, it largelyignores the fact stressed above that neither the size of loadings nor thatof correlations are necessarily reliable indicators of how to interpret a PC,and hence that truncation according to a fixed threshold should be usedwith extreme caution.Interpretation can play a rôle in variable selection. Section 6.3 defines anumber of criteria for choosing a subset of variables; these criteria are basedon how well the subset represents the full data set in one sense or another.Often there will be several, perhaps many for large p, subsets that do almostas well as the best in terms of the chosen criterion. To decide between themit may be desirable to select those variables that are most easily measuredor, alternatively, those that can be most easily interpreted. Taking this trainof thought a little further, if PCs are calculated from a subset of variables,it is preferable that the chosen subset gives PCs that are easy to interpret.Al-Kandari and <strong>Jolliffe</strong> (2001) take this consideration into account in astudy that compares the merits of a number of variable selection methodsand also compares criteria for assessing the value of subsets. A much fullerdiscussion is given by Al-Kandari (1998).11.3.1 <strong>Principal</strong> <strong>Component</strong>s with Homogeneous, Contrastand Sparsity ConstraintsChipman and Gu (2001) present a number of related ideas that lead toresults which appear similar in form to those produced by the techniquesin Section 11.2. However, their reasoning is closely related to truncationso they are included here. <strong>Component</strong>s with homogeneous constraints havetheir coefficients restricted to −1, 0 and +1 as with Hausmann (1982), butinstead of solving a different optimization problem we start with the PCsand approximate them. For a threshold τ H a vector of PC coefficients a kis replaced by a vector a H kwith coefficientsa H kj ={ sign(akj ) if a kj ≥ τ H0 if a kj 0and ∑ pj=1 aC kj = 0. Again a threshold τ C determines