Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s) Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
6.2. Choosing m, the Number of Components: Examples 135Table 6.3. First six eigenvalues for the covariance matrix, gas chromatographydata.Component number 1 2 3 4 5 6Eigenvalue, l k 312187 2100 768 336 190 149l k /¯l ∑ 9.88 0.067 0.024 0.011 0.006 0.005mk=1t m = 100k∑ p l 98.8 99.5 99.7 99.8 99.9 99.94k=1 kl k−1 − l k 310087 1332 432 146 51R 0.02 0.43 0.60 0.70 0.83 0.99W 494.98 4.95 1.90 0.92 0.41 0.54the inclusion of five PCs in this example but, in fact, he slightly modifieshis criterion for retaining PCs. His nominal cut-off for including the kthPC is R
136 6. Choosing a Subset of Principal Components or VariablesFigure 6.2. LEV diagram for the covariance matrix: gas chromatography data.logue of the scree diagram. However, although this interpretation may bevalid for the correlation matrices in his simulations, it does not seem tohold for the dominant variance structures exhibited in Tables 6.2 and 6.3.For correlation matrices, and presumably for covariance matrices withless extreme variation among eigenvalues, the ad hoc methods and thecross-validatory criteria are likely to give more similar results. This is illustratedby a simulation study in Krzanowski (1983), where W is comparedwith the first two ad hoc rules with cut-offs at t ∗ = 75% and l ∗ = 1,respectively. Bartlett’s test, described in Section 6.1.4, is also included inthe comparison but, as expected from the earlier discussion, it retains toomany PCs in most circumstances. The behaviour of W compared with thetwo ad hoc rules is the reverse of that observed in the example above. Wretains fewer PCs than the t m > 75% criterion, despite the fairly low cutoffof 75%. Similar numbers of PCs are retained for W and for the rulebased on l k > 1. The latter rule retains more PCs if the cut-off is loweredto 0.7 rather than 1.0, as suggested in Section 6.1.2. It can also be arguedthat the cut-off for W should be reduced below unity (see Section 6.1.5),in which case all three rules will give similar results.
- Page 116 and 117: 5.2. Principal Coordinate Analysis
- Page 118 and 119: 5.2. Principal Coordinate Analysis
- Page 120 and 121: 5.2. Principal Coordinate Analysis
- Page 122 and 123: 5.3. Biplots 91columns, L is an (r
- Page 124 and 125: 5.3. Biplots 93ButandSubstituting i
- Page 126 and 127: 5.3. Biplots 95The vector gi ∗ co
- Page 128 and 129: 5.3. Biplots 97Figure 5.3. Biplot u
- Page 130 and 131: 5.3. Biplots 99Table 5.2. First two
- Page 132 and 133: 5.3. Biplots 101Figure 5.5. Biplot
- Page 134 and 135: 5.4. Correspondence Analysis 103of
- Page 136 and 137: 5.4. Correspondence Analysis 105Fig
- Page 138 and 139: 5.6. Displaying Intrinsically High-
- Page 140 and 141: 5.6. Displaying Intrinsically High-
- Page 142 and 143: 6Choosing a Subset of PrincipalComp
- Page 144 and 145: 6.1. How Many Principal Components?
- Page 146 and 147: 6.1. How Many Principal Components?
- Page 148 and 149: 6.1. How Many Principal Components?
- Page 150 and 151: 6.1. How Many Principal Components?
- Page 152 and 153: 6.1. How Many Principal Components?
- Page 154 and 155: 6.1. How Many Principal Components?
- Page 156 and 157: 6.1. How Many Principal Components?
- Page 158 and 159: 6.1. How Many Principal Components?
- Page 160 and 161: 6.1. How Many Principal Components?
- Page 162 and 163: 6.1. How Many Principal Components?
- Page 164 and 165: 6.2. Choosing m, the Number of Comp
- Page 168 and 169: 6.3. Selecting a Subset of Variable
- Page 170 and 171: 6.3. Selecting a Subset of Variable
- Page 172 and 173: 6.3. Selecting a Subset of Variable
- Page 174 and 175: 6.3. Selecting a Subset of Variable
- Page 176 and 177: 6.4. Examples Illustrating Variable
- Page 178 and 179: 6.4. Examples Illustrating Variable
- Page 180 and 181: 6.4. Examples Illustrating Variable
- Page 182 and 183: 7.1. Models for Factor Analysis 151
- Page 184 and 185: 7.2. Estimation of the Factor Model
- Page 186 and 187: 7.2. Estimation of the Factor Model
- Page 188 and 189: 7.2. Estimation of the Factor Model
- Page 190 and 191: 7.3. Comparisons Between Factor and
- Page 192 and 193: 7.4. An Example of Factor Analysis
- Page 194 and 195: 7.4. An Example of Factor Analysis
- Page 196 and 197: 7.5. Concluding Remarks 165To illus
- Page 198 and 199: 8Principal Components in Regression
- Page 200 and 201: 8.1. Principal Component Regression
- Page 202 and 203: 8.1. Principal Component Regression
- Page 204 and 205: 8.2. Selecting Components in Princi
- Page 206 and 207: 8.2. Selecting Components in Princi
- Page 208 and 209: 8.3. Connections Between PC Regress
- Page 210 and 211: 8.4. Variations on Principal Compon
- Page 212 and 213: 8.4. Variations on Principal Compon
- Page 214 and 215: 8.4. Variations on Principal Compon
6.2. Choosing m, the Number of <strong>Component</strong>s: Examples 135Table 6.3. First six eigenvalues for the covariance matrix, gas chromatographydata.<strong>Component</strong> number 1 2 3 4 5 6Eigenvalue, l k 312187 2100 768 336 190 149l k /¯l ∑ 9.88 0.067 0.024 0.011 0.006 0.005mk=1t m = 100k∑ p l 98.8 99.5 99.7 99.8 99.9 99.94k=1 kl k−1 − l k 310087 1332 432 146 51R 0.02 0.43 0.60 0.70 0.83 0.99W 494.98 4.95 1.90 0.92 0.41 0.54the inclusion of five PCs in this example but, in fact, he slightly modifieshis criterion for retaining PCs. His nominal cut-off for including the kthPC is R