Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s) Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
13.7. PCA in Statistical Process Control 367• One- or two-dimensional plots of PC scores. It was noted in Section10.1 that both the first few and the last few PCs may be usefulfor detecting (different types of) outliers, and plots of both are usedin process control. In the published discussion of Roes and Does(1995), Sullivan et al. (1995) argue that the last few PCs are perhapsmore useful in SPC than the first few, but in their reply tothe discussion Roes and Does disagree. If p is not too large, sucharguments can be overcome by using a scatterplot matrix to displayall two-dimensional plots of PC scores simultaneously. Plots can beenhanced by including equal-probability contours, assuming approximatemultivariate normality, corresponding to warning and actionlimits for those points that fall outside them (Jackson, 1991, Section1.7; Martin et al., 1999).• Hotelling’s T 2 . It was seen in Section 10.1 that this is a special casefor q = p of the statistic d 2 2i in equation (10.1.2). If multivariate normalityis assumed, the distribution of T 2 is known, and control limitscan be set based on that distribution (Jackson, 1991, Section 1.7).• The squared prediction error (SPE). This is none other than thestatistic d 2 1i in equation (10.1.1). It was proposed by Jackson andMudholkar (1979), who constructed control limits based on anapproximation to its distribution. They prefer d 2 1i to d2 2i for computationalreasons and because of its intuitive appeal as a sum ofsquared residuals from the (p − q)-dimensional space defined by thefirst (p − q) PCs. However, Jackson and Hearne (1979) indicate thatthe complement of d 2 2i , in which the sum of squares of the first fewrather than the last few renormalized PCs is calculated, may be usefulin process control when the objective is to look for groups of‘out-of-control’ or outlying observations, rather than single outliers.Their basic statistic is decomposed to give separate information aboutvariation within the sample (group) of potentially outlying observations,and about the difference between the sample mean and someknown standard value. In addition, they propose an alternative statisticbased on absolute, rather than squared, values of PCs. Jacksonand Mudholkar (1979) also extend their proposed control procedure,based on d 2 1i , to the multiple-outlier case, and Jackson (1991, Figure6.2) gives a sequence of significance tests for examining subgroups ofobservations in which each test is based on PCs in some way.Eggett and Pulsipher (1989) compare T 2 , SPE, and the complement ofd 2 2i suggested by Jackson and Hearne (1979), in a simulation study and findthe third of these statistics to be inferior to the other two. On the basis oftheir simulations, they recommend Hotelling’s T 2 for large samples, withSPE or univariate control charts preferred for small samples. They alsodiscussed the possibility of constructing CUSUM charts based on the threestatistics.
368 13. Principal Component Analysis for Special Types of DataThe control limits described so far are all based on the assumption ofapproximate multivariate normality. Martin and Morris (1996) introducea non-parametric procedure that provides warning and action contourson plots of PCs. These contours can be very different from the normalbasedellipses. The idea of the procedure is to generate bootstrap samplesfrom the data set and from each of these calculate the value of a (possiblyvector-valued) statistic of interest. A smooth approximation to theprobability density of this statistic is then constructed using kernel densityestimation, and the required contours are derived from this distribution.Coleman (1985) suggests that when using PCs in quality control, the PCsshould be estimated robustly (see Section 10.4). Sullivan et al. (1995) dothis by omitting some probable outliers, identified from an initial scan ofthe data, before carrying out a PCA.When a variable is used to monitor a process over time, its successivevalues are likely to be correlated unless the spacing between observations islarge. One possibility for taking into account this autocorrelation is to plotan exponentially weighted moving average of the observed values. Wold(1994) suggests that similar ideas should be used when the monitoringvariables are PC scores, and he describes an algorithm for implementing‘exponentially weighted moving principal components analysis.’Data often arise in SPC for which, as well as different variables and differenttimes of measurement, there is a third ‘mode,’ namely different batches.So-called multiway, or three-mode, PCA can then be used (see Section 14.5and Nomikos and MacGregor (1995)). Grimshaw et al. (1998) note thepossible use of multiway PCA simultaneously on both the variables monitoringthe process and the variables measuring inputs or initial conditions,though they prefer a regression-based approach involving modifications ofHotelling’s T 2 and the SPE statistic.Boyles (1996) addresses the situation in which the number of variablesexceeds the number of observations. The sample covariance matrix S isthen singular and Hotelling’s T 2 cannot be calculated. One possibility isto replace S −1 by ∑ rk=1 l−1 k a ka ′ kfor r < n, based on the first r termsin the spectral decomposition of S (the sample version of Property A3 inSection 2.1). However, the data of interest to Boyles (1996) have variablesmeasured at points of a regular lattice on the manufactured product. Thisstructure implies that a simple pattern exists in the population covariancematrix Σ. Using knowledge of this pattern, a positive definite estimate ofΣ can be calculated and used in T 2 in place of S. Boyles finds appropriateestimates for three different regular lattices.Lane et al. (2001) consider the case where a several products or processesare monitored simultaneously. They apply Flury’s common PC subspacemodel (Section 13.5) to this situation. McCabe (1986) suggests the useof principal variables (see Section 6.3) to replace principal components inquality control.
- Page 348 and 349: 12.3. Functional PCA 317A key refer
- Page 350 and 351: 12.3. Functional PCA 319The sample
- Page 352 and 353: 12.3. Functional PCA 321speed (mete
- Page 354 and 355: 12.3. Functional PCA 323of the data
- Page 356 and 357: 12.3. Functional PCA 325subject to
- Page 358 and 359: 12.3. Functional PCA 327series than
- Page 360 and 361: 12.4. PCA and Non-Independent Data
- Page 362 and 363: 12.4. PCA and Non-Independent Data
- Page 364 and 365: 12.4. PCA and Non-Independent Data
- Page 366 and 367: 12.4. PCA and Non-Independent Data
- Page 368 and 369: 12.4. PCA and Non-Independent Data
- Page 370 and 371: 13.1. Principal Component Analysis
- Page 372 and 373: 13.1. Principal Component Analysis
- Page 374 and 375: 13.2. Analysis of Size and Shape 34
- Page 376 and 377: 13.2. Analysis of Size and Shape 34
- Page 378 and 379: 13.3. Principal Component Analysis
- Page 380 and 381: 13.3. Principal Component Analysis
- Page 382 and 383: 13.4. Principal Component Analysis
- Page 384 and 385: 13.4. Principal Component Analysis
- Page 386 and 387: 13.5. Common Principal Components 3
- Page 388 and 389: 13.5. Common Principal Components 3
- Page 390 and 391: 13.5. Common Principal Components 3
- Page 392 and 393: 13.5. Common Principal Components 3
- Page 394 and 395: 13.6. Principal Component Analysis
- Page 396 and 397: 13.6. Principal Component Analysis
- Page 400 and 401: 13.8. Some Other Types of Data 369A
- Page 402 and 403: 13.8. Some Other Types of Data 371d
- Page 404 and 405: 14Generalizations and Adaptations o
- Page 406 and 407: 14.1. Non-Linear Extensions of Prin
- Page 408 and 409: 14.1. Additive Principal Components
- Page 410 and 411: 14.1. Additive Principal Components
- Page 412 and 413: 14.1. Additive Principal Components
- Page 414 and 415: 14.2. Weights, Metrics, Transformat
- Page 416 and 417: 14.2. Weights, Metrics, Transformat
- Page 418 and 419: 14.2. Weights, Metrics, Transformat
- Page 420 and 421: 14.2. Weights, Metrics, Transformat
- Page 422 and 423: 14.2. Weights, Metrics, Transformat
- Page 424 and 425: 14.3. PCs in the Presence of Second
- Page 426 and 427: 14.4. PCA for Non-Normal Distributi
- Page 428 and 429: 14.5. Three-Mode, Multiway and Mult
- Page 430 and 431: 14.5. Three-Mode, Multiway and Mult
- Page 432 and 433: 14.6. Miscellanea 401• Linear App
- Page 434 and 435: 14.6. Miscellanea 40314.6.3 Regress
- Page 436 and 437: 14.7. Concluding Remarks 405space o
- Page 438 and 439: Appendix AComputation of Principal
- Page 440 and 441: A.1. Numerical Calculation of Princ
- Page 442 and 443: A.1. Numerical Calculation of Princ
- Page 444 and 445: A.1. Numerical Calculation of Princ
- Page 446 and 447: ReferencesAguilera, A.M., Gutiérre
13.7. PCA in Statistical Process Control 367• One- or two-dimensional plots of PC scores. It was noted in Section10.1 that both the first few and the last few PCs may be usefulfor detecting (different types of) outliers, and plots of both are usedin process control. In the published discussion of Roes and Does(1995), Sullivan et al. (1995) argue that the last few PCs are perhapsmore useful in SPC than the first few, but in their reply tothe discussion Roes and Does disagree. If p is not too large, sucharguments can be overcome by using a scatterplot matrix to displayall two-dimensional plots of PC scores simultaneously. Plots can beenhanced by including equal-probability contours, assuming approximatemultivariate normality, corresponding to warning and actionlimits for those points that fall outside them (Jackson, 1991, Section1.7; Martin et al., 1999).• Hotelling’s T 2 . It was seen in Section 10.1 that this is a special casefor q = p of the statistic d 2 2i in equation (10.1.2). If multivariate normalityis assumed, the distribution of T 2 is known, and control limitscan be set based on that distribution (Jackson, 1991, Section 1.7).• The squared prediction error (SPE). This is none other than thestatistic d 2 1i in equation (10.1.1). It was proposed by Jackson andMudholkar (1979), who constructed control limits based on anapproximation to its distribution. They prefer d 2 1i to d2 2i for computationalreasons and because of its intuitive appeal as a sum ofsquared residuals from the (p − q)-dimensional space defined by thefirst (p − q) PCs. However, Jackson and Hearne (1979) indicate thatthe complement of d 2 2i , in which the sum of squares of the first fewrather than the last few renormalized PCs is calculated, may be usefulin process control when the objective is to look for groups of‘out-of-control’ or outlying observations, rather than single outliers.Their basic statistic is decomposed to give separate information aboutvariation within the sample (group) of potentially outlying observations,and about the difference between the sample mean and someknown standard value. In addition, they propose an alternative statisticbased on absolute, rather than squared, values of PCs. Jacksonand Mudholkar (1979) also extend their proposed control procedure,based on d 2 1i , to the multiple-outlier case, and Jackson (1991, Figure6.2) gives a sequence of significance tests for examining subgroups ofobservations in which each test is based on PCs in some way.Eggett and Pulsipher (1989) compare T 2 , SPE, and the complement ofd 2 2i suggested by Jackson and Hearne (1979), in a simulation study and findthe third of these statistics to be inferior to the other two. On the basis oftheir simulations, they recommend Hotelling’s T 2 for large samples, withSPE or univariate control charts preferred for small samples. They alsodiscussed the possibility of constructing CUSUM charts based on the threestatistics.