Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s) Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

cda.psych.uiuc.edu
from cda.psych.uiuc.edu More from this publisher
12.07.2015 Views

13.7. PCA in Statistical Process Control 367• One- or two-dimensional plots of PC scores. It was noted in Section10.1 that both the first few and the last few PCs may be usefulfor detecting (different types of) outliers, and plots of both are usedin process control. In the published discussion of Roes and Does(1995), Sullivan et al. (1995) argue that the last few PCs are perhapsmore useful in SPC than the first few, but in their reply tothe discussion Roes and Does disagree. If p is not too large, sucharguments can be overcome by using a scatterplot matrix to displayall two-dimensional plots of PC scores simultaneously. Plots can beenhanced by including equal-probability contours, assuming approximatemultivariate normality, corresponding to warning and actionlimits for those points that fall outside them (Jackson, 1991, Section1.7; Martin et al., 1999).• Hotelling’s T 2 . It was seen in Section 10.1 that this is a special casefor q = p of the statistic d 2 2i in equation (10.1.2). If multivariate normalityis assumed, the distribution of T 2 is known, and control limitscan be set based on that distribution (Jackson, 1991, Section 1.7).• The squared prediction error (SPE). This is none other than thestatistic d 2 1i in equation (10.1.1). It was proposed by Jackson andMudholkar (1979), who constructed control limits based on anapproximation to its distribution. They prefer d 2 1i to d2 2i for computationalreasons and because of its intuitive appeal as a sum ofsquared residuals from the (p − q)-dimensional space defined by thefirst (p − q) PCs. However, Jackson and Hearne (1979) indicate thatthe complement of d 2 2i , in which the sum of squares of the first fewrather than the last few renormalized PCs is calculated, may be usefulin process control when the objective is to look for groups of‘out-of-control’ or outlying observations, rather than single outliers.Their basic statistic is decomposed to give separate information aboutvariation within the sample (group) of potentially outlying observations,and about the difference between the sample mean and someknown standard value. In addition, they propose an alternative statisticbased on absolute, rather than squared, values of PCs. Jacksonand Mudholkar (1979) also extend their proposed control procedure,based on d 2 1i , to the multiple-outlier case, and Jackson (1991, Figure6.2) gives a sequence of significance tests for examining subgroups ofobservations in which each test is based on PCs in some way.Eggett and Pulsipher (1989) compare T 2 , SPE, and the complement ofd 2 2i suggested by Jackson and Hearne (1979), in a simulation study and findthe third of these statistics to be inferior to the other two. On the basis oftheir simulations, they recommend Hotelling’s T 2 for large samples, withSPE or univariate control charts preferred for small samples. They alsodiscussed the possibility of constructing CUSUM charts based on the threestatistics.

368 13. Principal Component Analysis for Special Types of DataThe control limits described so far are all based on the assumption ofapproximate multivariate normality. Martin and Morris (1996) introducea non-parametric procedure that provides warning and action contourson plots of PCs. These contours can be very different from the normalbasedellipses. The idea of the procedure is to generate bootstrap samplesfrom the data set and from each of these calculate the value of a (possiblyvector-valued) statistic of interest. A smooth approximation to theprobability density of this statistic is then constructed using kernel densityestimation, and the required contours are derived from this distribution.Coleman (1985) suggests that when using PCs in quality control, the PCsshould be estimated robustly (see Section 10.4). Sullivan et al. (1995) dothis by omitting some probable outliers, identified from an initial scan ofthe data, before carrying out a PCA.When a variable is used to monitor a process over time, its successivevalues are likely to be correlated unless the spacing between observations islarge. One possibility for taking into account this autocorrelation is to plotan exponentially weighted moving average of the observed values. Wold(1994) suggests that similar ideas should be used when the monitoringvariables are PC scores, and he describes an algorithm for implementing‘exponentially weighted moving principal components analysis.’Data often arise in SPC for which, as well as different variables and differenttimes of measurement, there is a third ‘mode,’ namely different batches.So-called multiway, or three-mode, PCA can then be used (see Section 14.5and Nomikos and MacGregor (1995)). Grimshaw et al. (1998) note thepossible use of multiway PCA simultaneously on both the variables monitoringthe process and the variables measuring inputs or initial conditions,though they prefer a regression-based approach involving modifications ofHotelling’s T 2 and the SPE statistic.Boyles (1996) addresses the situation in which the number of variablesexceeds the number of observations. The sample covariance matrix S isthen singular and Hotelling’s T 2 cannot be calculated. One possibility isto replace S −1 by ∑ rk=1 l−1 k a ka ′ kfor r < n, based on the first r termsin the spectral decomposition of S (the sample version of Property A3 inSection 2.1). However, the data of interest to Boyles (1996) have variablesmeasured at points of a regular lattice on the manufactured product. Thisstructure implies that a simple pattern exists in the population covariancematrix Σ. Using knowledge of this pattern, a positive definite estimate ofΣ can be calculated and used in T 2 in place of S. Boyles finds appropriateestimates for three different regular lattices.Lane et al. (2001) consider the case where a several products or processesare monitored simultaneously. They apply Flury’s common PC subspacemodel (Section 13.5) to this situation. McCabe (1986) suggests the useof principal variables (see Section 6.3) to replace principal components inquality control.

13.7. PCA in Statistical Process Control 367• One- or two-dimensional plots of PC scores. It was noted in Section10.1 that both the first few and the last few PCs may be usefulfor detecting (different types of) outliers, and plots of both are usedin process control. In the published discussion of Roes and Does(1995), Sullivan et al. (1995) argue that the last few PCs are perhapsmore useful in SPC than the first few, but in their reply tothe discussion Roes and Does disagree. If p is not too large, sucharguments can be overcome by using a scatterplot matrix to displayall two-dimensional plots of PC scores simultaneously. Plots can beenhanced by including equal-probability contours, assuming approximatemultivariate normality, corresponding to warning and actionlimits for those points that fall outside them (Jackson, 1991, Section1.7; Martin et al., 1999).• Hotelling’s T 2 . It was seen in Section 10.1 that this is a special casefor q = p of the statistic d 2 2i in equation (10.1.2). If multivariate normalityis assumed, the distribution of T 2 is known, and control limitscan be set based on that distribution (Jackson, 1991, Section 1.7).• The squared prediction error (SPE). This is none other than thestatistic d 2 1i in equation (10.1.1). It was proposed by Jackson andMudholkar (1979), who constructed control limits based on anapproximation to its distribution. They prefer d 2 1i to d2 2i for computationalreasons and because of its intuitive appeal as a sum ofsquared residuals from the (p − q)-dimensional space defined by thefirst (p − q) PCs. However, Jackson and Hearne (1979) indicate thatthe complement of d 2 2i , in which the sum of squares of the first fewrather than the last few renormalized PCs is calculated, may be usefulin process control when the objective is to look for groups of‘out-of-control’ or outlying observations, rather than single outliers.Their basic statistic is decomposed to give separate information aboutvariation within the sample (group) of potentially outlying observations,and about the difference between the sample mean and someknown standard value. In addition, they propose an alternative statisticbased on absolute, rather than squared, values of PCs. Jacksonand Mudholkar (1979) also extend their proposed control procedure,based on d 2 1i , to the multiple-outlier case, and Jackson (1991, Figure6.2) gives a sequence of significance tests for examining subgroups ofobservations in which each test is based on PCs in some way.Eggett and Pulsipher (1989) compare T 2 , SPE, and the complement ofd 2 2i suggested by Jackson and Hearne (1979), in a simulation study and findthe third of these statistics to be inferior to the other two. On the basis oftheir simulations, they recommend Hotelling’s T 2 for large samples, withSPE or univariate control charts preferred for small samples. They alsodiscussed the possibility of constructing CUSUM charts based on the threestatistics.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!