Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s) Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)
12.2. PCA and Atmospheric Time Series 315April than in June. Similarly, the joint distribution for April and August,which have a four-month separation, may be the same in different years,but different from the joint distribution for June and October, even thoughthe latter are also separated by four months. Such behaviour is known ascyclo-stationarity, and both PCA and POP analysis have been modifiedfor such data.The modification is easier to explain for POP analysis than for PCA.Suppose that τ is the length of the cycle; for example τ = 12 for monthlydata and an annual cycle. Then for a time series of length n = n ′ τ equation(12.2.1) is replaced by(x (sτ+t+1) − µ (t+1) )=Υ t (x (sτ+t) − µ t )+ɛ (sτ+t) , (12.2.3)t =0, 1,...,(τ − 2); s =0, 1, 2,...,(n ′ − 1),with (t + 1) replaced by 0, and s replaced by (s + 1) on the left-hand sideof (12.2.3) when t =(τ − 1) and s =0, 1, 2,...,(n ′ − 2). Here the mean µ tand the matrix Υ t can vary within cycles, but not between cycles. CyclostationaryPOP analysis estimates Υ 0 , Υ 1 ,...,Υ (τ−1) and is based on aneigenanalysis of the product of those estimates ˆΥ 0 ˆΥ1 ... ˆΥ (τ−1) .The cyclo-stationary variety of PCA is summarized by Kim and Wu(1999) but is less transparent in its justification than cyclo-stationaryPOP analysis. First, vectors a 0t , a 1t ,...,a (τ−1)t are found such that x t =∑ τ−1j=0 a jte 2πijtτ, and a new vector of variables is then constructed byconcatenating a 0t , a 1t ,...,a (τ−1)t . Cyclo-stationary EOFs (CSEOFs) areobtained as eigenvectors of the covariance matrix formed from this vectorof variables. Kim and Wu (1999) give examples of the technique, and alsogive references explaining how to calculate CSEOFs.Kim and Wu (1999) describe an additional modification of PCA thatdeals with periodicity in a time series, and which they call a periodicallyextended EOF technique. It works by dividing a series of length n = n ′ τinto n ′ blocks of length τ. A covariance matrix, S EX , is then computed as⎡⎢⎣S EX11 S EX12 ··· S EX1n ′S EX21 S EX22 ··· S EX2n ′. ..S EXn ′ 1.S EXn ′ 2 ··· S EXn ′ n ′in which the (i, j)th element of S EXklis the sample covariance betweenthe measurements at the ith location at time k in each block and at thejth location at time l in the same block. The description of the techniquein Kim and Wu (1999) is sketchy, but it appears that whereas in ordinaryPCA, covariances are calculated by averaging over all time points, here theaveraging is done over times at the same point within each block acrossblocks. A similar averaging is implicit in cyclo-stationary POP analysis.Examples of periodically extended EOFs are given by Kim and Wu (1999).⎤⎥⎦ ,
316 12. PCA for Time Series and Other Non-Independent Data12.2.6 Examples and ComparisonsRelatively few examples have been given in this section, due in part to theircomplexity and their rather specialized nature. Several of the techniquesdescribed claim to be able to detect stationary and propagating wavesin the standard type of spatio-temporal meteorological data. Each of themethods has proponents who give nice examples, both real and artificial, inwhich their techniques appear to work well. However, as with many otheraspects of PCA and related procedures, caution is needed, lest enthusiasmleads to ‘over-interpretation.’ With this caveat firmly in mind, examples ofSSA can be found in Elsner and Tsonis (1996), Vautard (1995); MSSA inPlaut and Vautard (1994), Mote et al. (2000); POPs in Kooperberg andO’Sullivan (1996) with cyclo-stationary POPs in addition in von Storchand Zwiers (1999, Chapter 15); Hilbert EOFs in Horel (1984), Cai andBaines (2001); MTM-SVD in Mann and Park (1999); cyclo-stationary andperiodically extended EOFs in Kim and Wu (1999). This last paper compareseight techniques (PCA, PCA plus rotation, extended EOFs (MSSA),Hilbert EOFs, cyclo-stationary EOFs, periodically extended EOFs, POPsand cyclo-stationary POPs) on artificial data sets with stationary patterns,with patterns that are stationary in space but which have amplitudeschanging in time, and with patterns that have periodic oscillations in space.The results largely confirm what might be expected. The procedures designedto find oscillatory patterns do not perform well for stationary data,the converse holds when oscillations are present, and those techniquesdevised for cyclo-stationary data do best on such data.12.3 Functional PCAThere are many circumstances when the data are curves. A field in whichsuch data are common is that of chemical spectroscopy (see, for example,Krzanowski et al. (1995), Mertens (1998)). Other examples includethe trace on a continuously recording meteorological instrument such as abarograph, or the trajectory of a diving seal. Other data, although measuredat discrete intervals, have an underlying continuous functional form.Examples include the height of a child at various ages, the annual cycle oftemperature recorded as monthly means, or the speed of an athlete duringa race.The basic ideas of PCA carry over to this continuous (functional) case,but the details are different. In Section 12.3.1 we describe the generalset-up in functional PCA (FPCA), and discuss methods for estimating functionalPCs in Section 12.3.2. Section 12.3.3 presents an example. Finally,Section 12.3.4 briefly covers some additional topics including curve registration,bivariate FPCA, smoothing, principal differential analysis, prediction,discrimination, rotation, density estimation and robust FPCA.
- Page 296 and 297: 10.4. Robust Estimation of Principa
- Page 298 and 299: 10.4. Robust Estimation of Principa
- Page 300 and 301: 11Rotation and Interpretation ofPri
- Page 302 and 303: 11.1. Rotation of Principal Compone
- Page 304 and 305: oot of the corresponding eigenvalue
- Page 306 and 307: 11.1. Rotation of Principal Compone
- Page 308 and 309: 11.1. Rotation of Principal Compone
- Page 310 and 311: 11.2. Alternatives to Rotation 279w
- Page 312 and 313: 11.2. Alternatives to Rotation 281F
- Page 314 and 315: 11.2. Alternatives to Rotation 283F
- Page 316 and 317: 11.2. Alternatives to Rotation 285T
- Page 318 and 319: 11.2. Alternatives to Rotation 287T
- Page 320 and 321: 11.2. Alternatives to Rotation 289A
- Page 322 and 323: 11.2. Alternatives to Rotation 291
- Page 324 and 325: 11.3. Simplified Approximations to
- Page 326 and 327: 11.3. Simplified Approximations to
- Page 328 and 329: 11.4. Physical Interpretation of Pr
- Page 330 and 331: 12Principal Component Analysis forT
- Page 332 and 333: 12.1. Introduction 301series is alm
- Page 334 and 335: 12.2. PCA and Atmospheric Time Seri
- Page 336 and 337: 12.2. PCA and Atmospheric Time Seri
- Page 338 and 339: and a typical row of the matrix is1
- Page 340 and 341: 12.2. PCA and Atmospheric Time Seri
- Page 342 and 343: 12.2. PCA and Atmospheric Time Seri
- Page 344 and 345: 12.2. PCA and Atmospheric Time Seri
- Page 348 and 349: 12.3. Functional PCA 317A key refer
- Page 350 and 351: 12.3. Functional PCA 319The sample
- Page 352 and 353: 12.3. Functional PCA 321speed (mete
- Page 354 and 355: 12.3. Functional PCA 323of the data
- Page 356 and 357: 12.3. Functional PCA 325subject to
- Page 358 and 359: 12.3. Functional PCA 327series than
- Page 360 and 361: 12.4. PCA and Non-Independent Data
- Page 362 and 363: 12.4. PCA and Non-Independent Data
- Page 364 and 365: 12.4. PCA and Non-Independent Data
- Page 366 and 367: 12.4. PCA and Non-Independent Data
- Page 368 and 369: 12.4. PCA and Non-Independent Data
- Page 370 and 371: 13.1. Principal Component Analysis
- Page 372 and 373: 13.1. Principal Component Analysis
- Page 374 and 375: 13.2. Analysis of Size and Shape 34
- Page 376 and 377: 13.2. Analysis of Size and Shape 34
- Page 378 and 379: 13.3. Principal Component Analysis
- Page 380 and 381: 13.3. Principal Component Analysis
- Page 382 and 383: 13.4. Principal Component Analysis
- Page 384 and 385: 13.4. Principal Component Analysis
- Page 386 and 387: 13.5. Common Principal Components 3
- Page 388 and 389: 13.5. Common Principal Components 3
- Page 390 and 391: 13.5. Common Principal Components 3
- Page 392 and 393: 13.5. Common Principal Components 3
- Page 394 and 395: 13.6. Principal Component Analysis
12.2. PCA and Atmospheric Time Series 315April than in June. Similarly, the joint distribution for April and August,which have a four-month separation, may be the same in different years,but different from the joint distribution for June and October, even thoughthe latter are also separated by four months. Such behaviour is known ascyclo-stationarity, and both PCA and POP analysis have been modifiedfor such data.The modification is easier to explain for POP analysis than for PCA.Suppose that τ is the length of the cycle; for example τ = 12 for monthlydata and an annual cycle. Then for a time series of length n = n ′ τ equation(12.2.1) is replaced by(x (sτ+t+1) − µ (t+1) )=Υ t (x (sτ+t) − µ t )+ɛ (sτ+t) , (12.2.3)t =0, 1,...,(τ − 2); s =0, 1, 2,...,(n ′ − 1),with (t + 1) replaced by 0, and s replaced by (s + 1) on the left-hand sideof (12.2.3) when t =(τ − 1) and s =0, 1, 2,...,(n ′ − 2). Here the mean µ tand the matrix Υ t can vary within cycles, but not between cycles. CyclostationaryPOP analysis estimates Υ 0 , Υ 1 ,...,Υ (τ−1) and is based on aneigenanalysis of the product of those estimates ˆΥ 0 ˆΥ1 ... ˆΥ (τ−1) .The cyclo-stationary variety of PCA is summarized by Kim and Wu(1999) but is less transparent in its justification than cyclo-stationaryPOP analysis. First, vectors a 0t , a 1t ,...,a (τ−1)t are found such that x t =∑ τ−1j=0 a jte 2πijtτ, and a new vector of variables is then constructed byconcatenating a 0t , a 1t ,...,a (τ−1)t . Cyclo-stationary EOFs (CSEOFs) areobtained as eigenvectors of the covariance matrix formed from this vectorof variables. Kim and Wu (1999) give examples of the technique, and alsogive references explaining how to calculate CSEOFs.Kim and Wu (1999) describe an additional modification of PCA thatdeals with periodicity in a time series, and which they call a periodicallyextended EOF technique. It works by dividing a series of length n = n ′ τinto n ′ blocks of length τ. A covariance matrix, S EX , is then computed as⎡⎢⎣S EX11 S EX12 ··· S EX1n ′S EX21 S EX22 ··· S EX2n ′. ..S EXn ′ 1.S EXn ′ 2 ··· S EXn ′ n ′in which the (i, j)th element of S EXklis the sample covariance betweenthe measurements at the ith location at time k in each block and at thejth location at time l in the same block. The description of the techniquein Kim and Wu (1999) is sketchy, but it appears that whereas in ordinaryPCA, covariances are calculated by averaging over all time points, here theaveraging is done over times at the same point within each block acrossblocks. A similar averaging is implicit in cyclo-stationary POP analysis.Examples of periodically extended EOFs are given by Kim and Wu (1999).⎤⎥⎦ ,