12.07.2015 Views

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

Jolliffe I. Principal Component Analysis (2ed., Springer, 2002)(518s)

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

392 14. Generalizations and Adaptations of <strong>Principal</strong> <strong>Component</strong> <strong>Analysis</strong>14.3 <strong>Principal</strong> <strong>Component</strong>s in the Presence ofSecondary or Instrumental VariablesRao (1964) describes two modifications of PCA that involve what he calls‘instrumental variables.’ These are variables which are of secondary importance,but which may be useful in various ways in examining thevariables that are of primary concern. The term ‘instrumental variable’ isin widespread use in econometrics, but in a rather more restricted context(see, for example, Darnell (1994, pp. 197–200)).Suppose that x is, as usual, a p-element vector of primary variables,and that w is a vector of s secondary, or instrumental, variables. Rao(1964) considers the following two problems, described respectively as ‘principalcomponents of instrumental variables’ and ‘principal components...uncorrelated with instrumental variables’:(i) Find linear functions γ ′ 1w, γ ′ 2w,..., of w that best predict x.(ii) Find linear functions α ′ 1x, α ′ 2x,... with maximum variances that,as well as being uncorrelated with each other, are also uncorrelatedwith w.For (i), Rao (1964) notes that w may contain some or all of the elementsof x, and gives two possible measures of predictive ability, corresponding tothe trace and Euclidean norm criteria discussed with respect to PropertyA5 in Section 2.1. He also mentions the possibility of introducing weightsinto the analysis. The two criteria lead to different solutions to (i), oneof which is more straightforward to derive than the other. There is a superficialresemblance between the current problem and that of canonicalcorrelation analysis, where relationships between two sets of variables arealso investigated (see Section 9.3), but the two situations are easily seen tobe different. However, as noted in Sections 6.3 and 9.3.4, the methodologyof Rao’s (1964) PCA of instrumental variables has reappeared under othernames. In particular, it is equivalent to redundancy analysis (van den Wollenberg,1977) and to one way of fitting a reduced rank regression model(Davies and Tso, 1982).The same technique is derived by Esposito (1998). He projects the matrixX onto the space spanned by W, where X, W are data matrices associatedwith x, w, and then finds principal components of the projected data. Thisleads to an eigenequationS XW S −1WW S WXa k = l k a k ,which is the same as equation (9.3.5). Solving that equation leads to redundancyanalysis. Kazi-Aoual et al. (1995) provide a permutation test,using the test statistic tr(S WX S −1XX S XW) to decide whether there is anyrelationship between the x and w variables.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!