Canonical correlation

The correlation between linear functions of two sets of random variables determined by the maximization of this correlation subject to certain constraints. In the theory of canonical correlations the random variables $X_{1} \dots X_{s}$ and $X_{s + 1} \dots X_{s + t}$ , $s < t$ , are linearly transformed into the so-called canonical random variables $Y_{1} \dots Y_{s}$ and $Y_{s + 1} \dots Y_{s + t}$ so that a) all $Y$ have mathematical expectation zero and variance one; b) within each of the two sets the variables $Y$ are uncorrelated; c) each $Y$ in the first set is correlated with just one $Y$ of the second set; d) the non-zero correlation coefficients between $Y$ ' s of different sets have (consecutively) maximum values, subject to the requirement of zero correlation with previous $Y$ ' s.

In the particular case $s = 1$ , the canonical correlation is the multiple correlation between $X_{1}$ and $X_{2} \dots X_{1 + t}$ . The transformation to the canonical random variables corresponds to the algebraic problem of reducing a quadric to canonical form. In multivariate statistical analysis the method of canonical correlations is used, in the study of interdependence of two sets of components of a vector of observations, to realize a transition to a new coordinate system in which the correlation between $X_{1} \dots X_{s}$ and $X_{s + 1} \dots X_{s + t}$ becomes transparently clear. As a result of the analysis of canonical correlations it may turn out that the interdependence between the two sets is completely described by the correlation between several canonical random variables.

References[edit]

[1]	H. Hotelling, "Relations between two sets of variates" Biometrika , 28 (1936) pp. 321–377
[2]	T.W. Anderson, "Introduction to multivariate statistical analysis" , Wiley (1958)
[3]	J.K. Ord, "The advanced theory of statistics" , 3 , Griffin (1983)