From Encyclopedia of Mathematics - Reading time: 1 min
The correlation between linear functions of two sets of random variables determined by the maximization of this correlation subject to certain constraints. In the theory of canonical correlations the random variables
and ,
,
are linearly transformed into the so-called canonical random variables
and
so that a) all
have mathematical expectation zero and variance one; b) within each of the two sets the variables
are uncorrelated; c) each
in the first set is correlated with just one
of the second set; d) the non-zero correlation coefficients between '
s of different sets have (consecutively) maximum values, subject to the requirement of zero correlation with previous '
s.
In the particular case ,
the canonical correlation is the multiple correlation between
and .
The transformation to the canonical random variables corresponds to the algebraic problem of reducing a quadric to canonical form. In multivariate statistical analysis the method of canonical correlations is used, in the study of interdependence of two sets of components of a vector of observations, to realize a transition to a new coordinate system in which the correlation between
and
becomes transparently clear. As a result of the analysis of canonical correlations it may turn out that the interdependence between the two sets is completely described by the correlation between several canonical random variables.
References[edit]
[1] | H. Hotelling, "Relations between two sets of variates" Biometrika , 28 (1936) pp. 321–377 |
[2] | T.W. Anderson, "Introduction to multivariate statistical analysis" , Wiley (1958) |
[3] | J.K. Ord, "The advanced theory of statistics" , 3 , Griffin (1983) |