First, we make a linear combination of two groups of variables, namely UA1x1a2x2LAPP' xvb1Y2Lbqyqb' yaa1a2LAP. If B b 1 b2 L bq is an arbitrary non-zero constant coefficient vector, you can get'', var ua' cov x aa' ∑11a var VB' cov bb' ∑ 22b Cov ua' cov x bb' ∑12 b to call u and v typical variables. That is, the problem of A' ∑12b ρ ∑1ab' ∑ 22b canonical correlation analysis is how to choose the optimal linear combination of typical variables.
The selection principle is: among all linear combinations U and V that make U1a ′ x11,choose U and V with the largest typical correlation coefficient, that is, choose A and B with the largest correlation coefficient with V1b ′ y (among all U and V)/. Then choose A and B to make the correlation coefficient between U2A ′ 2x and V2B ′ 2y the largest among the combinations U and V which are not related to U 1 and V 1, and continue until all the linear combinations U which are not related to U 1 are U 1 and V1v2lvp, respectively.
V At this time, P is equal to the rank typical variables U 1 and V 1 of the covariance matrix between variables X and Y, and U 2 and V2 ... u P and V p are extracted from large columns and small pairs according to the correlation coefficient of U P and V P until the correlation between the two groups of variables is decomposed.