Perturbation-Based Eigenvector Updates for On-Line Principal Components Analysis and Canonical Correlation Analysis

Author(s):  
Anant Hegde ◽  
Jose C. Principe ◽  
Deniz Erdogmus ◽  
Umut Ozertem ◽  
Yadunandana N. Rao ◽  
...  
1983 ◽  
Vol 61 (6) ◽  
pp. 1637-1646 ◽  
Author(s):  
J. W. Sheard ◽  
Dorothy W. Geale

Vegetation–environment relationships are defined with the aid of principal-components analysis and canonical correlation analysis. In both the uplands and lowlands a moisture gradient, determined by measuring gravimetric moisture and indicated by organic carbon, is the most important environmental influence on the vegetation. In the uplands this gradient is also associated with snow depth (drifting) and in the lowlands with conductivity. The second environmental gradient in the uplands is associated with depth to permafrost and its soil textural correlates. Thus soil texture, independent of its effect on soil moisture status, influences the distribution of plant communities. In the lowlands the second environmental gradient is less clear but is also associated with depth to permafrost and, in addition, elevation and CaCO3 equivalent. Canonical correlation analysis shows that the components extracted by principal-components analysis of the vegetation data did not conform to the important trends of variation in the environmental data. Principal-components analysis is nevertheless an essential means of data reduction prior to the application of canonical correlation. The statistical model used in the study has potential advantages over the independent use of either principal-components analysis or canonical correlation.


1997 ◽  
Vol 25 ◽  
pp. 347-352 ◽  
Author(s):  
Chris Derksen ◽  
Kkevin Misurak ◽  
Ellsworth Ledrew ◽  
Joe Piwowar ◽  
Barry Goodison

The stochastic relationships between terrestrial snow water equivalent (SWE) and measures of the atmospheric circulation were investigated for the Canadian Prairies and the American Great Plains for the winter of 1988. Snow-cover extent, derived from EASE-grid SSM/I satellite data, and griddcd atmospheric data from the National Meteorological Center were averaged at five day intervals. Principal components analysis (PCA) were performed for the time series of SSM/I snow-cover imagery as well as for 700 mb geopotential height and temperature, 500 mb height and 700–500 mb thickness. Canonical correlation analysis of the derived principal component weights was used to identify relationships between atmospheric variables and SWE. Results of the PCA indicate that a high degree of variance in upper air variables (>75%) can be explained by the first three principal components, while the first three SWE components account for over 90% of the variance in the original data. Results of the canonical correlation analysis show positive relationships between snow-cover accumulation and a meridional pressure distribution pattern, while snow ablation is linked to a zonal atmospheric pressure pattern.


2018 ◽  
Author(s):  
Brielin C Brown ◽  
Nicolas L. Bray ◽  
Lior Pachter

AbstractPopulation structure in genotype data has been extensively studied, and is revealed by looking at the principal components of the genotype matrix. However, no similar analysis of population structure in gene expression data has been conducted, in part because a naïve principal components analysis of the gene expression matrix does not cluster by population. We identify a linear projection that reveals population structure in gene expression data. Our approach relies on the coupling of the principal components of genotype to the principal components of gene expression via canonical correlation analysis. Futhermore, we analyze the variance of each gene within the projection matrix to determine which genes significantly influence the projection. We identify thousands of significant genes, and show that a number of the top genes have been implicated in diseases that disproportionately impact African Americans.Author SummaryHigh dimensional, multi-modal genomics datasets are becoming increasingly common, which warrants investigation into analysis techniques that can reveal structure in the data without over-fitting. Here, we show that the coupling of principal component analysis to canonical correlation analysis offers an efficient approach to exploratory analysis of this kind of data. We apply this method to the GEUVADIS dataset of genotype and gene expression values of European and Yoruban individuals, finding as-of-yet unstudied population structure in the gene expression values. Moreover, many of the top genes identified by our method have been previously implicated in diseases that disproportionately impact African Americans.


1997 ◽  
Vol 25 ◽  
pp. 347-352 ◽  
Author(s):  
Chris Derksen ◽  
Kkevin Misurak ◽  
Ellsworth Ledrew ◽  
Joe Piwowar ◽  
Barry Goodison

The stochastic relationships between terrestrial snow water equivalent (SWE) and measures of the atmospheric circulation were investigated for the Canadian Prairies and the American Great Plains for the winter of 1988. Snow-cover extent, derived from EASE-grid SSM/I satellite data, and griddcd atmospheric data from the National Meteorological Center were averaged at five day intervals. Principal components analysis (PCA) were performed for the time series of SSM/I snow-cover imagery as well as for 700 mb geopotential height and temperature, 500 mb height and 700–500 mb thickness. Canonical correlation analysis of the derived principal component weights was used to identify relationships between atmospheric variables and SWE. Results of the PCA indicate that a high degree of variance in upper air variables (>75%) can be explained by the first three principal components, while the first three SWE components account for over 90% of the variance in the original data. Results of the canonical correlation analysis show positive relationships between snow-cover accumulation and a meridional pressure distribution pattern, while snow ablation is linked to a zonal atmospheric pressure pattern.


Sign in / Sign up

Export Citation Format

Share Document