A Comparative Study of Kernel and Robust Canonical Correlation Analysis

Abstract

Motivation: Recent developments in technology have enabled researchers to collect multiple OMICS datasets for the same individuals. The conventional approach for understanding the relationships between the collected datasets and the complex trait of interest is to analyse each OMICS dataset separately from the rest, or to test for associations between the OMICS datasets. In this work we show that integrating multiple OMICS datasets, instead of analysing them separately, improves both our understanding of the relationships between them and the predictive accuracy for the tested trait. As OMICS datasets are heterogeneous and high-dimensional (p >> n), they can be integrated through Sparse Canonical Correlation Analysis (sCCA), which penalises the canonical variables to produce sparse latent variables while achieving maximal correlation between the datasets. In recent years a number of approaches for implementing sCCA have been proposed, which differ in their objective functions, in the iterative algorithms used to obtain the sparse latent variables, and in the assumptions they make about the original datasets.

Results: Through a comparative study we have explored the performance of the conventional CCA proposed by Parkhomenko et al. [2009], the penalised matrix decomposition CCA proposed by Witten and Tibshirani [2009], and its extension proposed by Suo et al. [2017]. The aforementioned methods were modified to allow for different penalty functions. Although sCCA is an unsupervised learning approach for understanding the relationships between datasets, we have recast the problem as a supervised learning one and investigated how the computed latent variables can be used for predicting complex traits. The approaches were also extended to allow for multiple (more than two) datasets, where the trait was included as one of the input datasets. Both ways have shown improvement over conventional predictive models that include one or multiple datasets.
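The idea of penalising the canonical variables to obtain sparse latent variables can be illustrated with a short sketch. The code below is not taken from the paper: it is a simplified rank-1 iteration in the spirit of the penalised matrix decomposition approach, assuming alternating soft-thresholding of the cross-product matrix X^T Y. The function names (`soft_threshold`, `sparse_cca_rank1`) and the relative threshold parameters `lam_u` and `lam_v` are hypothetical, and the threshold is applied as a fraction of the largest absolute entry rather than through the L1-bound search used in the published methods.

```python
import numpy as np


def soft_threshold(a, lam):
    """Shrink each entry of `a` towards zero by `lam` (the lasso-type penalty step)."""
    return np.sign(a) * np.maximum(np.abs(a) - lam, 0.0)


def _normalise(w):
    """Scale `w` to unit L2 norm; an all-zero vector is returned unchanged."""
    norm = np.linalg.norm(w)
    return w / norm if norm > 0 else w


def sparse_cca_rank1(X, Y, lam_u=0.3, lam_v=0.3, n_iter=100, tol=1e-8):
    """Simplified rank-1 sparse CCA sketch (assumption: not the authors' code).

    X is (n, p), Y is (n, q). lam_u and lam_v lie in [0, 1) and give the
    threshold as a fraction of the largest absolute entry of the current
    update, so at least one coefficient always survives.
    """
    X = X - X.mean(axis=0)  # centre the columns
    Y = Y - Y.mean(axis=0)
    C = X.T @ Y  # (p, q) cross-product matrix

    # Start from the leading right singular vector of C.
    _, _, Vt = np.linalg.svd(C, full_matrices=False)
    v = Vt[0]
    u = np.zeros(X.shape[1])

    for _ in range(n_iter):
        a = C @ v
        u_new = _normalise(soft_threshold(a, lam_u * np.max(np.abs(a))))
        b = C.T @ u_new
        v_new = _normalise(soft_threshold(b, lam_v * np.max(np.abs(b))))
        change = np.linalg.norm(u_new - u) + np.linalg.norm(v_new - v)
        u, v = u_new, v_new
        if change < tol:
            break
    return u, v
```

The latent variables are then `X @ u` and `Y @ v` (after centring), and the non-zero entries of `u` and `v` indicate which features of each dataset contribute to them. Larger values of `lam_u` and `lam_v` give sparser weight vectors.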


A Comparative Study of Deep Neural Network-Aided Canonical Correlation Analysis-Based Process Monitoring and Fault Detection Methods

IEEE Transactions on Neural Networks and Learning Systems ◽ 10.1109/tnnls.2021.3072491 ◽ 2021 ◽ pp. 1-15

Author(s): Zhiwen Chen ◽ Ketian Liang ◽ Steven X. Ding ◽ Chao Yang ◽ Tao Peng ◽ ...

Keyword(s): Neural Network ◽ Fault Detection ◽ Correlation Analysis ◽ Comparative Study ◽ Process Monitoring ◽ Canonical Correlation Analysis ◽ Canonical Correlation ◽ Deep Neural Network ◽ Detection Methods


Analysis of Maximum Expiratory Flow Volume Curves Using Canonical Correlation Analysis

Methods of Information in Medicine ◽ 10.1055/s-0038-1635359 ◽ 1985 ◽ Vol 24 (02) ◽ pp. 91-100 ◽ Cited By ~ 3

Author(s): W. van Pelt ◽ Ph. H. Quanjer ◽ M. E. Wise ◽ E. van der Burg ◽ R. van der Lende

Keyword(s): Correlation Analysis ◽ Canonical Correlation Analysis ◽ Canonical Correlation ◽ Flow Volume ◽ Non Linear ◽ Maximum Expiratory Flow ◽ Expiratory Flow ◽ Relationship Of ◽ The Relationship ◽ Age And Sex

Summary: As part of a population study on chronic lung disease in the Netherlands, an investigation is made of the relationship of both age and sex with indices describing the maximum expiratory flow-volume (MEFV) curve. To determine the relationship, non-linear canonical correlation was used as realized in the computer program CANALS, a combination of ordinary canonical correlation analysis (CCA) and non-linear transformations of the variables. This method enhances the generality of the relationship to be found and has the advantage of showing the relative importance of categories or ranges within a variable with respect to that relationship. The above is exemplified by describing the relationship of age and sex with variables concerning respiratory symptoms and smoking habits. The analysis of age and sex with MEFV curve indices shows that non-linear canonical correlation analysis is an efficient tool in analysing size and shape of the MEFV curve and can be used to derive parameters concerning the whole curve.
