Canonical correlation analysis in high dimensions with structured regularization

Canonical correlation analysis (CCA) is a technique for measuring the association between two multivariate data matrices. A regularized modification of canonical correlation analysis (RCCA) that imposes an ℓ2 penalty on the CCA coefficients is widely used in applications with high-dimensional data. One limitation of such regularization is that it ignores any data structure, treating all the features equally, which can be ill-suited for some applications. In this article we introduce several approaches to regularizing CCA that take the underlying data structure into account. In particular, the proposed group regularized canonical correlation analysis (GRCCA) is useful when the variables are correlated in groups. We illustrate some computational strategies to avoid excessive computations with regularized CCA in high dimensions. We demonstrate the application of these methods in our motivating application from neuroscience, as well as in a small simulation example.
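As a rough illustration of the ridge-type regularization described in this abstract, the following is a minimal NumPy sketch of plain RCCA, not the authors' implementation and not the group-structured GRCCA variant. The function name `rcca`, the penalty parameters `lam_x` and `lam_y`, and the simulated data are assumptions made for the example. The sketch forms the full covariance matrices directly, so it does not include the computational shortcuts for high dimensions that the abstract mentions.

```python
import numpy as np

def rcca(X, Y, lam_x, lam_y):
    """Ridge-regularized CCA sketch; returns (correlations, A, B).

    lam_x and lam_y are hypothetical names for the l2 penalty weights.
    """
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n = X.shape[0]
    # The l2 penalty amounts to adding lam * I to each within-set covariance.
    Sxx = X.T @ X / n + lam_x * np.eye(X.shape[1])
    Syy = Y.T @ Y / n + lam_y * np.eye(Y.shape[1])
    Sxy = X.T @ Y / n
    # Whiten with Cholesky factors: M = Lx^{-1} Sxy Ly^{-T}.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    M = np.linalg.solve(Lx, Sxy)
    M = np.linalg.solve(Ly, M.T).T
    # Singular values of M are the regularized canonical correlations.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    # Map the singular vectors back to coefficients on the original features.
    A = np.linalg.solve(Lx.T, U)
    B = np.linalg.solve(Ly.T, Vt.T)
    return s, A, B

# Simulated example with more features than samples (p, q > n).
rng = np.random.default_rng(0)
n, p, q = 20, 50, 30
z = rng.normal(size=(n, 1))  # shared latent signal
X = z @ rng.normal(size=(1, p)) + rng.normal(size=(n, p))
Y = z @ rng.normal(size=(1, q)) + rng.normal(size=(n, q))
s, A, B = rcca(X, Y, lam_x=1.0, lam_y=1.0)
print(s[:3])  # leading regularized canonical correlations
```

Without the penalty, the sample covariance matrices here would be singular because p and q exceed n; adding `lam * I` makes both Cholesky factorizations well defined. Every feature is shrunk by the same amount, which is the "treating all the features equally" limitation the abstract points to.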

Fault Detection of Diesel Engine Air and after-Treatment Systems with High-Dimensional Data: A Novel Fault-Relevant Feature Selection Method

Processes ◽ 10.3390/pr9020259 ◽ 2021 ◽ Vol 9 (2) ◽ pp. 259

Author(s): Qilan Ran ◽ Yedong Song ◽ Wenli Du ◽ Wei Du ◽ Xin Peng

Keyword(s): Genetic Algorithm ◽ Diesel Engine ◽ Fault Detection ◽ Correlation Analysis ◽ Canonical Correlation Analysis ◽ Diesel Engines ◽ Canonical Correlation ◽ High Dimensional Data ◽ High Dimensional ◽ Relevant Variables

To reduce pollutant emissions from diesel vehicles, complex after-treatment technologies have been introduced, which makes fault detection in diesel engines increasingly difficult. This paper therefore proposes a canonical correlation analysis detection method, based on fault-relevant variables selected by an elitist genetic algorithm, for high-dimensional data-driven fault detection of diesel engines. The proposed method builds the fault detection model from actual operating data, overcoming the limitations of traditional methods that rely solely on benchmarks. Canonical correlation analysis extracts the strong correlations between variables and is used to construct the residual vector for detecting faults in the diesel engine air and after-treatment systems. The elitist genetic algorithm optimizes the set of fault-relevant variables to reduce detection redundancy, eliminate additional noise interference, and improve the detection rate for specific faults. Experiments on practical state data from a diesel engine show the feasibility and efficiency of the proposed approach.
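The residual-based detection idea in this abstract can be sketched generically as follows. This is a simplified illustration under stated assumptions, not the paper's method: the genetic-algorithm variable selection is omitted, the data are simulated, the threshold is an empirical quantile of the training statistic, and the names `fit_cca`, `t2_statistic`, and `ridge` are hypothetical.

```python
import numpy as np

def fit_cca(U, Y, ridge=1e-6):
    """Fit CCA on fault-free data; a tiny ridge keeps the Cholesky step stable."""
    mu_u, mu_y = U.mean(axis=0), Y.mean(axis=0)
    Uc, Yc = U - mu_u, Y - mu_y
    n = U.shape[0]
    Suu = Uc.T @ Uc / (n - 1) + ridge * np.eye(U.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + ridge * np.eye(Y.shape[1])
    Suy = Uc.T @ Yc / (n - 1)
    Lu, Ly = np.linalg.cholesky(Suu), np.linalg.cholesky(Syy)
    M = np.linalg.solve(Lu, Suy)
    M = np.linalg.solve(Ly, M.T).T
    P, s, Qt = np.linalg.svd(M, full_matrices=False)
    A = np.linalg.solve(Lu.T, P)     # canonical weights for U
    B = np.linalg.solve(Ly.T, Qt.T)  # canonical weights for Y
    return {"mu_u": mu_u, "mu_y": mu_y, "A": A, "B": B, "s": s}

def t2_statistic(model, U, Y):
    """Per-sample T^2 of the residual r = A^T u - diag(s) B^T y."""
    zu = (U - model["mu_u"]) @ model["A"]
    zy = (Y - model["mu_y"]) @ model["B"]
    r = zu - zy * model["s"]
    # Under normal operation each residual component has variance 1 - s^2.
    var = np.maximum(1.0 - model["s"] ** 2, 1e-12)
    return np.sum(r ** 2 / var, axis=1)

# Simulated process: outputs Y depend linearly on inputs U plus noise.
rng = np.random.default_rng(1)
W = np.array([[1.0, 0.5, 0.0],
              [0.0, 1.0, 0.5],
              [0.5, 0.0, 1.0],
              [0.2, 0.2, 0.2]])

def simulate(n, bias=0.0):
    U = rng.normal(size=(n, 4))
    Y = U @ W + 0.3 * rng.normal(size=(n, 3))
    Y[:, 0] += bias  # additive sensor fault on the first output
    return U, Y

U_train, Y_train = simulate(500)
model = fit_cca(U_train, Y_train)
threshold = np.quantile(t2_statistic(model, U_train, Y_train), 0.99)

U_ok, Y_ok = simulate(500)
U_bad, Y_bad = simulate(500, bias=3.0)
false_alarm = np.mean(t2_statistic(model, U_ok, Y_ok) > threshold)
fault_rate = np.mean(t2_statistic(model, U_bad, Y_bad) > threshold)
print(false_alarm, fault_rate)
```

A fault that breaks the learned correlation between the two variable sets inflates the residual, so the statistic exceeds the threshold. Restricting `U` and `Y` to the variables relevant to a given fault, which the paper does with an elitist genetic algorithm, would shrink the residual dimension and is left out of this sketch.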

Parallel Computing Method of Canonical Correlation Analysis for High-Dimensional Data Streams in Irregular Streams

Journal of Software ◽ 10.3724/sp.j.1001.2012.04008 ◽ 2012 ◽ Vol 23 (5) ◽ pp. 1053-1072 ◽ Cited By ~ 2

Author(s): Yong ZHOU ◽ Xiao-Wei LU ◽ Chun-Tian CHENG

Keyword(s): Parallel Computing ◽ Correlation Analysis ◽ Canonical Correlation Analysis ◽ Data Streams ◽ Canonical Correlation ◽ High Dimensional Data ◽ High Dimensional ◽ Computing Method

Sparse kernel canonical correlation analysis for discovery of nonlinear interactions in high-dimensional data

BMC Bioinformatics ◽ 10.1186/s12859-017-1543-x ◽ 2017 ◽ Vol 18 (1) ◽ Cited By ~ 18

Author(s): Kosuke Yoshida ◽ Junichiro Yoshimoto ◽ Kenji Doya

Keyword(s): Correlation Analysis ◽ Canonical Correlation Analysis ◽ Canonical Correlation ◽ High Dimensional Data ◽ High Dimensional ◽ Nonlinear Interactions ◽ Kernel Canonical Correlation Analysis ◽ Sparse Kernel

Canonical correlation analysis of high-dimensional data with very small sample support

Signal Processing ◽ 10.1016/j.sigpro.2016.05.020 ◽ 2016 ◽ Vol 128 ◽ pp. 449-458 ◽ Cited By ~ 32

Author(s): Yang Song ◽ Peter J. Schreier ◽ David Ramírez ◽ Tanuj Hasija

Keyword(s): Correlation Analysis ◽ Canonical Correlation Analysis ◽ Canonical Correlation ◽ High Dimensional Data ◽ Small Sample ◽ High Dimensional ◽ Sample Support

Kernel functional canonical correlation analysis

Acta Universitatis Lodziensis Folia oeconomica ◽ 10.18778/0208-6018.325.12 ◽ 2017 ◽ Vol 5 (325)

Author(s): Mirosław Krzyśko ◽ Łukasz Waszak

Keyword(s): Correlation Analysis ◽ Canonical Correlation Analysis ◽ Functional Data ◽ Canonical Correlation ◽ Multivariate Data ◽ Research Interest ◽ Canonical Correlations ◽ Canonical Variables ◽ Correlation Methods ◽ The Subject

Canonical correlation methods for data representing functions or curves, known in the literature as functional data (Ramsay and Silverman, 2005), have received much attention in recent years. Examples of functional data can be found in several application domains, such as medicine, economics, meteorology and many others. Unfortunately, canonical correlation methods for multivariate data cannot be applied directly to functional data, because of the dimensionality problem and the difficulty of accounting for the correlation and ordering of functional data. The problem of constructing canonical correlations and canonical variables for functional data was addressed by Leurgans et al. (1993), and further developments were made by Ramsay and Silverman (2005). In this paper we propose a new method of constructing canonical correlations and canonical variables for functional data.
