A Hybrid Feature Extraction Selection Approach for High-Dimensional Non-Gaussian Data Clustering

Projection pursuit is a classical exploratory data analysis method to detect interesting low-dimensional structures in multivariate data. Originally, projection pursuit was applied mostly to data of moderately low dimension. Motivated by contemporary applications, we here study its properties in high-dimensional settings. Specifically, we analyze the asymptotic properties of projection pursuit on structureless multivariate Gaussian data with an identity covariance, as both dimension p and sample size n tend to infinity, with p/n→γ∈[0,∞]. Our main results are that (i) if γ=∞, then there exist projections whose corresponding empirical cumulative distribution function can approximate any arbitrary distribution; and (ii) if γ∈(0,∞), not all limiting distributions are possible. However, depending on the value of γ, various non-Gaussian distributions may still be approximated. In contrast, if we restrict to sparse projections, involving only a few of the p variables, then asymptotically all empirical cumulative distribution functions are Gaussian. And (iii) if γ=0, then asymptotically all projections are Gaussian. Some of these results extend to mean-centered sub-Gaussian data and to projections into k dimensions. Hence, in the “small n, large p” setting, unless sparsity is enforced, and regardless of the chosen projection index, projection pursuit may detect an apparent structure that has no statistical significance. Furthermore, our work reveals fundamental limitations on the ability to detect non-Gaussian signals in high-dimensional data, in particular through independent component analysis and related non-Gaussian component analysis.

Download Full-text

Mistreatment Multiclass in Handwritten Character Recognition SVM Classification with Hybrid Feature Extraction

International Journal of Innovative Research in Computer and Communication Engineering ◽

10.15680/ijircce.2014.0212011 ◽

2014 ◽

Vol 02 (12) ◽

pp. 7202-7206

Author(s):

Dr.Kathir. Viswalingam ◽

G.Ayy appan

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Handwritten Character Recognition ◽

Svm Classification ◽

Handwritten Character ◽

Hybrid Feature Extraction

Download Full-text

High-Dimensional Elliptical Sliced Inverse Regression in non-Gaussian Distributions

Journal of Business and Economic Statistics ◽

10.1080/07350015.2021.1910041 ◽

2021 ◽

pp. 1-39

Author(s):

Xin Chen ◽

Jia Zhang ◽

Wang Zhou

Keyword(s):

Sliced Inverse Regression ◽

High Dimensional ◽

Inverse Regression ◽

Gaussian Distributions ◽

Non Gaussian

Download Full-text

A Nonlinear Maximum Correntropy Information Filter for High-Dimensional Neural Decoding

Entropy ◽

10.3390/e23060743 ◽

2021 ◽

Vol 23 (6) ◽

pp. 743

Author(s):

Xi Liu ◽

Shuhang Chen ◽

Xiang Shen ◽

Xiang Zhang ◽

Yiwen Wang

Keyword(s):

State Estimation ◽

Measurement Model ◽

High Dimensional ◽

Neural Firing ◽

The Neural Network ◽

Information Filter ◽

Critical Technology ◽

Dimensional Measurements ◽

Non Gaussian ◽

Low Dimensional

Neural signal decoding is a critical technology in brain machine interface (BMI) to interpret movement intention from multi-neural activity collected from paralyzed patients. As a commonly-used decoding algorithm, the Kalman filter is often applied to derive the movement states from high-dimensional neural firing observation. However, its performance is limited and less effective for noisy nonlinear neural systems with high-dimensional measurements. In this paper, we propose a nonlinear maximum correntropy information filter, aiming at better state estimation in the filtering process for a noisy high-dimensional measurement system. We reconstruct the measurement model between the high-dimensional measurements and low-dimensional states using the neural network, and derive the state estimation using the correntropy criterion to cope with the non-Gaussian noise and eliminate large initial uncertainty. Moreover, analyses of convergence and robustness are given. The effectiveness of the proposed algorithm is evaluated by applying it on multiple segments of neural spiking data from two rats to interpret the movement states when the subjects perform a two-lever discrimination task. Our results demonstrate better and more robust state estimation performance when compared with other filters.

Download Full-text

Feature extraction and artificial neural networks for the on-the-fly classification of high-dimensional thermochemical spaces in adaptive-chemistry simulations – ERRATUM

Data-Centric Engineering ◽

10.1017/dce.2021.4 ◽

2021 ◽

Vol 2 ◽

Author(s):

Giuseppe D’Alessio ◽

Alberto Cuoci ◽

Alessandro Parente

Keyword(s):

Neural Networks ◽

Feature Extraction ◽

Artificial Neural Networks ◽

High Dimensional ◽

Artificial Neural ◽

Adaptive Chemistry

Download Full-text