High-dimensional Data Classification Based on Principal Component Analysis Dimension Reduction and Improved BP Algorithm

Kernel entropy component analysis (KECA) reveals the original data’s structure by kernel matrix. This structure is related to the Renyi entropy of the data. KECA maintains the invariance of the original data’s structure by keeping the data’s Renyi entropy unchanged. This paper described the original data by several components on the purpose of dimension reduction. Then the KECA was applied in celestial spectra reduction and was compared with Principal Component Analysis (PCA) and Kernel Principal Component Analysis (KPCA) by experiments. Experimental results show that the KECA is a good method in high-dimensional data reduction.

Download Full-text

Principal Component Analysis (PCA) for high-dimensional data. PCA is dead. Long live PCA

Perspectives on Big Data Analysis - Contemporary Mathematics ◽

10.1090/conm/622/12430 ◽

2014 ◽

pp. 1-10 ◽

Cited By ~ 1

Author(s):

Fan Yang ◽

Kjell Doksum ◽

Kam-Wah Tsui

Keyword(s):

Principal Component Analysis ◽

High Dimensional Data ◽

Principal Component ◽

Component Analysis ◽

High Dimensional

Download Full-text

Multilevel Functional Principal Component Analysis for High-Dimensional Data

Journal of Computational and Graphical Statistics ◽

10.1198/jcgs.2011.10122 ◽

2011 ◽

Vol 20 (4) ◽

pp. 852-873 ◽

Cited By ~ 33

Author(s):

Vadim Zipunnikov ◽

Brian Caffo ◽

David M. Yousem ◽

Christos Davatzikos ◽

Brian S. Schwartz ◽

...

Keyword(s):

Principal Component Analysis ◽

High Dimensional Data ◽

Principal Component ◽

Component Analysis ◽

Functional Principal Component Analysis ◽

High Dimensional ◽

Functional Principal Component

Download Full-text

Impact of Bone Marrow Radiation Dose on Acute Hematologic Toxicity in Cervical Cancer: Principal Component Analysis on High Dimensional Data

International Journal of Radiation Oncology*Biology*Physics ◽

10.1016/j.ijrobp.2009.11.062 ◽

2010 ◽

Vol 78 (3) ◽

pp. 912-919 ◽

Cited By ~ 28

Author(s):

Yun Liang ◽

Karen Messer ◽

Brent S. Rose ◽

John H. Lewis ◽

Steve B. Jiang ◽

...

Keyword(s):

Cervical Cancer ◽

Principal Component Analysis ◽

Bone Marrow ◽

Radiation Dose ◽

High Dimensional Data ◽

Principal Component ◽

Component Analysis ◽

High Dimensional ◽

Hematologic Toxicity

Download Full-text

Performance Analysis of Dimensionality Reduction Techniques in the Context of Clustering

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s3.2084 ◽

2019 ◽

Vol 8 (S3) ◽

pp. 66-71

Author(s):

T. Sudha ◽

P. Nagendra Kumar

Keyword(s):

Principal Component Analysis ◽

Dimensionality Reduction ◽

High Dimensional Data ◽

Principal Component ◽

Component Analysis ◽

High Dimensional ◽

Reduction Techniques ◽

Dimensionality Reduction Techniques ◽

Low Dimensional ◽

Probabilistic Principal Component Analysis

Data mining is one of the major areas of research. Clustering is one of the main functionalities of datamining. High dimensionality is one of the main issues of clustering and Dimensionality reduction can be used as a solution to this problem. The present work makes a comparative study of dimensionality reduction techniques such as t-distributed stochastic neighbour embedding and probabilistic principal component analysis in the context of clustering. High dimensional data have been reduced to low dimensional data using dimensionality reduction techniques such as t-distributed stochastic neighbour embedding and probabilistic principal component analysis. Cluster analysis has been performed on the high dimensional data as well as the low dimensional data sets obtained through t-distributed stochastic neighbour embedding and Probabilistic principal component analysis with varying number of clusters. Mean squared error; time and space have been considered as parameters for comparison. The results obtained show that time taken to convert the high dimensional data into low dimensional data using probabilistic principal component analysis is higher than the time taken to convert the high dimensional data into low dimensional data using t-distributed stochastic neighbour embedding.The space required by the data set reduced through Probabilistic principal component analysis is less than the storage space required by the data set reduced through t-distributed stochastic neighbour embedding.

Download Full-text