A Review of Principal Component Analysis Algorithm for Dimensionality Reduction

Author(s):  
Basna Mohammed Salih Hasan ◽  
◽  
Adnan Mohsin Abdulazeez ◽  

Big databases are increasingly widespread and are therefore hard to understand, in exploratory biomedicine science, big data in health research is highly exciting because data-based analyses can travel quicker than hypothesis-based research. Principal Component Analysis (PCA) is a method to reduce the dimensionality of certain datasets. Improves interpretability but without losing much information. It achieves this by creating new covariates that are not related to each other. Finding those new variables, or what we call the main components, will reduce the eigenvalue /eigenvectors solution problem. (PCA) can be said to be an adaptive data analysis technology because technology variables are developed to adapt to different data types and structures. This review will start by introducing the basic ideas of (PCA), describe some concepts related to (PCA), and discussing. What it can do, and reviewed fifteen articles of (PCA) that have been introduced and published in the last three years.

Author(s):  
Ian T. Jolliffe ◽  
Jorge Cadima

Large datasets are increasingly common and are often difficult to interpret. Principal component analysis (PCA) is a technique for reducing the dimensionality of such datasets, increasing interpretability but at the same time minimizing information loss. It does so by creating new uncorrelated variables that successively maximize variance. Finding such new variables, the principal components, reduces to solving an eigenvalue/eigenvector problem, and the new variables are defined by the dataset at hand, not a priori , hence making PCA an adaptive data analysis technique. It is adaptive in another sense too, since variants of the technique have been developed that are tailored to various different data types and structures. This article will begin by introducing the basic ideas of PCA, discussing what it can and cannot do. It will then describe some variants of PCA and their application.


2020 ◽  
Vol 1 (1) ◽  
pp. 40
Author(s):  
Samsul Arifin ◽  
Anna Islamiyati ◽  
Raupong Raupong

In the formation of a regression model there is a possibility of a relationship between one predictor variable with other predictor variables known as multicollinearity. In the parametric approach, multicollinearity can be overcome by the principal component analysis method. Principal component analysis (PCA) is a multivariate analysis that transforms the originating variables that are correlated into new variables that are not correlated by reducing a number of these variables so that they have smaller dimensions but can account for most of the diversity of the original variables. In some research data that do not form parametric patterns also allows the occurrence of multicollinearity on the predictor variables. This study examines the ability of spline estimators in the analysis of the main components. The data contained multicollinearity and was applied to diabetes mellitus data by taking cholesterol type factors as predictors. Based on the estimation results, one main component is obtained to explain the diversity of variables in diabetes data with the best linear spline model at one knot point.


Optik ◽  
2016 ◽  
Vol 127 (9) ◽  
pp. 3935-3944 ◽  
Author(s):  
Lingjun Li ◽  
Shigang Liu ◽  
Yali Peng ◽  
Zengguo Sun

2013 ◽  
Vol 2013 ◽  
pp. 1-12 ◽  
Author(s):  
Asloob Ahmad Mudassar ◽  
Saira Butt

A retinal image has blood vessels, optic disc, fovea, and so forth as the main components of an image. Segmentation of these components has been investigated extensively. Principal component analysis (PCA) is one of the techniques that have been applied to segment the optic disc, but only a limited work has been reported. To our knowledge, fovea segmentation problem has not been reported in the literature using PCA. In this paper, we are presenting the segmentation of optic disc and fovea using PCA. The PCA was trained on optic discs and foveae using ten retinal images and then applied on seventy retinal images with a success rate of 97% in case of optic discs and 94.3% in case of fovea. Conventional algorithms feed one patch at a time from a test retinal image, and the next patch separated by one pixel part is fed. This process is continued till the full image area is covered. This is time consuming. We are suggesting techniques to cut down the processing time with the help of binary vessel tree of a given test image. Results are presented to validate our idea.


2021 ◽  
Vol 10 (3) ◽  
pp. 168
Author(s):  
RAHMAD RAHMAD WIDODO ◽  
I PUTU EKA NILA KENCANA ◽  
NI LUH PUTU SUCIPTAWATI

Controlling the quality of learning is very important and influences the accreditation of study programs at the Faculty of Mathematics and Natural Sciences Udayana University, as a guarantor of the quality of graduates. Apply pricipal component analysis to reduce the number of determinant attributes of learning quality, with the aim of looking at the data structure with fewer variables. The control chart is a multivariate control chart that is used to view the potrait of the quality of learning in the Mathematics and Natural Sciences Faculty, using new variables obtained from principal component analysis. The results obtained from principal component analysis show that the contribution of the learning quality indicators is univen. The potrait of the quality of learning at the Faculty of Mathematics and Natural Sciences obtained from the individual-moving range (I-MR) and the control chart shows the need for corrective actions and monitor regularly to improve the quality of learning.


Sign in / Sign up

Export Citation Format

Share Document