On the number of principal components in high dimensions

AbstractThe use of information criteria, especially AIC (Akaike’s information criterion) and BIC (Bayesian information criterion), for choosing an adequate number of principal components is illustrated.

Download Full-text

Improved Performance of Fault Detection Based on Selection of the Optimal Number of Principal Components

Acta Automatica Sinica ◽

10.1016/s1874-1029(08)60123-8 ◽

2009 ◽

Vol 35 (12) ◽

pp. 1550-1557 ◽

Cited By ~ 10

Author(s):

Yuan LI ◽

Xiao-Chu TANG

Keyword(s):

Fault Detection ◽

Principal Components ◽

Optimal Number ◽

Number Of Principal Components ◽

Improved Performance ◽

Selection Of

Download Full-text

A study on the number of principal components and sensitivity of fault detection using PCA

Computers & Chemical Engineering ◽

10.1016/j.compchemeng.2006.09.004 ◽

2007 ◽

Vol 31 (9) ◽

pp. 1035-1046 ◽

Cited By ~ 73

Author(s):

Masayuki Tamura ◽

Shinsuke Tsujita

Keyword(s):

Fault Detection ◽

Principal Components ◽

Number Of Principal Components

Download Full-text

On the number of principal components: A test of dimensionality based on measurements of similarity between matrices

Computational Statistics & Data Analysis ◽

10.1016/j.csda.2007.07.015 ◽

2008 ◽

Vol 52 (4) ◽

pp. 2228-2237 ◽

Cited By ~ 73

Author(s):

Stéphane Dray

Keyword(s):

Principal Components ◽

Number Of Principal Components

Download Full-text

REPRESENTATION BOUND FOR HUMAN FACIAL MIMIC WITH THE AID OF PRINCIPAL COMPONENT ANALYSIS

International Journal of Image and Graphics ◽

10.1142/s0219467810003810 ◽

2010 ◽

Vol 10 (03) ◽

pp. 343-363

Author(s):

ULRIK SÖDERSTRÖM ◽

HAIBO LI

Keyword(s):

Principal Component Analysis ◽

Facial Expressions ◽

Principal Components ◽

Color Image ◽

Signal To Noise Ratio ◽

Principal Component ◽

Component Analysis ◽

Exact Representation ◽

Basic Emotions ◽

Number Of Principal Components

In this paper, we examine how much information is needed to represent the facial mimic, based on Paul Ekman's assumption that the facial mimic can be represented with a few basic emotions. Principal component analysis is used to compact the important facial expressions. Theoretical bounds for facial mimic representation are presented both for using a certain number of principal components and a certain number of bits. When 10 principal components are used to reconstruct color image video at a resolution of 240 × 176 pixels the representation bound is on average 36.8 dB, measured in peak signal-to-noise ratio. Practical confirmation of the theoretical bounds is demonstrated. Quantization of projection coefficients affects the representation, but a quantization with approximately 7-8 bits is found to match an exact representation, measured in mean square error.

Download Full-text

PCR, PLS, or OPLS Evaluation of different regression techniques for hypothesis generation

10.20944/preprints202111.0549.v1 ◽

2021 ◽

Author(s):

Avani Ahuja

Keyword(s):

Least Squares ◽

Partial Least Squares ◽

Principal Components ◽

Selection Process ◽

Principal Component ◽

Hypothesis Generation ◽

Multivariate Techniques ◽

Response Variable ◽

Number Of Principal Components ◽

Latent Structures

In the current era of ‘big data’, scientists are able to quickly amass enormous amount of data in a limited number of experiments. The investigators then try to hypothesize about the root cause based on the observed trends for the predictors and the response variable. This involves identifying the discriminatory predictors that are most responsible for explaining variation in the response variable. In the current work, we investigated three related multivariate techniques: Principal Component Regression (PCR), Partial Least Squares or Projections to Latent Structures (PLS), and Orthogonal Partial Least Squares (OPLS). To perform a comparative analysis, we used a publicly available dataset for Parkinson’ disease patien ts. We first performed the analysis using a cross-validated number of principal components for the aforementioned techniques. Our results demonstrated that PLS and OPLS were better suited than PCR for identifying the discriminatory predictors. Since the X data did not exhibit a strong correlation, we also performed Multiple Linear Regression (MLR) on the dataset. A comparison of the top five discriminatory predictors identified by the four techniques showed a substantial overlap between the results obtained by PLS, OPLS, and MLR, and the three techniques exhibited a significant divergence from the variables identified by PCR. A further investigation of the data revealed that PCR could be used to identify the discriminatory variables successfully if the number of principal components in the regression model were increased. In summary, we recommend using PLS or OPLS for hypothesis generation and systemizing the selection process for principal components when using PCR.rewordexplain later why MLR can be used on a dataset with no correlation

Download Full-text