Principal Component Analysis and Factor Analysis: differences and similarities in Nutritional Epidemiology application

ABSTRACT: Introduction: Statistical methods such as Principal Component Analysis (PCA) and Factor Analysis (FA) are increasingly popular in Nutritional Epidemiology studies. However, misunderstandings regarding the choice and application of these methods have been observed. Objectives: This study aims to compare and present the main differences and similarities between FA and PCA, focusing on their applicability to nutritional studies. Methods: PCA and FA were applied on a matrix of 34 variables expressing the mean food intake of 1,102 individuals from a population-based study. Results: Two factors were extracted and, together, they explained 57.66% of the common variance of food group variables, while five components were extracted, explaining 26.25% of the total variance of food group variables. Among the main differences of these two methods are: normality assumption, matrices of variance-covariance/correlation and its explained variance, factorial scores, and associated error. The similarities are: both analyses are used for data reduction, the sample size usually needs to be big, correlated data, and they are based on matrices of variance-covariance. Conclusion: PCA and FA should not be treated as equal statistical methods, given that the theoretical rationale and assumptions for using these methods as well as the interpretation of results are different.

Download Full-text

Assessment of dietary patterns in nutritional epidemiology: principal component analysis compared with confirmatory factor analysis

American Journal of Clinical Nutrition ◽

10.3945/ajcn.112.038109 ◽

2012 ◽

Vol 96 (5) ◽

pp. 1079-1092 ◽

Cited By ~ 49

Author(s):

Raphaëlle Varraso ◽

Judith Garcia-Aymerich ◽

Florent Monier ◽

Nicole Le Moual ◽

Jordi De Batlle ◽

...

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Confirmatory Factor Analysis ◽

Dietary Patterns ◽

Principal Component ◽

Component Analysis ◽

Nutritional Epidemiology ◽

Confirmatory Factor

Download Full-text

STUDY ON PLANKTON OF DIFFERENT CATEGORIES OF LAKES IN SUMMER BY MEANS OF PRINCIPAL COMPONENT ANALYSIS, FACTOR ANALYSIS AND CLUSTER ANALYSIS

Acta Hydrobiologica Sinica ◽

10.3724/sp.j.1035.2010.00043 ◽

2010 ◽

Vol 36 (1) ◽

pp. 43-50

Author(s):

Luo-Jun GONG ◽

Shi-Ping ZHANG ◽

Bang-Xi XIONG ◽

Ding-Zhu LIU ◽

Jin-Zhong LI ◽

...

Keyword(s):

Principal Component Analysis ◽

Cluster Analysis ◽

Factor Analysis ◽

Principal Component ◽

Component Analysis ◽

And Cluster Analysis

Download Full-text

Principal component analysis and exploratory factor analysis using SAS software package

Journal of Chinese Integrative Medicine ◽

10.3736/jcim20100613 ◽

2010 ◽

Vol 8 (6) ◽

pp. 589-593 ◽

Cited By ~ 1

Author(s):

WW Liu

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Software Package ◽

Principal Component ◽

Component Analysis

Download Full-text

Application of unfold principal component analysis and parallel factor analysis to the exploratory analysis of olive oils by means of excitation–emission matrix fluorescence spectroscopy

Analytica Chimica Acta ◽

10.1016/j.aca.2004.01.008 ◽

2004 ◽

Vol 515 (1) ◽

pp. 75-85 ◽

Cited By ~ 94

Author(s):

Francesca Guimet ◽

Joan Ferré ◽

Ricard Boqué ◽

F.Xavier Rius

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Fluorescence Spectroscopy ◽

Principal Component ◽

Component Analysis ◽

Parallel Factor Analysis ◽

Exploratory Analysis ◽

Excitation Emission Matrix ◽

Olive Oils ◽

Parallel Factor

Download Full-text

Reliability Alpha, Principal Component Analysis, and Exploratory Factor Analysis

Statistical Analysis of Management Data ◽

10.1007/978-1-4614-8594-0_3 ◽

2013 ◽

pp. 31-76

Author(s):

Hubert Gatignon

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Principal Component ◽

Component Analysis

Download Full-text

Principal Component Analysis, Factor Analysis, and Structural Equation Modeling: A Very Brief Introduction

Statistical Test Theory for the Behavioral Sciences ◽

10.1201/9781584889595-11 ◽

2007 ◽

pp. 141-148

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Structural Equation Modeling ◽

Structural Equation ◽

Principal Component ◽

Component Analysis ◽

Equation Modeling

Download Full-text

Influencing Factors Study on Green Development Using Principal Component Analysis and Factor Analysis

Journal of Physics Conference Series ◽

10.1088/1742-6596/1952/4/042120 ◽

2021 ◽

Vol 1952 (4) ◽

pp. 042120

Author(s):

Yihong Sun ◽

Han Gao ◽

Xuemei Yuan

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Influencing Factors ◽

Principal Component ◽

Component Analysis ◽

Green Development

Download Full-text

Principal Component Analysis and Factor Analysis

A Handbook of Statistical Analyses Using SPSS ◽

10.1201/9780203009765.ch11 ◽

2003 ◽

Keyword(s):

Principal Component Analysis ◽

Factor Analysis ◽

Principal Component ◽

Component Analysis

Download Full-text

Principal component analysis (factor analysis), a more accurate method to be used in epidemiological studies of blood pressure in children.

American Journal of Hypertension ◽

10.1016/0895-7061(95)97913-c ◽

1995 ◽

Vol 8 (4) ◽

pp. 147A

Author(s):

M MACEDO

Keyword(s):

Blood Pressure ◽

Principal Component Analysis ◽

Factor Analysis ◽

Principal Component ◽

Accurate Method ◽

Component Analysis ◽

Epidemiological Studies

Download Full-text

Physical-oriented and machine learning-based emission modeling in a diesel compression ignition engine: Dimensionality reduction and regression

International Journal of Engine Research ◽

10.1177/14680874211070736 ◽

2022 ◽

pp. 146808742110707

Author(s):

Aran Mohammad ◽

Reza Rezaei ◽

Christopher Hayduk ◽

Thaddaeus Delebinski ◽

Saeid Shahpouri ◽

...

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Factor Analysis ◽

Dimensionality Reduction ◽

Principal Component ◽

Component Analysis ◽

Data Driven ◽

Support Vector ◽

Emission Models ◽

Emission Modeling

The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.

Download Full-text