Model-Selection-Based Approaches to Identifying the Optimal Number of Factors in Multilevel Exploratory Factor Analysis

Abstract: The objective article is the comparative analysis of Likert rating scale based on the following range of response categories, i.e. 5, 7, 9 and 11 in context of the appropriate process of factors extraction in exploratory factor analysis (EFA). The problem which is being addressed in article relates primarily to the methodological aspects, both in selection of the optimal number of response categories of the measured items (constituting the Likert scale) and identification of possible changes, differences or similarities associated (as a result of the impact of four types of scales) with extraction and determination the appropriate number of factors in EFA model.Keywords: Exploratory factor analysis, Likert scale, experiment research, marketing

Download Full-text

Survey on establishing the optimal number of factors in exploratory factor analysis applied to data mining

Wiley Interdisciplinary Reviews Data Mining and Knowledge Discovery ◽

10.1002/widm.1294 ◽

2018 ◽

Vol 9 (2) ◽

pp. e1294 ◽

Cited By ~ 1

Author(s):

Laszlo Barna Iantovics ◽

Corina Rotar ◽

Florica Morar

Keyword(s):

Data Mining ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Optimal Number ◽

Number Of Factors

Download Full-text

Using Fit Statistic Differences to Determine the Optimal Number of Factors to Retain in an Exploratory Factor Analysis

Educational and Psychological Measurement ◽

10.1177/0013164419865769 ◽

2019 ◽

Vol 80 (2) ◽

pp. 217-241 ◽

Cited By ~ 3

Author(s):

W. Holmes Finch

Keyword(s):

Factor Analysis ◽

Exploratory Factor Analysis ◽

Optimal Number ◽

Parallel Analysis ◽

Fit Indices ◽

Number Of Factors ◽

Indicator Variables ◽

Model Fit Indices ◽

Fit Index ◽

Normally Distributed

Exploratory factor analysis (EFA) is widely used by researchers in the social sciences to characterize the latent structure underlying a set of observed indicator variables. One of the primary issues that must be resolved when conducting an EFA is determination of the number of factors to retain. There exist a large number of statistical tools designed to address this question, with none being universally optimal across applications. Recently, researchers have investigated the use of model fit indices that are commonly used in the conduct of confirmatory factor analysis to determine the number of factors to retain in EFA. These results have yielded mixed results, appearing to be effective when used in conjunction with normally distributed indicators, but not being as effective for categorical indicators. The purpose of this simulation study was to compare the performance of difference values for several fit indices as a method for identifying the optimal number of factors to retain in an EFA, with parallel analysis, which is one of the most reliable such extant methods. Results of the simulation demonstrated that the use of fit index difference values outperformed parallel analysis for categorical indicators, and for normally distributed indicators when factor loadings were small. Implications of these findings are discussed.

Download Full-text

Supplemental Material for One Model to Rule Them All? Using Machine Learning Algorithms to Determine the Number of Factors in Exploratory Factor Analysis

Psychological Methods ◽

10.1037/met0000262.supp ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Number Of Factors

Download Full-text

An exploratory factor analysis of the spontaneous reporting of severe cutaneous adverse reactions

Therapeutic Advances in Drug Safety ◽

10.1177/2042098616670799 ◽

2016 ◽

Vol 8 (1) ◽

pp. 4-16 ◽

Cited By ~ 6

Author(s):

Manfred Hauben ◽

Eric Hung ◽

Wen-Yaw Hsieh

Keyword(s):

Factor Analysis ◽

T Cell ◽

Exploratory Factor Analysis ◽

Adverse Reactions ◽

Overlap Syndromes ◽

Number Of Factors ◽

Shared Variance ◽

Severe Cutaneous Adverse Reactions ◽

Cutaneous Adverse Reactions ◽

Variance Explained

Background: Severe cutaneous adverse reactions (SCARs) are prominent in pharmacovigilance (PhV). They have some commonalities such as nonimmediate nature and T-cell mediation and rare overlap syndromes have been documented, most commonly involving acute generalized exanthematous pustulosis (AGEP) and drug rash with eosinophilia and systemic symptoms (DRESS), and DRESS and toxic epidermal necrolysis (TEN). However, they display diverse clinical phenotypes and variations in specific T-cell immune response profiles, plus some specific genotype–phenotype associations. A question is whether causation of a given SCAR by a given drug supports causality of the same drug for other SCARs. If so, we might expect significant intercorrelations between SCARs with respect to overall drug-reporting patterns. SCARs with significant intercorrelations may reflect a unified underlying concept. Methods: We used exploratory factor analysis (EFA) on data from the United States Food and Drug Administration Adverse Event Reporting System (FAERS) to assess reporting intercorrelations between six SCARs [AGEP, DRESS, erythema multiforme (EM), Stevens–Johnson syndrome (SJS), TEN, exfoliative dermatitis (ExfolDerm)]. We screened the data using visual inspection of scatterplot matrices for problematic data patterns. We assessed factorability via Bartlett’s test of sphericity, Kaiser-Myer-Olkin (KMO) statistic, initial estimates of communality and the anti-image correlation matrix. We extracted factors via principle axis factoring (PAF). The number of factors was determined by scree plot/Kaiser’s rule. We also examined solutions with an additional factor. We applied various oblique rotations. We assessed the strength of the solution by percentage of variance explained, minimum number of factors loading per major factor, the magnitude of the communalities, loadings and crossloadings, and reproduced- and residual correlations. Results: The data were generally adequate for factor analysis but the amount of variance explained, shared variance, and communalities were low, suggesting caution in general against extrapolating causality between SCARs. SJS and TEN displayed most shared variance. AGEP and DRESS, the other SCAR pair most often observed in overlap syndromes, demonstrated modest shared variance, along with maculopapular rash (MPR). DRESS and TEN, another of the more commonly diagnosed pairs in overlap syndromes, did not. EM was uncorrelated with SJS and TEN. Conclusions: The notion that causality of a drug for one SCAR bolsters support for causality of the same drug with other SCARs was generally not supported.

Download Full-text

A Demonstration of the Impact of Outliers on the Decisions About the Number of Factors in Exploratory Factor Analysis

Educational and Psychological Measurement ◽

10.1177/0013164411410878 ◽

2011 ◽

Vol 72 (2) ◽

pp. 181-199 ◽

Cited By ~ 4

Author(s):

Yan Liu ◽

Bruno D. Zumbo ◽

Amery D. Wu

Keyword(s):

Factor Analysis ◽

Exploratory Factor Analysis ◽

Number Of Factors ◽

The Impact

Download Full-text

The Poor Fit of Model Fit for Selecting Number of Factors in Exploratory Factor Analysis for Scale Evaluation

Educational and Psychological Measurement ◽

10.1177/0013164420942899 ◽

2020 ◽

pp. 001316442094289

Author(s):

Amanda K. Montoya ◽

Michael C. Edwards

Keyword(s):

Factor Analysis ◽

Root Mean Square ◽

Exploratory Factor Analysis ◽

Model Fit ◽

Mean Square ◽

Fit Indices ◽

Scale Evaluation ◽

Number Of Factors ◽

Model Fit Indices ◽

The Impact

Model fit indices are being increasingly recommended and used to select the number of factors in an exploratory factor analysis. Growing evidence suggests that the recommended cutoff values for common model fit indices are not appropriate for use in an exploratory factor analysis context. A particularly prominent problem in scale evaluation is the ubiquity of correlated residuals and imperfect model specification. Our research focuses on a scale evaluation context and the performance of four standard model fit indices: root mean square error of approximate (RMSEA), standardized root mean square residual (SRMR), comparative fit index (CFI), and Tucker–Lewis index (TLI), and two equivalence test-based model fit indices: RMSEAt and CFIt. We use Monte Carlo simulation to generate and analyze data based on a substantive example using the positive and negative affective schedule ( N = 1,000). We systematically vary the number and magnitude of correlated residuals as well as nonspecific misspecification, to evaluate the impact on model fit indices in fitting a two-factor exploratory factor analysis. Our results show that all fit indices, except SRMR, are overly sensitive to correlated residuals and nonspecific error, resulting in solutions that are overfactored. SRMR performed well, consistently selecting the correct number of factors; however, previous research suggests it does not perform well with categorical data. In general, we do not recommend using model fit indices to select number of factors in a scale evaluation framework.

Download Full-text

Exploratory factor analysis with structured residuals for brain network data

Network Neuroscience ◽

10.1162/netn_a_00162 ◽

2020 ◽

pp. 1-27

Author(s):

Erik-Jan van Kesteren ◽

Rogier A. Kievit

Keyword(s):

Factor Analysis ◽

Dimension Reduction ◽

Exploratory Factor Analysis ◽

A Priori ◽

Brain Regions ◽

Brain Network ◽

R Software ◽

Network Analyses ◽

Number Of Factors ◽

The Brain

Dimension reduction is widely used and often necessary to make network analyses and their interpretation tractable by reducing high-dimensional data to a small number of underlying variables. Techniques such as exploratory factor analysis (EFA) are used by neuroscientists to reduce measurements from a large number of brain regions to a tractable number of factors. However, dimension reduction often ignores relevant a priori knowledge about the structure of the data. For example, it is well established that the brain is highly symmetric. In this paper, we (a) show the adverse consequences of ignoring a priori structure in factor analysis, (b) propose a technique to accommodate structure in EFA by using structured residuals (EFAST), and (c) apply this technique to three large and varied brain-imaging network datasets, demonstrating the superior fit and interpretability of our approach. We provide an R software package to enable researchers to apply EFAST to other suitable datasets.

Download Full-text

How to determine the number of factors to retain in exploratory factor analysis: A comparison of extraction methods under realistic conditions.

Psychological Methods ◽

10.1037/met0000200 ◽

2019 ◽

Vol 24 (4) ◽

pp. 468-491 ◽

Cited By ~ 21

Author(s):

Max Auerswald ◽

Morten Moshagen

Keyword(s):

Factor Analysis ◽

Exploratory Factor Analysis ◽

Extraction Methods ◽

Number Of Factors

Download Full-text