A Comparison of Penalized Maximum Likelihood Estimation and Markov Chain Monte Carlo Techniques for Estimating Confirmatory Factor Analysis Models With Small Sample Sizes

With small to modest sample sizes and complex models, maximum likelihood (ML) estimation of confirmatory factor analysis (CFA) models can show serious estimation problems such as non-convergence or parameter estimates outside the admissible parameter space. In this article, we distinguish different Bayesian estimators that can be used to stabilize the parameter estimates of a CFA: the mode of the joint posterior distribution that is obtained from penalized maximum likelihood (PML) estimation, and the mean (EAP), median (Med), or mode (MAP) of the marginal posterior distribution that are calculated by using Markov Chain Monte Carlo (MCMC) methods. In two simulation studies, we evaluated the performance of the Bayesian estimators from a frequentist point of view. The results show that the EAP produced more accurate estimates of the latent correlation in many conditions and outperformed the other Bayesian estimators in terms of root mean squared error (RMSE). We also argue that it is often advantageous to choose a parameterization in which the main parameters of interest are bounded, and we suggest the four-parameter beta distribution as a prior distribution for loadings and correlations. Using simulated data, we show that selecting weakly informative four-parameter beta priors can further stabilize parameter estimates, even in cases when the priors were mildly misspecified. Finally, we derive recommendations and propose directions for further research.
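The distinction between the mean (EAP), median (Med), and mode (MAP) of a posterior matters most when the posterior is skewed. The following is a minimal sketch, not the authors' code: it evaluates a one-dimensional posterior on a grid and returns the three point estimates. The function name `point_estimates` and the Beta(6, 2) kernel used as a stand-in posterior are assumptions made purely for illustration.

```python
import math


def point_estimates(grid, log_post):
    """Return (EAP, median, MAP) for a 1-D posterior on an equally spaced grid.

    `log_post` holds unnormalized log-posterior values at the grid points.
    """
    m = max(log_post)                       # subtract the max for numerical stability
    weights = [math.exp(lp - m) for lp in log_post]
    total = sum(weights)
    probs = [w / total for w in weights]    # normalized grid probabilities

    eap = sum(x * p for x, p in zip(grid, probs))

    cum = 0.0
    med = grid[-1]
    for x, p in zip(grid, probs):           # first grid point with CDF >= 0.5
        cum += p
        if cum >= 0.5:
            med = x
            break

    mode = grid[max(range(len(probs)), key=probs.__getitem__)]
    return eap, med, mode


# Hypothetical left-skewed posterior: Beta(6, 2) kernel on (0, 1).
grid = [i / 1000 for i in range(1, 1000)]
log_post = [5 * math.log(x) + math.log(1 - x) for x in grid]
eap, med, mode = point_estimates(grid, log_post)
print(f"EAP={eap:.3f}  Med={med:.3f}  MAP={mode:.3f}")
```

For this left-skewed example the three estimates are ordered EAP < Med < MAP, which shows why the choice of point estimate can change results when sample sizes are small.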


A Comparison of Penalized Maximum Likelihood Estimation and Markov Chain Monte Carlo Techniques for Estimating Confirmatory Factor Analysis Models with Small Sample Sizes

10.31234/osf.io/u3qag ◽

2020 ◽

Author(s):

Oliver Lüdtke ◽

Esther Ulitzsch ◽

Alexander Robitzsch

Keyword(s):

Monte Carlo ◽

Factor Analysis ◽

Markov Chain ◽

Markov Chain Monte Carlo ◽

Maximum Likelihood ◽

Posterior Distribution ◽

Parameter Estimates ◽

Penalized Maximum Likelihood ◽

Confirmatory Factor ◽

Joint Posterior Distribution

With small to modest sample sizes and complex models, maximum likelihood (ML) estimation of confirmatory factor analysis (CFA) models can show serious estimation problems such as nonconvergence or parameter estimates that are outside the admissible parameter space. In the present article, we discuss two Bayesian estimation methods for stabilizing parameter estimates of a CFA: penalized maximum likelihood (PML) estimation and Markov Chain Monte Carlo (MCMC) methods. We clarify that these methods use different Bayesian point estimates: PML uses the mode of the joint posterior distribution, whereas MCMC uses the mean (EAP) or mode (MAP) of the marginal posterior distribution. We also discuss the conditions under which the two methods produce different results. In a simulation study, we show that the MCMC method clearly outperforms PML and that these performance gains can be explained by the fact that MCMC uses the EAP as a point estimate. We also argue that it is often advantageous to choose a parameterization in which the main parameters of interest are bounded, and we suggest the four-parameter beta distribution as a prior distribution for loadings and correlations. Using simulated data, we show that selecting weakly informative four-parameter beta priors can further stabilize parameter estimates, even in cases when the priors were mildly misspecified. Finally, we derive recommendations and propose directions for further research.
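A four-parameter beta distribution is a standard beta distribution with shape parameters a and b that has been shifted and rescaled to an arbitrary interval. The sketch below shows its log-density under the assumption that the bounds are -1 and 1, as would suit a correlation. The function name, the default bounds, and the shape values are illustrative assumptions and are not taken from the article.

```python
import math


def four_param_beta_logpdf(x, a, b, lower=-1.0, upper=1.0):
    """Log-density of a Beta(a, b) distribution rescaled to (lower, upper)."""
    if not (lower < x < upper):
        return float("-inf")                # zero density outside the bounds
    width = upper - lower
    z = (x - lower) / width                 # map x to the unit interval
    log_beta_fn = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    return (
        (a - 1) * math.log(z)
        + (b - 1) * math.log(1 - z)
        - log_beta_fn
        - math.log(width)                   # Jacobian of the rescaling
    )


# Hypothetical weakly informative prior for a latent correlation:
# symmetric around 0, with little mass near the boundaries -1 and 1.
for rho in (-0.9, 0.0, 0.5, 0.9):
    print(rho, four_param_beta_logpdf(rho, a=2.0, b=2.0))
```

In a penalized ML setting, a log-prior of this kind would be added to the log-likelihood before maximization; with a = b = 1 it reduces to a uniform prior on the interval and adds only a constant.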


Sample Size and Number of Parameter Estimates in Maximum Likelihood Confirmatory Factor Analysis: A Monte Carlo Investigation

Structural Equation Modeling: A Multidisciplinary Journal ◽

10.1207/s15328007sem0802_3 ◽

2001 ◽

Vol 8 (2) ◽

pp. 205-223 ◽

Cited By ~ 113

Author(s):

Dennis L. Jackson

Keyword(s):

Monte Carlo ◽

Factor Analysis ◽

Maximum Likelihood ◽

Confirmatory Factor Analysis ◽

Sample Size ◽

Parameter Estimates ◽

Confirmatory Factor ◽

Monte Carlo Investigation


The Effects of Estimator Choice and Weighting Strategies on Confirmatory Factor Analysis With Stratified Samples

Applied Multivariate Research ◽

10.22329/amr.v13i2.3019 ◽

2010 ◽

Vol 13 (2) ◽

pp. 113 ◽

Cited By ~ 1

Author(s):

Bradley J. Brummel ◽

Fritz Drasgow

Keyword(s):

Factor Analysis ◽

Maximum Likelihood ◽

Confirmatory Factor Analysis ◽

Likelihood Estimation ◽

Standard Errors ◽

Parameter Estimates ◽

Estimation Technique ◽

Confirmatory Factor ◽

Pseudo Maximum Likelihood Estimation ◽

Pseudo Maximum Likelihood

Survey researchers often design stratified sampling strategies to target specific subpopulations within the larger population. This stratification can influence the population parameter estimates from these samples because they are not simple random samples of the population. There are three typical estimation options for handling the effects of this stratification in latent variable models: unweighted maximum likelihood, weighted maximum likelihood, and pseudo-maximum likelihood estimation. This paper examines the effects of these procedures on parameter estimates, standard errors, and fit statistics in LISREL 8.7 (Jöreskog & Sörbom, 2004) and Mplus 3.0 (Muthén & Muthén, 2004). Options using several estimation methods are compared to pseudo-maximum likelihood estimation. Results indicated that the choice of estimation technique does not have a substantial effect on confirmatory factor analysis parameter estimates in large samples. However, standard errors of those parameter estimates and RMSEA values for assessing model fit can be substantially affected by estimation technique.


A Cautionary Note on Incremental Fit Indices Reported by LISREL

Methodology ◽

10.1027/1614-1881.1.2.81 ◽

2005 ◽

Vol 1 (2) ◽

pp. 81-85 ◽

Cited By ~ 5

Author(s):

Stefan C. Schmukle ◽

Jochen Hardt

Keyword(s):

Factor Analysis ◽

Maximum Likelihood ◽

Structural Equation ◽

Structural Equation Models ◽

Null Model ◽

Likelihood Estimation ◽

Parameter Estimates ◽

Fit Indices ◽

Cautionary Note ◽

Confirmatory Factor

Incremental fit indices (IFIs) are regularly used when assessing the fit of structural equation models. IFIs are based on the comparison of the fit of a target model with that of a null model. For maximum-likelihood estimation, IFIs are usually computed by using the χ² statistics of the maximum-likelihood fitting function (ML-χ²). However, LISREL recently changed the computation of IFIs. Since version 8.52, IFIs reported by LISREL are based on the χ² statistics of the reweighted least squares fitting function (RLS-χ²). Although both functions lead to the same maximum-likelihood parameter estimates, the two χ² statistics reach different values. Because these differences are especially large for null models, IFIs are affected in particular. Consequently, RLS-χ² based IFIs in combination with conventional cut-off values developed for ML-χ² based IFIs may lead to the erroneous acceptance of models. We demonstrate this point by a confirmatory factor analysis in a sample of 2449 subjects.
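To see why a larger null-model χ² inflates incremental fit indices, it helps to write out the textbook definitions of three common indices (NFI, TLI, CFI). The sketch below uses those standard formulas; the χ² values are hypothetical numbers chosen for illustration, not results from the article, and the target-model χ² is held fixed across the two cases as a simplifying assumption.

```python
def nfi(chi_t, chi_n):
    """Normed fit index from target (t) and null (n) chi-square values."""
    return (chi_n - chi_t) / chi_n


def tli(chi_t, df_t, chi_n, df_n):
    """Tucker-Lewis index (non-normed fit index)."""
    ratio_n = chi_n / df_n
    ratio_t = chi_t / df_t
    return (ratio_n - ratio_t) / (ratio_n - 1.0)


def cfi(chi_t, df_t, chi_n, df_n):
    """Comparative fit index based on estimated noncentrality."""
    d_t = max(chi_t - df_t, 0.0)
    d_n = max(chi_n - df_n, d_t)            # denominator is never below d_t
    return 1.0 if d_n == 0.0 else 1.0 - d_t / d_n


# Hypothetical values: same target model, two different null-model chi-squares.
chi_target, df_target, df_null = 120.0, 50, 66
cfi_small_null = cfi(chi_target, df_target, 900.0, df_null)
cfi_large_null = cfi(chi_target, df_target, 1800.0, df_null)
print(f"CFI with smaller null chi-square: {cfi_small_null:.3f}")
print(f"CFI with larger null chi-square:  {cfi_large_null:.3f}")
```

With these made-up numbers, the same target model falls below a conventional cut-off of .95 in one case and above it in the other, purely because the null-model χ² differs.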


Using the Omnibus Maximum Likelihood F-Test in Confirmatory Factor Analysis Using R

Measurement and Evaluation in Counseling and Development ◽

10.1080/07481756.2021.1906158 ◽

2021 ◽

pp. 1-28

Author(s):

Georgios D. Sideridis ◽

Fathima Jaffari

Keyword(s):

Factor Analysis ◽

Maximum Likelihood ◽

Confirmatory Factor Analysis ◽

F Test ◽

Confirmatory Factor


A Monte Carlo Simulation Study on the Influence of Unequal Group Sizes on Parameter Estimation in Multilevel Confirmatory Factor Analysis

Structural Equation Modeling: A Multidisciplinary Journal ◽

10.1080/10705511.2021.1913594 ◽

2021 ◽

pp. 1-12

Author(s):

Felix Jonas Eßer ◽

Jana Holtmann ◽

Michael Eid

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Factor Analysis ◽

Parameter Estimation ◽

Confirmatory Factor Analysis ◽

Simulation Study ◽

Monte Carlo Simulation Study ◽

Multilevel Confirmatory Factor Analysis ◽

Confirmatory Factor


Parameter stability and consistency in an alongshore-current model determined with Markov chain Monte Carlo

Journal of Hydroinformatics ◽

10.2166/hydro.2008.016 ◽

2008 ◽

Vol 10 (2) ◽

pp. 153-162 ◽

Cited By ~ 2

Author(s):

B. G. Ruessink

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Markov Chain Monte Carlo ◽

Parameter Estimates ◽

Model Parameters ◽

Parameter Stability ◽

Small Uncertainty ◽

Best Fit ◽

Stability And Consistency

When a numerical model is to be used as a practical tool, its parameters should preferably be stable and consistent, that is, possess a small uncertainty and be time-invariant. Using data and predictions of alongshore mean currents flowing on a beach as a case study, this paper illustrates how parameter stability and consistency can be assessed using Markov chain Monte Carlo. Within a single calibration run, Markov chain Monte Carlo estimates the parameter posterior probability density function, its mode being the best-fit parameter set. Parameter stability is investigated by stepwise adding new data to a calibration run, while consistency is examined by calibrating the model on different datasets of equal length. The results for the present case study indicate that various tidal cycles with strong (say, >0.5 m/s) currents are required to obtain stable parameter estimates, and that the best-fit model parameters and the underlying posterior distribution are strongly time-varying. This inconsistent parameter behavior may reflect unresolved variability of the processes represented by the parameters, or may represent compensational behavior for temporal violations in specific model assumptions.
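The idea of taking the posterior mode from an MCMC run as the best-fit parameter set can be sketched with a random-walk Metropolis sampler. This is a generic illustration under stated assumptions, not the calibration procedure used in the paper: the one-parameter target `log_post` is a hypothetical stand-in for a model's log-posterior, and the step size, chain length, and seed are arbitrary.

```python
import math
import random


def log_post(theta):
    """Hypothetical log-posterior (up to a constant): normal centred at 2."""
    return -0.5 * (theta - 2.0) ** 2


def metropolis(log_post, start, n_iter, step, rng):
    """Random-walk Metropolis; returns all draws and the highest-density draw."""
    theta, lp = start, log_post(start)
    best_theta, best_lp = theta, lp
    draws = []
    for _ in range(n_iter):
        proposal = theta + rng.gauss(0.0, step)
        lp_prop = log_post(proposal)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, lp_prop - lp)):
            theta, lp = proposal, lp_prop
            if lp > best_lp:                # track the best-fit draw so far
                best_theta, best_lp = theta, lp
        draws.append(theta)
    return draws, best_theta


draws, best = metropolis(log_post, start=0.0, n_iter=5000, step=1.0,
                         rng=random.Random(1))
print("best-fit draw:", best)
print("posterior mean:", sum(draws) / len(draws))
```

Rerunning such a sampler on successively longer or on different datasets, and comparing the resulting posteriors, is one way to examine the stability and consistency described above.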


Bayesian estimation of parameters in a regional hydrological model

Hydrology and Earth System Sciences ◽

10.5194/hess-6-883-2002 ◽

2002 ◽

Vol 6 (5) ◽

pp. 883-898 ◽

Cited By ~ 55

Author(s):

K. Engeland ◽

L. Gottschalk

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Markov Chain Monte Carlo ◽

Hydrological Model ◽

Parameter Estimates ◽

Model Parameters ◽

Parameter Uncertainties ◽

Daily Streamflow ◽

Likelihood Model ◽

Simulation Errors

This study evaluates the applicability of the distributed, process-oriented Ecomag model for prediction of daily streamflow in ungauged basins. The Ecomag model is applied as a regional model to nine catchments in the NOPEX area, using Bayesian statistics to estimate the posterior distribution of the model parameters conditioned on the observed streamflow. The distribution is calculated by Markov Chain Monte Carlo (MCMC) analysis. The Bayesian method requires formulation of a likelihood function for the parameters, and three alternative formulations are used. The first is a subjectively chosen objective function that describes the goodness of fit between the simulated and observed streamflow, as defined in the GLUE framework. The second and third formulations are more statistically correct likelihood models that describe the simulation errors. The full statistical likelihood model describes the simulation errors as an AR(1) process, whereas the simple model excludes the auto-regressive part. The statistical parameters depend on the catchments and the hydrological processes, and the statistical and the hydrological parameters are estimated simultaneously. The results show that the simple likelihood model gives the most robust parameter estimates. The simulation error may be explained to a large extent by the catchment characteristics and climatic conditions, so it is possible to transfer knowledge about them to ungauged catchments. The statistical models for the simulation errors indicate that structural errors in the model are more important than parameter uncertainties.

Keywords: regional hydrological model, model uncertainty, Bayesian analysis, Markov Chain Monte Carlo analysis
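The contrast between a likelihood with an AR(1) error process and one without the auto-regressive part can be sketched as follows. This is a minimal illustration that assumes zero-mean Gaussian innovations with constant variance; the study's actual error models may differ (for example, in any transformations applied), and the function name and error values are hypothetical.

```python
import math


def ar1_loglik(errors, phi, sigma):
    """Exact Gaussian log-likelihood of errors following a stationary AR(1).

    e[t] = phi * e[t-1] + eta[t], with eta[t] ~ N(0, sigma^2).
    Setting phi = 0 gives the simple model with independent errors.
    """
    if not -1.0 < phi < 1.0 or sigma <= 0.0:
        raise ValueError("need |phi| < 1 and sigma > 0")
    var_first = sigma ** 2 / (1.0 - phi ** 2)   # stationary variance of e[0]
    ll = -0.5 * (math.log(2 * math.pi * var_first) + errors[0] ** 2 / var_first)
    for prev, cur in zip(errors, errors[1:]):
        resid = cur - phi * prev                # innovation at this time step
        ll += -0.5 * (math.log(2 * math.pi * sigma ** 2) + resid ** 2 / sigma ** 2)
    return ll


# Hypothetical simulation errors (simulated minus observed streamflow).
errors = [0.5, 0.4, 0.45, 0.3, 0.35, 0.2]
print("simple model (phi = 0):", ar1_loglik(errors, phi=0.0, sigma=0.4))
print("AR(1) model (phi = 0.8):", ar1_loglik(errors, phi=0.8, sigma=0.4))
```

In a Bayesian calibration, `phi` and `sigma` would be sampled together with the hydrological parameters, which corresponds to the simultaneous estimation of statistical and hydrological parameters mentioned in the abstract.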
