Estimating Classification Errors Under Edit Restrictions in Composite Survey-Register Data Using Multiple Imputation Latent Class Modelling (MILC)

2017 ◽  
Vol 33 (4) ◽  
pp. 921-962 ◽  
Author(s):  
Laura Boeschoten ◽  
Daniel Oberski ◽  
Ton de Waal

Abstract Both registers and surveys can contain classification errors. These errors can be estimated by making use of a composite data set. We propose a new method based on latent class modelling to estimate the number of classification errors across several sources, while taking into account impossible combinations with scores on other variables. Furthermore, by multiply imputing a new variable, the latent class model enhances the quality of statistics based on the composite data set. The performance of this method is investigated in a simulation study, which shows that whether the method can be applied depends on the entropy R² of the latent class model and on the type of analysis a researcher is planning to do. Finally, the method is applied to public data from Statistics Netherlands.
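As a rough illustration of the idea, a minimal two-class latent class EM on simulated composite data (our own toy setup with three binary sources and a symmetric 10% error rate, not the Statistics Netherlands application) recovers per-source classification-error rates, and the entropy R² the abstract uses as an applicability check can be read off the fitted posteriors:

```python
import numpy as np

rng = np.random.default_rng(42)

# Simulate a composite data set: a register and two surveys each
# measure the same true binary status with 10% classification error.
# (Illustrative setup only.)
N, J, err = 5000, 3, 0.10
z_true = (rng.random(N) < 0.6).astype(int)
Y = np.where(rng.random((N, J)) < err, 1 - z_true[:, None], z_true[:, None])

# EM for a 2-class latent class model with binary indicators.
pi = np.array([0.5, 0.5])                    # class proportions
theta = rng.uniform(0.3, 0.7, size=(2, J))   # P(y_j = 1 | class)
for _ in range(300):
    # E-step: posterior class membership per record
    logp = np.log(pi) + Y @ np.log(theta).T + (1 - Y) @ np.log(1 - theta).T
    post = np.exp(logp - logp.max(axis=1, keepdims=True))
    post /= post.sum(axis=1, keepdims=True)
    # M-step: update proportions and conditional response probabilities
    pi = post.mean(axis=0)
    theta = np.clip((post.T @ Y) / post.sum(axis=0)[:, None], 1e-6, 1 - 1e-6)

# Align labels so class 1 is the "positive" class, then read the
# estimated classification-error rate of each source off theta.
if theta[1].mean() < theta[0].mean():
    pi, theta, post = pi[::-1], theta[::-1], post[:, ::-1]
err_hat = 1 - theta[1]        # P(source reports 0 | true class is 1)
print(np.round(err_hat, 3))   # close to the simulated 0.10

# Entropy R^2 of the fitted model: 1 minus posterior entropy
# relative to its maximum, near 1 when classes are well separated.
p = np.clip(post, 1e-12, 1.0)
entropy_r2 = 1 - (-(p * np.log(p)).sum()) / (N * np.log(2))
```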

2016 ◽  
Vol 118 (2) ◽  
pp. 343-361 ◽  
Author(s):  
Eline Poelmans ◽  
Sandra Rousseau

Purpose – The purpose of this paper is to investigate how chocolate lovers balance taste and ethical considerations when selecting chocolate products.

Design/methodology/approach – The data set was collected through a survey at the 2014 “Salon du Chocolat” in Brussels, Belgium. The authors distributed 700 copies and received 456 complete responses (a 65 percent response rate). Choice experiments were used to estimate the relative importance of different chocolate characteristics and to predict respondents’ willingness to pay for marginal changes in those characteristics. The authors estimate both a conditional logit model and a latent class model to take possible preference heterogeneity into account.

Findings – On average, respondents were willing to pay 11 euros more for 250 g of fairtrade-labeled chocolate than for conventional chocolate. However, taste clearly dominates ethical considerations. The authors distinguish three consumer segments, each with a different trade-off between taste and fairtrade: one group clearly valued fairtrade positively, a second group valued fairtrade to a lesser extent, and a third group did not seem to value fairtrade at all.

Originality/value – Chocolate can be seen as a self-indulgent treat for which taste is likely to dominate other characteristics, so it is unclear to what extent ethical factors enter consumer decisions. Interestingly, the results indicate that a significant share of chocolate buyers still value fairtrade characteristics positively when selecting chocolate varieties.
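In a conditional logit model, marginal willingness to pay is the ratio of an attribute coefficient to the (negative) price coefficient. A minimal sketch with illustrative coefficients, chosen here so the implied fairtrade WTP matches the 11-euro figure above (they are not the paper's estimates):

```python
# Marginal willingness to pay from conditional-logit coefficients.
# Coefficient values are illustrative, not the paper's estimates.
beta = {
    "price": -0.20,      # per euro, for a 250 g bar
    "fairtrade": 2.20,   # fairtrade label present
    "taste": 4.10,       # preferred taste variety
}

def marginal_wtp(attr: str) -> float:
    """WTP for a one-unit change in `attr` = -beta_attr / beta_price."""
    return -beta[attr] / beta["price"]

print(round(marginal_wtp("fairtrade"), 2))  # 11.0 euros per 250 g
print(marginal_wtp("taste") > marginal_wtp("fairtrade"))  # True: taste dominates
```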


Methodology ◽  
2018 ◽  
Vol 14 (2) ◽  
pp. 56-68 ◽  
Author(s):  
Davide Vidotto ◽  
Jeroen K. Vermunt ◽  
Katrijn Van Deun

Abstract. Latent class (LC) analysis has recently been proposed for the multiple imputation (MI) of missing categorical data, using either a standard frequentist approach or a nonparametric Bayesian model called the Dirichlet process mixture of multinomial distributions (DPMM). The main advantage of using a latent class model for multiple imputation is its flexibility: it can capture complex relationships in the data, provided the number of latent classes is large enough. However, the two existing approaches also have certain disadvantages. The frequentist approach is computationally demanding because it requires estimating many LC models: first, models with different numbers of classes must be estimated to determine the required number of classes, and then the selected model is re-estimated on multiple bootstrap samples to account for parameter uncertainty during the imputation stage. The Bayesian Dirichlet process model performs model selection and handles parameter uncertainty automatically, but it tends to use too small a number of clusters during Gibbs sampling, leading to an underfitting model that yields invalid imputations. In this paper, we propose an alternative approach that combines the strengths of the two existing approaches: we use the standard Bayesian latent class model as an imputation model. We show how model selection can be performed prior to the imputation step using a single run of the Gibbs sampler and, moreover, how underfitting is prevented by using large values for the hyperparameters of the mixture weights. The results of two simulation studies and one real-data study indicate that, with a proper setting of the prior distributions, the Bayesian latent class model yields valid imputations and outperforms competing methods.
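The Gibbs sampler for such a model cycles through mixture weights, item parameters, and class assignments; a large symmetric Dirichlet hyperparameter on the weights is what keeps small classes from dying out. A minimal sketch of one iteration for binary indicators (our own variable names and priors, not the paper's software):

```python
import numpy as np

def gibbs_step(Y, z, K, alpha, rng):
    """One Gibbs iteration for a Bayesian latent class model with
    binary indicators: sample weights | z, item probabilities | z,
    then z | parameters. A large `alpha` (symmetric Dirichlet
    hyperparameter on the mixture weights) is the underfitting
    safeguard described above."""
    N, J = Y.shape
    # Mixture weights | assignments: Dirichlet(alpha + class counts)
    counts = np.bincount(z, minlength=K)
    pi = rng.dirichlet(alpha + counts)
    # Item probabilities | assignments: conjugate Beta(1, 1) update
    ones = np.zeros((K, J))
    ns = np.zeros((K, 1))
    for k in range(K):
        Yk = Y[z == k]
        ones[k], ns[k] = Yk.sum(axis=0), len(Yk)
    theta = rng.beta(1 + ones, 1 + ns - ones)
    # Assignments | parameters: categorical draw from posteriors
    logp = np.log(pi) + Y @ np.log(theta).T + (1 - Y) @ np.log(1 - theta).T
    p = np.exp(logp - logp.max(axis=1, keepdims=True))
    p /= p.sum(axis=1, keepdims=True)
    z = np.array([rng.choice(K, p=row) for row in p])
    return pi, theta, z

# Short demonstration run on random binary data.
rng = np.random.default_rng(0)
Y = (rng.random((500, 4)) < 0.5).astype(int)
z = rng.integers(0, 5, size=500)
for _ in range(20):
    pi, theta, z = gibbs_step(Y, z, K=5, alpha=50.0, rng=rng)
```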


2015 ◽  
Vol 2015 ◽  
pp. 1-8 ◽  
Author(s):  
Lian Lian ◽  
Shuo Zhang ◽  
Zhong Wang ◽  
Kai Liu ◽  
Lihuan Cao

As parcel delivery services boom in China, competition among express companies is intensifying. This paper employs a multinomial logit model (MNL) and a latent class model (LCM) to investigate customers’ express-service choice behavior, using data from a stated preference (SP) survey. The attributes and attribute levels that matter most to express customers are identified. The customers fall into two segments (a penny-pincher segment and a high-end segment) characterized by their taste heterogeneity. The results indicate that the LCM fits statistically better than the MNL in our sample. More attention should therefore be paid to taste heterogeneity, especially in further academic and policy research on freight choice behavior.
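Such MNL-versus-LCM comparisons are typically decided with an information criterion, since the LCM spends extra parameters on class-specific coefficients. A minimal BIC sketch with hypothetical fit statistics (illustrative numbers, not the paper's values):

```python
import math

def bic(loglik: float, n_params: int, n_obs: int) -> float:
    """Bayesian information criterion; lower is better."""
    return -2.0 * loglik + n_params * math.log(n_obs)

# Hypothetical fit statistics for illustration only: an MNL with 6
# attribute coefficients versus a 2-class LCM that doubles the
# coefficients and adds one class-share parameter.
n_obs = 800
bic_mnl = bic(-950.0, 6, n_obs)
bic_lcm = bic(-880.0, 13, n_obs)
print(bic_lcm < bic_mnl)  # True: the LCM wins despite more parameters
```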


2017 ◽  
Vol 78 (6) ◽  
pp. 925-951 ◽  
Author(s):  
Unkyung No ◽  
Sehee Hong

The purpose of the present study is to compare the performance of mixture modeling approaches (the one-step approach, the three-step maximum-likelihood approach, the three-step BCH approach, and the LTB approach) under diverse sample-size conditions. Two simulation studies were conducted with two different models: a latent class model with three predictor variables and a latent class model with one distal outcome variable. Data were generated under different conditions of sample size (100, 200, 300, 500, 1,000), entropy (0.6, 0.7, 0.8, 0.9), and the variance of the distal outcome (homoscedasticity, heteroscedasticity). Parameter estimate bias, standard error bias, mean squared error, and coverage served as evaluation criteria. Results demonstrate that the three-step approaches produced more stable and better estimates than the other approaches, even with a sample size as small as 100. This research differs from previous studies in that various models were used to compare the approaches and smaller sample-size conditions were included. Furthermore, the results supporting the superiority of the three-step approaches even under poorly manipulated conditions underline the advantage of these approaches.
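The entropy values varied in the design (0.6 to 0.9) refer to the standard mixture-model class-separation index, computed from posterior class probabilities. A minimal sketch of that statistic (our own implementation of the textbook formula):

```python
import numpy as np

def relative_entropy(post):
    """Entropy-based class-separation index in [0, 1] for an N x K
    matrix of posterior class probabilities: 1 minus the total
    posterior entropy relative to its maximum N * log(K). Values
    near 1 indicate well-separated classes."""
    n, k = post.shape
    p = np.clip(post, 1e-12, 1.0)
    return 1.0 - (-(p * np.log(p)).sum()) / (n * np.log(k))

crisp = np.array([[1.0, 0.0], [0.0, 1.0]])  # certain assignments
fuzzy = np.full((2, 2), 0.5)                # maximally uncertain
print(round(relative_entropy(crisp), 3))    # 1.0
print(relative_entropy(fuzzy))              # 0.0
```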

