scholarly journals On the correspondence of deviances and maximum-likelihood and interval estimates from log-linear to logistic regression modelling

2020 ◽  
Vol 7 (1) ◽  
pp. 191483
Author(s):  
W. Jing ◽  
M. Papathomas

Consider a set of categorical variables P where at least one, denoted by Y , is binary. The log-linear model that describes the contingency table counts implies a logistic regression model, with outcome Y . Extending results from Christensen (1997, Log-linear models and logistic regression , 2nd edn. New York, NY, Springer), we prove that the maximum-likelihood estimates (MLE) of the logistic regression parameters equals the MLE for the corresponding log-linear model parameters, also considering the case where contingency table factors are not present in the corresponding logistic regression and some of the contingency table cells are collapsed together. We prove that, asymptotically, standard errors are also equal. These results demonstrate the extent to which inferences from the log-linear framework translate to inferences within the logistic regression framework, on the magnitude of main effects and interactions. Finally, we prove that the deviance of the log-linear model is equal to the deviance of the corresponding logistic regression, provided that no cell observations are collapsed together when one or more factors in P ∖ { Y } become obsolete. We illustrate the derived results with the analysis of a real dataset.

2020 ◽  
pp. 1-7
Author(s):  
Fatin N.S.A. ◽  
Norlida M.N. ◽  
Siti Z.M.J.

Log-linear model is a technique used to analyze the cross-classification categorical data or the contingency table. It is used to obtain the parsimony models that describe the interaction between the categorical variables in contingency tables. Log-linear models are commonly used in evaluating higher dimensional contingency tables that involves more than two categorical variables. This study focuses on analyzing data of poisoned patients from 2012 to 2014 using log-linear model. There are two model analyzed; model for demographic data of patients and model of poisoning information. For the first model, the variables involved are gender, age, race and state. Variables for the second model are circumstance of exposure, type of exposure, location of exposure, route of exposure and types of poison. Both log-linear models are developed to investigate the association between variables in the model. As a result of this study, the best model for demographic data and poisoning information are the model with three-ways interaction. For the best model of demographic data, there is an association between gender, age and race, race, gender and state as well as age, race and state. Meanwhile, the best model for poisoning information reveals that there is relationship between circumstance of exposure, route of exposure and type of poison, location of exposure, route of exposure and type of poison, circumstance of exposure, type of exposure and route of exposure, circumstance of exposure, location of exposure and route of exposure, circumstance of exposure, type of exposure and type of poison and also type of exposure, location of exposure and type of poison. Keywords: log-linear; demographic; gender; age; race; state; circumstance of exposure; type of exposure; location of exposure; route of exposure; types of poison


2016 ◽  
Vol 16 (1) ◽  
pp. 264-273
Author(s):  
Justyna Brzezińska

Abstract A log-linear analysis is a method providing a comprehensive scheme to describe the association for categorical variables in a contingency table. The log-linear model specifies how the expected counts depend on the levels of the categorical variables for these cells and provide detailed information on the associations. The aim of this paper is to present theoretical, as well as empirical, aspects of ordinal log-linear models used for contingency tables with ordinal variables. We introduce log-linear models for ordinal variables: linear-by-linear association, row effect model, column effect model and RC Goodman’s model. Algorithm, advantages and disadvantages will be discussed in the paper. An empirical analysis will be conducted with the use of R.


2004 ◽  
Vol 8 (2) ◽  
pp. 67-86 ◽  
Author(s):  
Eric J. Beh ◽  
Pamela J. Davy

Log-linear modeling is a popular statistical tool for analysing a contingency table. This presentation focuses on an alternative approach to modeling ordinal categorical data. The technique, based on orthogonal polynomials, provides a much simpler method of model fitting than the conventional approach of maximum likelihood estimation, as it does not require iterative calculations nor the fitting and re-fitting to search for the best model. Another advantage is that quadratic and higher order effects can readily be included, in contrast to conventional log-linear models which incorporate linear terms only.The focus of the discussion is the application of the new parameter estimation technique to multi-way contingency tables with at least one ordered variable. This will also be done by considering singly and doubly ordered two-way contingency tables. It will be shown by example that the resulting parameter estimates are numerically similar to corresponding maximum likelihood estimates for ordinal log-linear models.


1983 ◽  
Vol 15 (6) ◽  
pp. 801-813 ◽  
Author(s):  
B Fingleton

Log-linear models are an appropriate means of determining the magnitude and direction of interactions between categorical variables that in common with other statistical models assume independent observations. Spatial data are often dependent rather than independent and thus the analysis of spatial data by log-linear models may erroneously detect interactions between variables that are spurious and are the consequence of pairwise correlations between observations. A procedure is described in this paper to accommodate these effects that requires only very minimal assumptions about the nature of the autocorrelation process given systematic sampling at intersection points on a square lattice.


Author(s):  
Necva Bölücü ◽  
Burcu Can

Part of speech (PoS) tagging is one of the fundamental syntactic tasks in Natural Language Processing, as it assigns a syntactic category to each word within a given sentence or context (such as noun, verb, adjective, etc.). Those syntactic categories could be used to further analyze the sentence-level syntax (e.g., dependency parsing) and thereby extract the meaning of the sentence (e.g., semantic parsing). Various methods have been proposed for learning PoS tags in an unsupervised setting without using any annotated corpora. One of the widely used methods for the tagging problem is log-linear models. Initialization of the parameters in a log-linear model is very crucial for the inference. Different initialization techniques have been used so far. In this work, we present a log-linear model for PoS tagging that uses another fully unsupervised Bayesian model to initialize the parameters of the model in a cascaded framework. Therefore, we transfer some knowledge between two different unsupervised models to leverage the PoS tagging results, where a log-linear model benefits from a Bayesian model’s expertise. We present results for Turkish as a morphologically rich language and for English as a comparably morphologically poor language in a fully unsupervised framework. The results show that our framework outperforms other unsupervised models proposed for PoS tagging.


Author(s):  
Zubair Ahmad Ahmad ◽  
Eisa Mahmoudi Mahmoudi ◽  
G. G. Hamedani

Actuaries are often in search of nding an adequate loss model in the scenario of actuarial and financial risk management problems. In this work, we propose a new approach to obtain a new class of loss distributions. A special sub-model of the proposed family, called the Weibull-loss model isconsidered in detail. Some mathematical properties are derived and maximum likelihood estimates of the model parameters are obtained. Certain characterizations of the proposed family are also provided. A simulation study is done to evaluate the performance of the maximum likelihood estimators. Finally, an application of the proposed model to the vehicle insurance loss data set is presented.


2020 ◽  
Vol 9 (1) ◽  
pp. 61-81
Author(s):  
Lazhar BENKHELIFA

A new lifetime model, with four positive parameters, called the Weibull Birnbaum-Saunders distribution is proposed. The proposed model extends the Birnbaum-Saunders distribution and provides great flexibility in modeling data in practice. Some mathematical properties of the new distribution are obtained including expansions for the cumulative and density functions, moments, generating function, mean deviations, order statistics and reliability. Estimation of the model parameters is carried out by the maximum likelihood estimation method. A simulation study is presented to show the performance of the maximum likelihood estimates of the model parameters. The flexibility of the new model is examined by applying it to two real data sets.


Sign in / Sign up

Export Citation Format

Share Document