Estimation and Inference Are Missing Data Problems: Unifying Social Science Statistics via Bayesian Simulation

2000 ◽  
Vol 8 (4) ◽  
pp. 307-332 ◽  
Author(s):  
Simon Jackman

Bayesian simulation is increasingly exploited in the social sciences for estimation and inference of model parameters. But an especially useful (if often overlooked) feature of Bayesian simulation is that it can be used to estimate any function of model parameters, including “auxiliary” quantities such as goodness-of-fit statistics, predicted values, and residuals. Bayesian simulation treats these quantities as if they were missing data, sampling from their implied posterior densities. Exploiting this principle also lets researchers estimate models via Bayesian simulation where maximum-likelihood estimation would be intractable. Bayesian simulation thus provides a unified solution for quantitative social science. I elaborate these ideas in a variety of contexts: these include generalized linear models for binary responses using data on bill cosponsorship recently reanalyzed in Political Analysis, item—response models for the measurement of respondent's levels of political information in public opinion surveys, the estimation and analysis of legislators' ideal points from roll-call data, and outlier-resistant regression estimates of incumbency advantage in U.S. Congressional elections

2013 ◽  
Vol 2013 ◽  
pp. 1-13 ◽  
Author(s):  
Helena Mouriño ◽  
Maria Isabel Barão

Missing-data problems are extremely common in practice. To achieve reliable inferential results, we need to take into account this feature of the data. Suppose that the univariate data set under analysis has missing observations. This paper examines the impact of selecting an auxiliary complete data set—whose underlying stochastic process is to some extent interdependent with the former—to improve the efficiency of the estimators for the relevant parameters of the model. The Vector AutoRegressive (VAR) Model has revealed to be an extremely useful tool in capturing the dynamics of bivariate time series. We propose maximum likelihood estimators for the parameters of the VAR(1) Model based on monotone missing data pattern. Estimators’ precision is also derived. Afterwards, we compare the bivariate modelling scheme with its univariate counterpart. More precisely, the univariate data set with missing observations will be modelled by an AutoRegressive Moving Average (ARMA(2,1)) Model. We will also analyse the behaviour of the AutoRegressive Model of order one, AR(1), due to its practical importance. We focus on the mean value of the main stochastic process. By simulation studies, we conclude that the estimator based on the VAR(1) Model is preferable to those derived from the univariate context.


2015 ◽  
Vol 58 (3) ◽  
pp. 674-690 ◽  
Author(s):  
Jana D. Canary ◽  
Leigh Blizzard ◽  
Ronald P. Barry ◽  
David W. Hosmer ◽  
Stephen J. Quinn

2016 ◽  
Vol 11 (2) ◽  
pp. 15-34 ◽  
Author(s):  
José de Jesús Návar Cháidez ◽  
Nicolás González ◽  
José Graciano

In this research, we present predictions of carbon sequestration by pines growing in reforested sites of Durango, Mexico. Four methodologies to predict carbon stocks in standing aboveground biomass were tested. Two models at the whole stand scale and two hybrid models between whole stand, stand class, and individual trees were fitted. A chronosequence of 23 small-scale reforested sites and stem analysis conducted on 60 trees were used to fit model parameters and estimate goodness of fit statistics. A stand class model produced a better fit to measure carbon stocks in aboveground standing biomass. 3 Reforested sites with native pine species are sequestering carbon at differential rates partially explained by density, species, micro site, climate and age of pines. Society is benefiting from the environmental services provided by carbon sequestration and conservation of native coniferous forests.


2013 ◽  
Vol 853 ◽  
pp. 590-595
Author(s):  
Shi Fu Zhang ◽  
Tian Min Liu ◽  
Qi Xin Zhang ◽  
Xiu Mei Shu ◽  
Shi Qiang Song ◽  
...  

Collapsible fabric tank which is an important part of oil equipment equips army with large reserves. The evaluation of collapsible fabric tank storage life, lacking of corresponding test data support, is on the conservative side, which causes manage and storage uncertainty to army. Based on reliability theory, storage life evaluation method of collapsible fabric tank is established. Failure data of collapsible fabric tank under several accelerated stress levels is obtained. According to probability distribution hypothesis test, Weibull distribution has better goodness of fit and life prediction model is obtained. A new maximum likelihood estimation is proposed as statistical analysis method and model parameters are evaluated. Storage life and reliability index of collapsible fabric tank under normal storage temperature are evaluated which brings collapsible fabric tank storage life potential into play and improves use economy.


2018 ◽  
Vol 13 (2) ◽  
Author(s):  
Gary Venter

Abstract Bayesian regularization, a relatively new method for estimating model parameters, shrinks estimates towards the overall mean by shrinking the parameters. It has been proven to lower estimation and prediction variances from those of MLE for linear models, such as regression or GLM. It has a goodness-of-fit measure, and can readily be applied using available software. This can be used for any type of actuarial linear modeling, but it is slightly more complicated for mortality and loss reserving models that use row, column, and diagonal effects for array data. These are called age-period-cohort, or APC models by statisticians. The problem is that the row, column and diagonal effects are not what should be shrunk. These models can easily become over-parameterized, and actuaries often reduce parameters with smooth curves or cubic splines. We discuss an alternative smoothing method that uses regularization, with its reduction in estimation errors, and illustrate both its classical and Bayesian forms and their application to APC modeling. Typical actuarial models and some generalizations are used as examples.


Author(s):  
Arun Kumar Chaudhary ◽  
Vijay Kumar

Here, in this paper, a continuous distribution called ArcTan Lomax distribution with three-parameter has been introduced along with some relevant properties of statistics and mathematics pertaining to the distribution. With the help of three established estimations methods including maximum likelihood estimation (MLE), estimation of the presented distribution’s model parameters is done. Also with the help of a real set of data, the distribution’s goodness-of-fit is examined in contrast to some established models in survival analysis.


Author(s):  
Muhammad Aslam ◽  
Zawar Hussain ◽  
Zahid Asghar

In this article, we propose a new family of distributions using the T-X family named as modified generalized Marshall-Olkin family of distributions. Comprehensive mathematical and statistical properties of this family of distributions are provided. The model parameters are estimated by maximum likelihood method. The maximum likelihood estimation under Type-II censoring is also discussed. Two lifetime data sets are used to show the suitability and applicability of the new family of distributions. For comparison purposes, different goodness of fit tests are used.  


2014 ◽  
Vol 2014 ◽  
pp. 1-15 ◽  
Author(s):  
Virginie Konlack Socgnia ◽  
Diane Wilcox

We discuss the calibration of the univariate and multivariate generalized hyperbolic distributions, as well as their hyperbolic, variance gamma, normal inverse Gaussian, and skew Student’st-distribution subclasses for the daily log-returns of seven of the most liquid mining stocks listed on the Johannesburg Stocks Exchange. To estimate the model parameters from historic distributions, we use an expectation maximization based algorithm for the univariate case and a multicycle expectation conditional maximization estimation algorithm for the multivariate case. We assess the goodness of fit statistics using the log-likelihood, the Akaike information criterion, and the Kolmogorov-Smirnov distance. Finally, we inspect the temporal stability of parameters and note implications as criteria for distinguishing between models. To better understand the dependence structure of the stocks, we fit the MGHD and subclasses to both the stock returns and the two leading principal components derived from the price data. While the MGHD could fit both data subsets, we observed that the multivariate normality of the stock return residuals, computed by removing shared components, suggests that the departure from normality can be explained by the structure in the common factors.


In this article, we have introduced a new distribution based on type I half logistic-G family and exponential extension as a base distribution known as Half Logistic Exponential Extension (HLEE) distribution. The statistical properties of this model are also explored, such as the behavior of probability density, hazard rate, and quantile functions are investigated. The Maximum likelihood estimation (MLE) method is used to estimate model parameters. For the potentiality of the proposed model we have compared the goodness of fit with some others models. We have proven the importance and flexibility of the new distribution in modeling with real data applications empirically.


2017 ◽  
Vol 2017 ◽  
pp. 1-14 ◽  
Author(s):  
Xiao Hu ◽  
Yufeng Zhang ◽  
Li Deng ◽  
Guanghui Cai ◽  
Qinghui Zhang ◽  
...  

Objective. This paper presents an assessment of physical meanings of parameter and goodness of fit for homodyned K (HK) distribution modeling ultrasonic speckles from scatterer distributions with wide-varying spatial organizations. Methods. A set of 3D scatterer phantoms based on gamma distributions is built to be implemented from the clustered to random to uniform scatterer distributions continuously. The model parameters are obtained by maximum likelihood estimation (MLE) from statistical histograms of the ultrasonic envelope data and then compared with those by the optimally fitting models chosen from three single distributions. Results show that the parameters of the HK distribution still present their respective physical meanings of independent contributions in the scatterer distributions. Moreover, the HK distribution presents better goodness of fit with a maximum relative MLE difference of 6.23% for random or clustered scatterers with a well-organized periodic structure. Experiments based on ultrasonic envelope data from common carotid arterial B-mode images of human subjects validate the modeling performance of HK distribution. Conclusion. We conclude that the HK model for ultrasonic speckles is a better choice for characterizing tissue with a wide variety of spatial organizations, especially the emphasis on the goodness of fit for the tissue in practical applications.


Sign in / Sign up

Export Citation Format

Share Document