A Bayesian Split Population Survival Model for Duration Data With Misclassified Failure Events

2019 ◽  
Vol 27 (4) ◽  
pp. 415-434 ◽  
Author(s):  
Benjamin E. Bagozzi ◽  
Minnie M. Joo ◽  
Bomin Kim ◽  
Bumba Mukherjee

We develop a new Bayesian split population survival model for the analysis of survival data with misclassified event failures. Within political science survival data, right-censored survival cases are often erroneously misclassified as failure cases due to measurement error. Treating these cases as failure events within survival analyses will underestimate the duration of some events. This will bias coefficient estimates, especially in situations where such misclassification is associated with covariates of interest. Our split population survival estimator addresses this challenge by using a system of two equations to explicitly model the misclassification of failure events alongside a parametric survival process of interest. After deriving this model, we use Bayesian estimation via slice sampling to evaluate its performance with simulated data, and in several political science applications. We find that our proposed “misclassified failure” survival model allows researchers to accurately account for misclassified failure events within the contexts of civil war duration and democratic survival.

2020 ◽  
Author(s):  
Paul Robert Connor ◽  
Ellen Riemke Katrien Evers

Payne, Vuletich, and Lundberg’s bias-of-crowds model proposes that a number of empirical puzzles can be resolved by conceptualizing implicit bias as a feature of situations rather than a feature of individuals. In the present article we argue against this model and propose that, given the existing evidence, implicit bias is best understood as an individual-level construct measured with substantial error. First, using real and simulated data, we show how each of Payne and colleagues’ proposed puzzles can be explained as being the result of measurement error and its reduction via aggregation. Second, we discuss why the authors’ counterarguments against this explanation have been unconvincing. Finally, we test a hypothesis derived from the bias-of-crowds model about the effect of an individually targeted “implicit-bias-based expulsion program” within universities and show the model to lack empirical support. We conclude by considering the implications of conceptualizing implicit bias as a noisily measured individual-level construct for ongoing implicit-bias research. All data and code are available at https://osf.io/tj8u6/.


1969 ◽  
Vol 21 (4) ◽  
pp. 641-654 ◽  
Author(s):  
Edward R. Tufte

Students of politics use statistical and quantitative techniques to: summarize a large body of numbers into a small collection of typical values;confirm (and perhaps sanctify) the results of the analysis by using tests of statistical significance that help protect against sampling and measurement error;discover what's going on in their data and expose some new relationships; andinform their audience what's going on in the data.


Entropy ◽  
2018 ◽  
Vol 20 (9) ◽  
pp. 642 ◽  
Author(s):  
Erlandson Saraiva ◽  
Adriano Suzuki ◽  
Luis Milan

In this paper, we study the performance of Bayesian computational methods to estimate the parameters of a bivariate survival model based on the Ali–Mikhail–Haq copula with marginal distributions given by Weibull distributions. The estimation procedure was based on Monte Carlo Markov Chain (MCMC) algorithms. We present three version of the Metropolis–Hastings algorithm: Independent Metropolis–Hastings (IMH), Random Walk Metropolis (RWM) and Metropolis–Hastings with a natural-candidate generating density (MH). Since the creation of a good candidate generating density in IMH and RWM may be difficult, we also describe how to update a parameter of interest using the slice sampling (SS) method. A simulation study was carried out to compare the performances of the IMH, RWM and SS. A comparison was made using the sample root mean square error as an indicator of performance. Results obtained from the simulations show that the SS algorithm is an effective alternative to the IMH and RWM methods when simulating values from the posterior distribution, especially for small sample sizes. We also applied these methods to a real data set.


2021 ◽  
pp. 096228022110326
Author(s):  
Kristine Gierz ◽  
Kayoung Park ◽  
Peihua Qiu

In general, the change point problem considers inference of a change in distribution for a set of time-ordered observations. This has applications in a large variety of fields, and can also apply to survival data. In survival analysis, most existing methods compare two treatment groups for the entirety of the study period. Some treatments may take a length of time to show effects in subjects. This has been called the time-lag effect in the literature, and in cases where time-lag effect is considerable, such methods may not be appropriate to detect significant differences between two groups. In this paper, we propose a novel non-parametric approach for estimating the point of treatment time-lag effect by using an empirical divergence measure. Theoretical properties of the estimator are studied. The results from the simulated data and the applications to real data examples support our proposed method.


2021 ◽  
pp. 229-244
Author(s):  
Sarah Cubaynes ◽  
Simon Galas ◽  
Myriam Richaud ◽  
Ana Sanz Aguilar ◽  
Roger Pradel ◽  
...  

Survival analyses are a key tool for demographers, ecologists, and evolutionary biologists. This chapter presents the most common methods and illustrates their use for species across the Tree of Life. It discusses the challenges associated with various types of survival data, how to model species with a complex life cycle, and includes the impact of environmental factors and individual heterogeneity. It covers the analysis of ‘known-fate’ data collected in lab conditions, using the Kaplan–Meier estimator and Cox’s proportional hazard regression analysis. Alternatively, survival data collected on free-ranging populations usually involve individuals missing at certain monitoring occasions and unknown time at death. The chapter provides an overview of capture–mark–recapture (CMR) models, from single-state to multi-state and multi-event models, and their use in animal and plant demography to estimate demographic parameters while correcting for imperfect detection of individuals. It discusses various inference frameworks available to implement CMR models using a frequentist or Bayesian approach. Only humans are an exception among free-ranging populations, with the existence of several consequent databases with perfect knowledge of age and cause of death for all individuals. The chapter presents an overview of the most common models used to describe mortality patterns over age and time using human mortality data. Throughout, focus is placed on eight case studies, which involve lab organisms, free-ranging animal populations, plant populations, and human populations. Each example includes data and codes, together with step-by-step guidance to run the survival analysis.


Author(s):  
Michael J. Crowther

In this article, I present the community-contributed stmixed command for fitting multilevel survival models. It serves as both an alternative to Stata’s official mestreg command and a complimentary command with substantial extensions. stmixed can fit multilevel survival models with any number of levels and random effects at each level, including flexible spline-based approaches (such as Royston–Parmar and the log-hazard equivalent) and user-defined hazard models. Simple or complex time-dependent effects can be included, as can expected mortality for a relative survival model. Left-truncation (delayed entry) is supported, and t-distributed random effects are provided as an alternative to Gaussian random effects. I illustrate the methods with a commonly used dataset of patients with kidney disease suffering recurrent infections and a simulated example illustrating a simple approach to simulating clustered survival data using survsim (Crowther and Lambert 2012, Stata Journal 12: 674–687; 2013, Statistics in Medicine 32: 4118–4134). stmixed is part of the merlin family (Crowther 2017, arXiv Working Paper No. arXiv:1710.02223; 2018, arXiv Working Paper No. arXiv:1806.01615).


2019 ◽  
Vol 8 (1) ◽  
pp. 55
Author(s):  
NI MADE SRI WAHYUNI ◽  
I WAYAN SUMARJAYA ◽  
NI LUH PUTU SUCIPTAWATI

Parametric survival analysis is one of the survival analysis that has a distribution of survival data that follows a certain distribution. Weibull distribution is a distribution that is often used in parametric survival analysis. The purpose of this study is to determine parametric survival models using the Weibull distribution and to determine  the factors that can influence the recovery of stroke patients. This study uses data on stroke patients in the Wangaya hospital, Denpasar in 2017. The best model obtained in this study is a model that consists of two predictor variables, namely the age and the body mass index (BMI).Therefore the  factors that can influence the recovery of stroke patients are age and BMI.


Sign in / Sign up

Export Citation Format

Share Document