scholarly journals Weight Smoothing for Generalized Linear Models Using a Laplace Prior

2016 ◽  
Vol 32 (2) ◽  
pp. 507-539 ◽  
Author(s):  
Xi Xia ◽  
Michael R. Elliott

Abstract When analyzing data sampled with unequal inclusion probabilities, correlations between the probability of selection and the sampled data can induce bias if the inclusion probabilities are ignored in the analysis. Weights equal to the inverse of the probability of inclusion are commonly used to correct possible bias. When weights are uncorrelated with the descriptive or model estimators of interest, highly disproportional sample designs resulting in large weights can introduce unnecessary variability, leading to an overall larger mean square error compared to unweighted methods. We describe an approach we term ‘weight smoothing’ that models the interactions between the weights and the estimators as random effects, reducing the root mean square error (RMSE) by shrinking interactions toward zero when such shrinkage is allowed by the data. This article adapts a flexible Laplace prior distribution for the hierarchical Bayesian model to gain a more robust bias-variance tradeoff than previous approaches using normal priors. Simulation and application suggest that under a linear model setting, weight-smoothing models with Laplace priors yield robust results when weighting is necessary, and provide considerable reduction in RMSE otherwise. In logistic regression models, estimates using weight-smoothing models with Laplace priors are robust, but with less gain in efficiency than in linear regression settings.

2017 ◽  
Vol 16 (4) ◽  
pp. 170-176
Author(s):  
Tharmmambal Balakrishnan ◽  
◽  
Pek Siang Edmund Teo ◽  
Wan Tin Lim ◽  
Xiao Hui Xin ◽  
...  

Coordination and consolidation of care provided in acute care hospitals need reconfiguration and reorganization to meet the demand of large number of acute admissions. We report on the effectiveness of an Acute Medical Ward AMW (AMW) receiving cases that were suspected to have infection related diagnosis on admission by Emergency Department (ED), addressing this in a large tertiary hospital in South East Asia. Mean Length of Stay (LOS) was compared using Gamma Generalized Linear Models with Log-link while odds of readmissions and mortality were compared using logistic regression models. The LOS (mean: 5.8 days, SD: 9.1 days) of all patients admitted to AMW was similar to discharge diagnosis-matched general ward (GW) patients admitted before AMW implementation, readmission rates were lower (15-day: 5.3%, 30-day: 8.1%). Bivariate and multivariate models revealed that mean LOS after AMW implementation was not significantly different from before AMW implementation (Ratio: 0.99, p=0.473). Our AMW had reduced readmission rates for patients with infection but has not made an overall impact on the LOS and readmission rates for the epartment as a whole.


2019 ◽  
Vol 8 (7) ◽  
pp. 1010 ◽  
Author(s):  
Bastian Kochlik ◽  
Wolfgang Stuetz ◽  
Karine Pérès ◽  
Catherine Féart ◽  
Jesper Tegner ◽  
...  

Frailty and sarcopenia are characterized by a loss of muscle mass and functionality and are diagnosed mainly by functional tests and imaging parameters. However, more muscle specific biomarkers are needed to improve frailty diagnosis. Plasma 3-methylhistidine (3-MH), as well as the 3-MH-to-creatinine (3-MH/Crea) and 3-MH-to-estimated glomerular filtration rate (3-MH/eGFR) ratios might support the diagnosis of frailty. Therefore, we investigated the cross-sectional associations between plasma 3-MH, 3-MH/Crea and 3-MH/eGFR with the frailty status of community-dwelling individuals (>65 years). 360 participants from two French cohorts of the FRAILOMIC initiative were classified into robust, pre-frail and frail according to Fried’s frailty criteria. General linear models as well as bivariate and multiple linear and logistic regression models, which were adjusted for several confounders, were applied to determine associations between biomarkers and frailty status. The present study consisted of 37.8% robust, 43.1% pre-frail and 19.2% frail participants. Frail participants had significantly higher plasma 3-MH, 3-MH/Crea and 3-MH/eGFR ratios than robust individuals, and these biomarkers were positively associated with frailty status. Additionally, the likelihood to be frail was significantly higher for every increase in 3-MH (1.31-fold) and 3-MH/GFR (1.35-fold) quintile after adjusting for confounders. We conclude that 3-MH, 3-MH/Crea and 3-MH/eGFR in plasma might be potential biomarkers to identify frail individuals or those at higher risk to be frail, and we assume that there might be biomarker thresholds to identify these individuals. However, further, especially longitudinal studies are needed.


2009 ◽  
Vol 4 (1) ◽  
pp. 7-31 ◽  
Author(s):  
D. H. Alai ◽  
M. Merz ◽  
M. V. Wüthrich

ABSTRACTThe prediction of adequate claims reserves is a major subject in actuarial practice and science. Due to their simplicity, the chain ladder (CL) and Bornhuetter–Ferguson (BF) methods are the most commonly used claims reserving methods in practice. However, in contrast to the CL method, no estimator for the conditional mean square error of prediction (MSEP) of the ultimate claim has been derived in the BF method until now, and as such, this paper aims to fill that gap. This will be done in the framework of generalized linear models (GLM) using the (overdispersed) Poisson model motivation for the use of CL factor estimates in the estimation of the claims development pattern.


Many factors have led to the increase of suicide-proneness in the present era. As a consequence, many novel methods have been proposed in recent times for prediction of the probability of suicides, using different metrics. The current work reviews a number of models and techniques proposed recently, and offers a novel Bayesian machine learning (ML) model for prediction of suicides, involving classification of the data into separate categories. The proposed model is contrasted against similar computationally-inexpensive techniques such as spline regression. The model is found to generate appreciably accurate results for the dataset considered in this work. The application of Bayesian estimation allows the prediction of causation to a greater degree than the standard spline regression models, which is reflected by the comparatively low root mean square error (RMSE) for all estimates obtained by the proposed model.


2010 ◽  
Vol 5 (1) ◽  
pp. 7-17 ◽  
Author(s):  
D. H. Alai ◽  
M. Merz ◽  
M. V. Wüthrich

AbstractWe revisit the stochastic model of Alai et al. (2009) for the Bornhuetter-Ferguson claims reserving method, Bornhuetter & Ferguson (1972). We derive an estimator of its conditional mean square error of prediction (MSEP) using an approach that is based on generalized linear models and maximum likelihood estimators for the model parameters. This approach leads to simple formulas, which can easily be implemented in a spreadsheet.


BMJ Open ◽  
2018 ◽  
Vol 8 (8) ◽  
pp. e022374
Author(s):  
Mei-Yan Xu ◽  
Bing Cao ◽  
Yan Chen ◽  
Natalie Musial ◽  
Shuai Wang ◽  
...  

ObjectiveHelicobacter pyloriinfection is a major cause of several cancers such as gastric, pancreatic and lung. The relationship betweenH. pyloriand tumour markers continues to remain unclear. The primary goal of this study is to clarify the associations betweenH. pyloriinfection and six tumour markers (ie, carcinoembryonic antigen (CEA), cancer antigen (CA) 153, CA199, CA724, CA125 and alpha-fetoprotein (AFP)). The secondary goal is to provide understanding for further research aboutH. pyloriinfection and gastrointestinal cancer.DesignObservational retrospective study.SettingThe study was performed in Beijing, China, where enrolled subjects had all passed health examinations during the period of 2012–2016. Subjects were categorised intoH. pylori(+) andH. pylori(–) group according to their infection status and the measured six biomarkers. We used logistic regression models and generalised linear models to explore the associations betweenH. pyloriinfection and six tumour markers (ie, CEA, CA153, CA199, CA724, CA125 and AFP).ParticipantsA total of 14 689 subjects were included and 6493 (44.2%) subjects were infected byH. pylori. The subjects had a mean age (1SD) of 45 (18) years. There were 4530 (31.0%) female subjects.ResultsAfter adjusting for the confounding factors, infections withH. pyloriwere found to be significantly associated with abnormal ratios in CEA, AFP and CA724 ofH. pylori(+) toH. pylori(–) groups. Significant positive correlation was found betweenH. pyloriinfection and CEA values (adjusted β=0.056; 95% CI 0.005 to 0.107; p=0.033).ConclusionsIn this observational retrospective study, we observed theH. pyloriinfections in a Chinese population and found higher CEA level inH. pylori-infected subjects and abnormal ratios in CEA, AFP and CA724 in infected subjects to uninfected subjects. These findings may provide a basis for future exploration withH. pyloriand tumour markers.


2020 ◽  
Author(s):  
Tiago Luciano Passafaro ◽  
Fernando B. Lopes ◽  
João R. R. Dórea ◽  
Mark Craven ◽  
Vivian Breen ◽  
...  

Abstract Background: Deep neural networks (DNN) are a particular case of artificial neural networks (ANN) composed by multiple hidden layers, and have recently gained attention in genome-enabled prediction of complex traits. Yet, few studies in genome-enabled prediction have assessed the performance of DNN compared to traditional regression models. Strikingly, no clear superiority of DNN has been reported so far, and results seem highly dependent on the species and traits of application. Nevertheless, the relatively small datasets used in previous studies, most with fewer than 5,000 observations may have precluded the full potential of DNN. Therefore, the objective of this study was to investigate the impact of the dataset sample size on the performance of DNN compared to Bayesian regression models for genome-enable prediction of body weight in broilers by sub-sampling 63,526 observations of the training set.Results: Predictive performance of DNN improved as sample size increased, reaching a plateau at about 0.32 of prediction correlation when 60% of the entire training set size was used (i.e., 39,510 observations). Interestingly, DNN showed superior prediction correlation using up to 3% of training set, but poorer prediction correlation after that compared to Bayesian Ridge Regression (BRR) and Bayes Cπ. Regardless the amount of data used to train the predictive machines, DNN displayed the lowest mean square error of prediction compared to all other approaches. The predictive bias was lower for DNN compared to Bayesian models regardless the amount of data used with estimates closed to one with larger sample sizes. Conclusions: DNN had worse prediction correlation compared to BRR and Bayes Cπ, but improved mean square error of prediction and bias relative to both Bayesian models for genome-enabled prediction of body weight in broilers. Such findings, highlights advantages and disadvantages between predictive approaches depending on the criterion used for comparison. Nonetheless, further analysis is necessary to detect scenarios where DNN can clearly outperform Bayesian benchmark models.


Sign in / Sign up

Export Citation Format

Share Document