scholarly journals Bias-Variance Trade-Off in Continuous Test Norming

Assessment ◽  
2020 ◽  
pp. 107319112093915
Author(s):  
Lieke Voncken ◽  
Casper J. Albers ◽  
Marieke E. Timmerman

In continuous test norming, the test score distribution is estimated as a continuous function of predictor(s). A flexible approach for norm estimation is the use of generalized additive models for location, scale, and shape. It is unknown how sensitive their estimates are to model flexibility and sample size. Generally, a flexible model that fits at the population level has smaller bias than its restricted nonfitting version, yet it has larger sampling variability. We investigated how model flexibility relates to bias, variance, and total variability in estimates of normalized z scores under empirically relevant conditions, involving the skew Student t and normal distributions as population distributions. We considered both transversal and longitudinal assumption violations. We found that models with too strict distributional assumptions yield biased estimates, whereas too flexible models yield increased variance. The skew Student t distribution, unlike the Box–Cox Power Exponential distribution, appeared problematic to estimate for normally distributed data. Recommendations for empirical norming practice are provided.

2019 ◽  
Author(s):  
Lieke Voncken ◽  
Casper J Albers ◽  
Marieke Timmerman

In continuous test norming, the test score distribution is estimated as a continuous function of predictor(s). A flexible approach for norm estimation is the use of generalized additive models for location, scale, and shape (GAMLSS). It is unknown how sensitive their estimates are to model flexibility and sample size. Generally, a flexible model that fits at the population level has smaller bias than its restricted non-fitting version, yet it has larger sampling variability. We investigated how model flexibility relates to bias, variance, and total variability in estimates of normalized z scores under empirically relevant conditions, involving the skew Student t and normal distributions as population distributions. We considered both transversal and longitudinal assumption violations. We found that models with too strict distributional assumptions yield biased estimates, whereas too flexible models yield increased variance. The skew Student t distribution, unlike the BCPE distribution, appeared problematic to estimate for normally distributed data. Recommendations for empirical norming practice are provided.


2019 ◽  
Vol 29 (Supplement_4) ◽  
Author(s):  
G Nagel ◽  
R S Peter ◽  
B Föger ◽  
H Concin

Abstract Background Obesity and its health consequences will dominate health care systems in many countries during the next decades. Prevention programs have been implemented. However, the optimum body mass index (BMI) in relation to all-cause mortality on population level is still a matter of debate. Material and Method Data 1/1989-6/2005 of the Vorarlberg Health Monitoring & Prevention Program (VHM&PP,) and 8/2005-12/2015 for Vorarlberg provided by the Main Association of Austrian Social Security Institutions were analyzed. In both cohorts, information was available on age, sex, measured height and weight as well as the date and cause of death. Generalized additive models were used to model the mortality rate as function of calendar time, age and follow-up. Results The VHM&PP cohort consisted of 85,488 men and 99,873 women and the later of 129,817 men and 152,399 women. In the second cohort, men (mean age 48 (SD16.9) vs. 45.3 (SD 15.5) and women (48.3 (SD 17.7) vs. 45.6 (SD 16.6) years) were slightly older than in the VHM&PP cohort. The average BMI was slightly higher in men (26.1 (SD4.0) vs. 25.7 (SD3.8) kg/m2) but not in women (24.6 (SD 4.8) vs. 24.7 (SD 4.9) kg/m2), respectively. In the VHM&PP cohort more ever smokers were found in both men (40.3 vs. 22.4%) and women (24.8 vs. 18.4%) than in the subsequent cohort. BMI optimum increased slightly between 1985 and 2015, from 24.9 (95%-CI: 24.0-25.9) to 26.4 (25.3-27.3) in men and from 22.4 (21.8-23.1) to 23.3 (22.5-24.5) kg/m2 in women. However, age and follow-up had major impact on the increase. In younger age the associations are quite stable, while in men over 50 years and in women over 60 years the BMI optimum decreased with length of follow-up. Conclusions Overall the BMI optimum increased slightly over time. However, age and follow-up had major impact on the association. These results suggest, that prognosis of obesity related diseases has improved over time. To detangle this further research is necessary. Key messages In Austria the BMI optimum increased slightly over time. Age and follow-up time had major impact on the association.


2021 ◽  
Vol 8 (Supplement_1) ◽  
pp. S676-S677
Author(s):  
Yangyupei Yang ◽  
Simon Mutembo ◽  
Andrea Carcelen ◽  
Kyla Hayford ◽  
Francis Mwansa ◽  
...  

Abstract Background Despite the availability of safe and effective vaccines, measles and congenital rubella syndrome remain important causes of morbidity and mortality. HIV-infected individuals may be more vulnerable to measles because of poor immune responses to vaccination. Population-level estimates and comparisons of measles and rubella seroprevalence between HIV-infected and uninfected children and adults in sub-Saharan Africa are needed to guide vaccination policy and control strategies. Methods This cross-sectional study was performed by analysing a selected and weighted subsample from the Zambia Population HIV Impact Assessment survey (ZAMPHIA). ZAMPHIA was conducted in 2016 to estimate national HIV incidence and prevalence in Zambia. Dried blood spots and plasma samples were tested for IgG antibodies to measles and rubella viruses using a commercial enzyme immunoassay. We estimated national age-specific measles and rubella seroprevalence by HIV infection status using hierarchical generalized additive models. Results Specimens from 9521 HIV-uninfected (3840 children age under 10 years, 3981 youth age 10-19 years, and 1700 adults age 20-49 years) and 331 HIV-infected (53, 107, and 171 respectively) individuals were included in the study. Measles seroprevalence was lower among HIV-infected children (46.4%) compared to HIV-uninfected children (76.4%, p < 0.001). In both HIV-uninfected and HIV-infected individuals, measles seroprevalence increased steadily with age but more rapidly in the HIV-infected until about the age of 20 years when the seroprevalence was similar between the two groups. Above 20 years, measles seroprevalence was similar between HIV-infected and uninfected adults. There was no significant difference in rubella seroprevalence between HIV-infected and HIV-uninfected individuals. Figure 1. Measles and Rubella Age-specific Seroprevalence The lines represent generalized additive model fits for the mean (solid) and 95% confidence intervals (dashed). Data are grouped by age in years and year 0 includes only specimens from children 9-11 months. Rubella-containing vaccine was not available in the public sector prior to the serosurvey. Conclusion Measles seroprevalence was lower among HIV-infected than uninfected children and youth. HIV-infected children would likely benefit from revaccination. Many children were susceptible to rubella before the introduction of the combined measles and rubella vaccine in Zambia. Disclosures Kyla Hayford, PhD, MA, Pfizer, Inc. (Other Financial or Material Support, KH conducted the study and analyses while working at the Johns Hopkins School of Public Health but is an employee at Pfizer, Inc. as of 26 October 2020.)


Author(s):  
François Freddy Ateba ◽  
Manuel Febrero-Bande ◽  
Issaka Sagara ◽  
Nafomon Sogoba ◽  
Mahamoudou Touré ◽  
...  

Mali aims to reach the pre-elimination stage of malaria by the next decade. This study used functional regression models to predict the incidence of malaria as a function of past meteorological patterns to better prevent and to act proactively against impending malaria outbreaks. All data were collected over a five-year period (2012–2017) from 1400 persons who sought treatment at Dangassa’s community health center. Rainfall, temperature, humidity, and wind speed variables were collected. Functional Generalized Spectral Additive Model (FGSAM), Functional Generalized Linear Model (FGLM), and Functional Generalized Kernel Additive Model (FGKAM) were used to predict malaria incidence as a function of the pattern of meteorological indicators over a continuum of the 18 weeks preceding the week of interest. Their respective outcomes were compared in terms of predictive abilities. The results showed that (1) the highest malaria incidence rate occurred in the village 10 to 12 weeks after we observed a pattern of air humidity levels >65%, combined with two or more consecutive rain episodes and a mean wind speed <1.8 m/s; (2) among the three models, the FGLM obtained the best results in terms of prediction; and (3) FGSAM was shown to be a good compromise between FGLM and FGKAM in terms of flexibility and simplicity. The models showed that some meteorological conditions may provide a basis for detection of future outbreaks of malaria. The models developed in this paper are useful for implementing preventive strategies using past meteorological and past malaria incidence.


Author(s):  
Mark David Walker ◽  
Mihály Sulyok

Abstract Background Restrictions on social interaction and movement were implemented by the German government in March 2020 to reduce the transmission of coronavirus disease 2019 (COVID-19). Apple's “Mobility Trends” (AMT) data details levels of community mobility; it is a novel resource of potential use to epidemiologists. Objective The aim of the study is to use AMT data to examine the relationship between mobility and COVID-19 case occurrence for Germany. Is a change in mobility apparent following COVID-19 and the implementation of social restrictions? Is there a relationship between mobility and COVID-19 occurrence in Germany? Methods AMT data illustrates mobility levels throughout the epidemic, allowing the relationship between mobility and disease to be examined. Generalized additive models (GAMs) were established for Germany, with mobility categories, and date, as explanatory variables, and case numbers as response. Results Clear reductions in mobility occurred following the implementation of movement restrictions. There was a negative correlation between mobility and confirmed case numbers. GAM using all three categories of mobility data accounted for case occurrence as well and was favorable (AIC or Akaike Information Criterion: 2504) to models using categories separately (AIC with “driving,” 2511. “transit,” 2513. “walking,” 2508). Conclusion These results suggest an association between mobility and case occurrence. Further examination of the relationship between movement restrictions and COVID-19 transmission may be pertinent. The study shows how new sources of online data can be used to investigate problems in epidemiology.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Narayan Sharma ◽  
René Schwendimann ◽  
Olga Endrich ◽  
Dietmar Ausserhofer ◽  
Michael Simon

Abstract Background Understanding how comorbidity measures contribute to patient mortality is essential both to describe patient health status and to adjust for risks and potential confounding. The Charlson and Elixhauser comorbidity indices are well-established for risk adjustment and mortality prediction. Still, a different set of comorbidity weights might improve the prediction of in-hospital mortality. The present study, therefore, aimed to derive a set of new Swiss Elixhauser comorbidity weightings, to validate and compare them against those of the Charlson and Elixhauser-based van Walraven weights in an adult in-patient population-based cohort of general hospitals. Methods Retrospective analysis was conducted with routine data of 102 Swiss general hospitals (2012–2017) for 6.09 million inpatient cases. To derive the Swiss weightings for the Elixhauser comorbidity index, we randomly halved the inpatient data and validated the results of part 1 alongside the established weighting systems in part 2, to predict in-hospital mortality. Charlson and van Walraven weights were applied to Charlson and Elixhauser comorbidity indices. Derivation and validation of weightings were conducted with generalized additive models adjusted for age, gender and hospital types. Results Overall, the Elixhauser indices, c-statistic with Swiss weights (0.867, 95% CI, 0.865–0.868) and van Walraven’s weights (0.863, 95% CI, 0.862–0.864) had substantial advantage over Charlson’s weights (0.850, 95% CI, 0.849–0.851) and in the derivation and validation groups. The net reclassification improvement of new Swiss weights improved the predictive performance by 1.6% on the Elixhauser-van Walraven and 4.9% on the Charlson weights. Conclusions All weightings confirmed previous results with the national dataset. The new Swiss weightings model improved slightly the prediction of in-hospital mortality in Swiss hospitals. The newly derive weights support patient population-based analysis of in-hospital mortality and seek country or specific cohort-based weightings.


2020 ◽  
Vol 79 (Suppl 1) ◽  
pp. 1252.2-1253
Author(s):  
R. Garofoli ◽  
M. Resche-Rigon ◽  
M. Dougados ◽  
D. Van der Heijde ◽  
C. Roux ◽  
...  

Background:Axial spondyloarthritis (axSpA) is a chronic rheumatic disease that encompasses various clinical presentations: inflammatory chronic back pain, peripheral manifestations and extra-articular manifestations. The current nomenclature divides axSpA in radiographic (in the presence of radiographic sacroiliitis) and non-radiographic (in the absence of radiographic sacroiliitis, with or without MRI sacroiliitis. Given that the functional burden of the disease appears to be greater in patients with radiographic forms, it seems crucial to be able to predict which patients will be more likely to develop structural damage over time. Predictive factors for radiographic progression in axSpA have been identified through use of traditional statistical models like logistic regression. However, these models present some limitations. In order to overcome these limitations and to improve the predictive performance, machine learning (ML) methods have been developed.Objectives:To compare ML models to traditional models to predict radiographic progression in patients with early axSpA.Methods:Study design: prospective French multicentric cohort study (DESIR cohort) with 5years of follow-up. Patients: all patients included in the cohort, i.e. 708 patients with inflammatory back pain for >3 months but <3 years, highly suggestive of axSpA. Data on the first 5 years of follow-up was used. Statistical analyses: radiographic progression was defined as progression either at the spine (increase of at least 1 point per 2 years of mSASSS scores) or at the sacroiliac joint (worsening of at least one grade of the mNY score between 2 visits). Traditional modelling: we first performed a bivariate analysis between our outcome (radiographic progression) and explanatory variables at baseline to select the variables to be included in our models and then built a logistic regression model (M1). Variable selection for traditional models was performed with 2 different methods: stepwise selection based on Akaike Information Criterion (stepAIC) method (M2), and the Least Absolute Shrinkage and Selection Operator (LASSO) method (M3). We also performed sensitivity analysis on all patients with manual backward method (M4) after multiple imputation of missing data. Machine learning modelling: using the “SuperLearner” package on R, we modelled radiographic progression with stepAIC, LASSO, random forest, Discrete Bayesian Additive Regression Trees Samplers (DBARTS), Generalized Additive Models (GAM), multivariate adaptive polynomial spline regression (polymars), Recursive Partitioning And Regression Trees (RPART) and Super Learner. Finally, the accuracy of traditional and ML models was compared based on their 10-foldcross-validated AUC (cv-AUC).Results:10-fold cv-AUC for traditional models were 0.79 and 0.78 for M2 and M3, respectively. The 3 best models in the ML algorithm were the GAM, the DBARTS and the Super Learner models, with 10-fold cv-AUC of: 0.77, 0.76 and 0.74, respectively (Table 1).Table 1.Comparison of 10-fold cross-validated AUC between best traditional and machine learning models.Best modelsCross-validated AUCTraditional models M2 (step AIC method)0.79 M3 (LASSO method)0.78Machine learning approach SL Discrete Bayesian Additive Regression Trees Samplers (DBARTS)0.76 SL Generalized Additive Models (GAM)0.77 Super Learner0.74AUC: Area Under the Curve; AIC: Akaike Information Criterion; LASSO: Least Absolute Shrinkage and Selection Operator; SL: SuperLearner. N = 295.Conclusion:Traditional models predicted better radiographic progression than ML models in this early axSpA population. Further ML algorithms image-based or with other artificial intelligence methods (e.g. deep learning) might perform better than traditional models in this setting.Acknowledgments:Thanks to the French National Society of Rheumatology and the DESIR cohort.Disclosure of Interests:Romain Garofoli: None declared, Matthieu resche-rigon: None declared, Maxime Dougados Grant/research support from: AbbVie, Eli Lilly, Merck, Novartis, Pfizer and UCB Pharma, Consultant of: AbbVie, Eli Lilly, Merck, Novartis, Pfizer and UCB Pharma, Speakers bureau: AbbVie, Eli Lilly, Merck, Novartis, Pfizer and UCB Pharma, Désirée van der Heijde Consultant of: AbbVie, Amgen, Astellas, AstraZeneca, BMS, Boehringer Ingelheim, Celgene, Cyxone, Daiichi, Eisai, Eli-Lilly, Galapagos, Gilead Sciences, Inc., Glaxo-Smith-Kline, Janssen, Merck, Novartis, Pfizer, Regeneron, Roche, Sanofi, Takeda, UCB Pharma; Director of Imaging Rheumatology BV, Christian Roux: None declared, Anna Moltó Grant/research support from: Pfizer, UCB, Consultant of: Abbvie, BMS, MSD, Novartis, Pfizer, UCB


Sign in / Sign up

Export Citation Format

Share Document