A Bayesian Hierarchical Selection Model for Academic Growth With Missing Data

2017 ◽  
Vol 30 (2) ◽  
pp. 147-162
Author(s):  
Jeff Allen
2021 ◽  
Author(s):  
Mathew V Kiang ◽  
Jarvis T Chen ◽  
Nancy Krieger ◽  
Caroline O Buckee ◽  
Monica J Alexander ◽  
...  

AbstractThe ubiquity of smartphones, with their increasingly sophisticated array of sensors, presents an unprecedented opportunity for researchers to collect diverse, temporally-dense data about human behavior while minimizing participant burden. Researchers increasingly make use of smartphone applications for “digital phenotyping,” the collection of phone sensor and log data to study the lived experiences of subjects in their natural environments. While digital phenotyping has shown promise in fields such as psychiatry and neuroscience, there are fundamental gaps in our knowledge about data collection and non-collection (i.e., missing data) in smartphone-based digital phenotyping. Here, we show that digital phenotyping presents a viable method of data collection, over long time periods, across diverse study participants with a range of sociodemographic characteristics. We examined accelerometer and GPS sensor data of 211 participants, amounting to 29,500 person-days of observation, using Bayesian hierarchical negative binomial regression. We found that iOS users had higher rates of accelerometer non-collection but lower GPS non-collection than Android users. For GPS data, rates of non-collection did not differ by race/ethnicity, education, age, or gender. For accelerometer data, Black participants had higher rates of non-collection while Asian participants had slightly lower non-collection. For both sensors, non-collection increased by 0.5% to 0.9% per week. These results demonstrate the feasibility of using smartphone-based digital phenotyping across diverse populations, for extended periods of time, and within diverse cohorts. As smartphones become increasingly embedded in everyday life, the insights of this study will help guide the design, planning, and analysis of digital phenotyping studies.


2016 ◽  
Vol 38 (4) ◽  
pp. 195-206
Author(s):  
Dan Farley ◽  
Daniel Anderson ◽  
P. Shawn Irvin ◽  
Gerald Tindal

Modeling growth for students with significant cognitive disabilities (SWSCD) is difficult due to a variety of factors, including, but not limited to, missing data, test scaling, group heterogeneity, and small sample sizes. These challenges may account for the paucity of previous research exploring the academic growth of SWSCD. Our study represents a unique context in which a reading assessment, calibrated to a common scale, was administered statewide to students in consecutive years across Grades 3 to 5. We used a nonlinear latent growth curve pattern-mixture model to estimate students’ achievement and growth while accounting for patterns of missing data. While we observed significant intercept differences across disability subgroups, there were no significant slope differences. Incorporating missing data patterns into our models improved model fit. Limitations and directions for future research are discussed.


2011 ◽  
Vol 55 (1) ◽  
pp. 802-812 ◽  
Author(s):  
Hyekyung Jung ◽  
Joseph L. Schafer ◽  
Byungtae Seo

2019 ◽  
Vol 29 (5) ◽  
pp. 1354-1367
Author(s):  
Jaeil Ahn ◽  
Hye Seong Ahn

Health-related quality of life consists of multi-dimensional measurements of physical and mental health domains. Health-related quality of life is often followed up to evaluate efficacy of treatments in clinical studies. During the follow-up period, a missing data problem inevitably arises. When missing data occur for reasons related to poor health-related quality of life, a complete-case only analysis can lead to invalid inferences. We propose a Bayesian approach to analyze longitudinal moderate to high-dimensional multivariate outcome data in the presence of non-ignorable missing data. To account for non-ignorable missing data, we employ a selection model for the joint likelihood factorization where we apply Bayesian spike and slab variable selection in the missing data mechanism to detect informative factors among multiple outcomes. We model the relationship between multiple outcomes and covariates using linear mixed effects models where multiple outcome correlations are captured by a hierarchical structure. We conduct simulation studies to evaluate the performance of the proposed method compared with the conventional last observation carried forward approach. We use a motivating example that originates from a longitudinal study of quality of life in gastric cancer patients who underwent distal gastrectomy. In this application, we demonstrate that our proposed method can offer efficiency gain in the marginal associations and provide the associations between outcomes and the absence of patients' information.


Sign in / Sign up

Export Citation Format

Share Document