Missing data techniques for multilevel data: implications of model misspecification

Background Multilevel data can be missing at the individual level or at a nested level, such as family, classroom, or program site. Increased knowledge of higher-level missing data is necessary to develop evaluation design and statistical methods to address it. Methods Participants included 9,514 individuals participating in 47 youth and family programs nationwide who completed multiple self-report measures before and after program participation. Data were marked as missing or not missing at the item, scale, and wave levels for both individuals and program sites. Results Site-level missing data represented a substantial portion of missing data, ranging from 0–46% of missing data at pre-test and 35–71% of missing data at post-test. Youth were the most likely to be missing data, although site-level data did not differ by the age of participants served. In this dataset youth had the most surveys to complete, so their missing data could be due to survey fatigue. Conclusions Much of the missing data for individuals can be explained by the site not administering those questions or scales. These results suggest a need for statistical methods that account for site-level missing data, and for research design methods to reduce the prevalence of site-level missing data or reduce its impact. Researchers can generate buy-in with sites during the community collaboration stage, assessing problematic items for revision or removal and need for ongoing site support, particularly at post-test. We recommend that researchers conducting multilevel data report the amount and mechanism of missing data at each level.

Download Full-text

PROSPECT features and their application to missing data techniques for robust speech recognition

10.21437/interspeech.2004-93 ◽

2004 ◽

Cited By ~ 2

Author(s):

Hugo Van hamme

Keyword(s):

Speech Recognition ◽

Missing Data ◽

Robust Speech Recognition ◽

Missing Data Techniques

Download Full-text

Compensation of partly reliable components for band-limited speech recognition with missing data techniques

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2011.5947598 ◽

2011 ◽

Author(s):

Yongjun He ◽

Jiqing Han ◽

Tieran Zheng ◽

Guibin Zheng

Keyword(s):

Speech Recognition ◽

Missing Data ◽

Band Limited ◽

Missing Data Techniques

Download Full-text

Classification with Incomplete Data

Handbook of Research on Machine Learning Applications and Trends ◽

10.4018/978-1-60566-766-9.ch007 ◽

2010 ◽

pp. 147-175 ◽

Cited By ~ 1

Author(s):

Pedro J. García-Laencina ◽

Juan Morales-Sánchez ◽

Rafael Verdú-Monedero ◽

Jorge Larrey-Ruiz ◽

José-Luis Sancho-Gómez ◽

...

Keyword(s):

Machine Learning ◽

Missing Data ◽

Pattern Classification ◽

Incomplete Data ◽

Data Handling ◽

Advantages And Disadvantages ◽

Word Classification ◽

Classification Tasks ◽

Missing Data Techniques ◽

Fundamental Requirement

Many real-word classification scenarios suffer a common drawback: missing, or incomplete, data. The ability of missing data handling has become a fundamental requirement for pattern classification because the absence of certain values for relevant data attributes can seriously affect the accuracy of classification results. This chapter focuses on incomplete pattern classification. The research works on this topic currently grows wider and it is well known how useful and efficient are most of the solutions based on machine learning. This chapter analyzes the most popular and proper missing data techniques based on machine learning for solving pattern classification tasks, trying to highlight their advantages and disadvantages.

Download Full-text