Improved imputation methods for missing data in two-occasion successive sampling

Communication in Statistics- Theory and Methods ◽

10.1080/03610926.2021.1944211 ◽

2021 ◽

pp. 1-20

Author(s):

Garib Nath Singh ◽

Ashok Kumar Jaiswal ◽

Awadhesh K. Pandey

Keyword(s):

Missing Data ◽

Successive Sampling ◽

Imputation Methods

Download Full-text

Some imputation methods to deal with the problems of missing data in two-occasion successive sampling

Communications in Statistics - Simulation and Computation ◽

10.1080/03610918.2018.1563153 ◽

2019 ◽

pp. 1-24 ◽

Author(s):

Garib Nath Singh ◽

Mohd Khalid ◽

Jong-Min Kim

Keyword(s):

Missing Data ◽

Successive Sampling ◽

Imputation Methods

Download Full-text

Some imputation methods to deal with the issue of missing data problems due to random non-response in two-occasion successive sampling

Communications in Statistics - Simulation and Computation ◽

10.1080/03610918.2020.1828920 ◽

2020 ◽

pp. 1-21

Author(s):

Mohd Khalid ◽

Garib Nath Singh

Keyword(s):

Missing Data ◽

Successive Sampling ◽

Imputation Methods

Download Full-text

Comparison of five imputation methods in handling missing data in a continuous frequency table

10.1063/5.0053286 ◽

2021 ◽

Author(s):

M. B. Mohammed ◽

H. S. Zulkafli ◽

M. B. Adam ◽

N. Ali ◽

I. A. Baba

Keyword(s):

Missing Data ◽

Imputation Methods ◽

Frequency Table ◽

Continuous Frequency

Download Full-text

Technical Review : Performance of Existing Imputation Methods for Missing Data in SVM Ensemble Creation

International Journal of Data Mining & Knowledge Management Process ◽

10.5121/ijdkp.2017.7606 ◽

2017 ◽

Vol 7 (5/6) ◽

pp. 75-91 ◽

Author(s):

Shahid Ali ◽

Simon Dacey

Keyword(s):

Missing Data ◽

Imputation Methods ◽

Technical Review

Download Full-text

On Combining Imputation Methods for Handling Missing Data

Advances in Artificial Intelligence: From Theory to Practice - Lecture Notes in Computer Science ◽

10.1007/978-3-319-60042-0_20 ◽

2017 ◽

pp. 171-181

Author(s):

Nassima Ben Hariz ◽

Hela Khoufi ◽

Ezzeddine Zagrouba

Keyword(s):

Missing Data ◽

Imputation Methods

Download Full-text

Random Forest Missing Data Imputation Methods: Implications for Predicting At-Risk Students

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-49342-4_29 ◽

2020 ◽

pp. 298-308

Author(s):

Bevan I. Smith ◽

Charles Chimedza ◽

Jacoba H. Bührmann

Keyword(s):

At Risk ◽

Missing Data ◽

Random Forest ◽

At Risk Students ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods

Download Full-text

Comparison of Imputation Methods on Retrospective Breast Cancer Data in Tanzania: A Case Study of Muhimbili and Ocean Road Hospitals

10.21203/rs.3.rs-820770/v1 ◽

2021 ◽

Author(s):

Rahibu A. Abassi ◽

Amina S. Msengwa ◽

Rocky R. J. Akarro

Keyword(s):

Breast Cancer ◽

Logistic Regression ◽

Missing Data ◽

Binary Logistic Regression ◽

Breast Cancer Dataset ◽

Cancer Dataset ◽

Imputation Methods ◽

Multiple Imputations ◽

Predictive Mean Matching ◽

Mean Square Errors

Abstract Background Clinical data are at risk of having missing or incomplete values for several reasons including patients’ failure to attend clinical measurements, wrong interpretations of measurements, and measurement recorder’s defects. Missing data can significantly affect the analysis and results might be doubtful due to bias caused by omission of missed observation during statistical analysis especially if a dataset is considerably small. The objective of this study is to compare several imputation methods in terms of efficiency in filling-in the missing data so as to increase the prediction and classification accuracy in breast cancer dataset. Methods Five imputation methods namely series mean, k-nearest neighbour, hot deck, predictive mean matching, and multiple imputations were applied to replace the missing values to the real breast cancer dataset. The efficiency of imputation methods was compared by using the Root Mean Square Errors and Mean Absolute Errors to obtain a suitable complete dataset. Binary logistic regression and linear discrimination classifiers were applied to the imputed dataset to compare their efficacy on classification and discrimination. Results The evaluation of imputation methods revealed that the predictive mean matching method was better off compared to other imputation methods. In addition, the binary logistic regression and linear discriminant analyses yield almost similar values on overall classification rates, sensitivity and specificity. Conclusion The predictive mean matching imputation showed higher accuracy in estimating and replacing missing/incomplete data values in a real breast cancer dataset under the study. It is a more effective and good method to handle missing data in this scenario. We recommend to replace missing data by using predictive mean matching since it is a plausible approach toward multiple imputations for numerical variables, as it improves estimation and prediction accuracy over the use complete-case analysis especially when percentage of missing data is not very small.

Download Full-text

Characterizing the effects of missing data and evaluating imputation methods for chemical prioritization applications using ToxPi

BioData Mining ◽

10.1186/s13040-018-0169-5 ◽

2018 ◽

Vol 11 (1) ◽

Author(s):

Kimberly T. To ◽

Rebecca C. Fry ◽

David M. Reif

Keyword(s):

Missing Data ◽

Imputation Methods

Download Full-text

Some Concerns About Imputation Methods for Missing Data

JAMA Psychiatry ◽

10.1001/jamapsychiatry.2021.3894 ◽

2022 ◽

Author(s):

Rie Toyomoto ◽

Satoshi Funada ◽

Toshi A. Furukawa

Keyword(s):

Missing Data ◽

Imputation Methods

Download Full-text

Missing data in longitudinal studies: Comparison of multiple imputation methods in a real clinical setting

Journal of Evaluation in Clinical Practice ◽

10.1111/jep.13376 ◽

2020 ◽

Author(s):

Rosalba Rosato ◽

Eva Pagano ◽

Silvia Testa ◽

Paolo Zola ◽

Daniela di Cuonzo

Keyword(s):

Missing Data ◽

Multiple Imputation ◽

Longitudinal Studies ◽

Clinical Setting ◽

Imputation Methods

Download Full-text