scholarly journals Feature Based Multivariate Data Imputation

Author(s):  
Alessio Petrozziello ◽  
Ivan Jordanov

Organizational decisions are based on data-based-analysis and predictions. Effective decisions require accurate predictions, which in-turn depend on the quality of the data. Real time data is prone to inconsistencies, which exhibit negative impacts on the quality of the predictions. This mandates the need for data imputation techniques. This work presents a prediction-based data imputation technique, Rank Based Multivariate Imputation (RBMI) that operates on multivariate data. The proposed model is composed of the ranking phase and the imputation phase. Ranking dictates, the attribute order in which imputation is to be performed. The proposed model utilizes tree-based approach for the actual imputation process. Experiments were performed on Pima, a diabetes dataset. The data was amputed in range between 5% - 30%. The obtained results were compared with existing state-of-the-art models in terms of MAE and MSE levels. The proposed RBMI model exhibits a reduction of 0.03 in MAE levels and 0.001 in MSE levels.


2011 ◽  
Vol 27 (5) ◽  
pp. 1468-1474 ◽  
Author(s):  
Juan C. Figueroa García ◽  
Dusko Kalenatic ◽  
Cesar Amilcar Lopez Bello

1968 ◽  
Author(s):  
Gerald H. Shure ◽  
Laurence I. Press ◽  
Miles S. Rogers

Sign in / Sign up

Export Citation Format

Share Document