MODELLING OF MISSING DATA IMPUTATION METHODS ON GENE EXPRESSION DATA

Mapping Intimacies ◽

10.21506/j.ponte.2017.4.33 ◽

2017 ◽

Vol 73 (4) ◽

Author(s):

V. Sujatha ◽

Shaheda Akthar

Keyword(s):

Gene Expression ◽

Missing Data ◽

Gene Expression Data ◽

Expression Data ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods

Download Full-text

Missing data imputation using Evolutionary k- Nearest neighbor algorithm for gene expression data

2016 Sixteenth International Conference on Advances in ICT for Emerging Regions (ICTer) ◽

10.1109/icter.2016.7829911 ◽

2016 ◽

Author(s):

Hiroshi de Silva ◽

A. Shehan Perera

Keyword(s):

Gene Expression ◽

Missing Data ◽

Gene Expression Data ◽

Nearest Neighbor ◽

Expression Data ◽

Data Imputation ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Missing Data Imputation ◽

K Nearest Neighbor Algorithm

Download Full-text

Impact of missing data imputation methods on gene expression clustering and classification

BMC Bioinformatics ◽

10.1186/s12859-015-0494-3 ◽

2015 ◽

Vol 16 (1) ◽

Author(s):

Marcilio CP de Souto ◽

Pablo A Jaskowiak ◽

Ivan G Costa

Keyword(s):

Gene Expression ◽

Missing Data ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods ◽

Clustering And Classification

Download Full-text

Random Forest Missing Data Imputation Methods: Implications for Predicting At-Risk Students

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-49342-4_29 ◽

2020 ◽

pp. 298-308

Author(s):

Bevan I. Smith ◽

Charles Chimedza ◽

Jacoba H. Bührmann

Keyword(s):

At Risk ◽

Missing Data ◽

Random Forest ◽

At Risk Students ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods

Download Full-text

Long-term trends of wet inorganic nitrogen deposition in Rocky Mountain National Park: Influence of missing data imputation methods and associated uncertainty

The Science of The Total Environment ◽

10.1016/j.scitotenv.2019.06.104 ◽

2019 ◽

Vol 687 ◽

pp. 817-826 ◽

Author(s):

Bret A. Schichtel ◽

Kristi A. Gebhart ◽

Kristi H. Morris ◽

James R. Cheatham ◽

John Vimont ◽

...

Keyword(s):

Missing Data ◽

National Park ◽

Inorganic Nitrogen ◽

Rocky Mountain ◽

Rocky Mountain National Park ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods ◽

Long Term Trends

Download Full-text

Missing Data Imputation in Internet of Things Gateways

Information ◽

10.3390/info12100425 ◽

2021 ◽

Vol 12 (10) ◽

pp. 425

Author(s):

Cinthya M. França ◽

Rodrigo S. Couto ◽

Pedro B. Velloso

Keyword(s):

Neural Network ◽

Missing Data ◽

Internet Of Things ◽

Execution Time ◽

Regression Models ◽

Weather Data ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods ◽

The Neural Network

In an Internet of Things (IoT) environment, sensors collect and send data to application servers through IoT gateways. However, these data may be missing values due to networking problems or sensor malfunction, which reduces applications’ reliability. This work proposes a mechanism to predict and impute missing data in IoT gateways to achieve greater autonomy at the network edge. These gateways typically have limited computing resources. Therefore, the missing data imputation methods must be simple and provide good results. Thus, this work presents two regression models based on neural networks to impute missing data in IoT gateways. In addition to the prediction quality, we analyzed both the execution time and the amount of memory used. We validated our models using six years of weather data from Rio de Janeiro, varying the missing data percentages. The results show that the neural network regression models perform better than the other imputation methods analyzed, based on the averages and repetition of previous values, for all missing data percentages. In addition, the neural network models present a short execution time and need less than 140 KiB of memory, which allows them to run on IoT gateways.

Download Full-text

Microarray missing values imputation methods: Critical analysis review

Computer Science and Information Systems ◽

10.2298/csis0902165h ◽

2009 ◽

Vol 6 (2) ◽

pp. 165-190 ◽

Author(s):

Mou'ath Hourani ◽

Emary El

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Missing Values ◽

Estimation Method ◽

Least Square ◽

Support Vector ◽

Expression Data ◽

Imputation Methods ◽

Value Estimation

Gene expression data often contain missing expression values. For the purpose of conducting an effective clustering analysis and since many algorithms for gene expression data analysis require a complete matrix of gene array values, choosing the most effective missing value estimation method is necessary. In this paper, the most commonly used imputation methods from literature are critically reviewed and analyzed to explain the proper use, weakness and point the observations on each published method. From the conducted analysis, we conclude that the Local Least Square (LLS) and Support Vector Regression (SVR) algorithms have achieved the best performances. SVR can be considered as a complement algorithm for LLS especially when applied to noisy data. However, both algorithms suffer from some deficiencies presented in choosing the value of Number of Selected Genes (K) and the appropriate kernel function. To overcome these drawbacks, the need for new method that automatically chooses the parameters of the function and it also has an appropriate computational complexity is imperative.

Download Full-text

Data Quality Evaluation, Outlier Detection and Missing Data Imputation Methods for IoT in Smart Cities

Studies in Computational Intelligence - Machine Intelligence and Data Analytics for Sustainable Future Smart Cities ◽

10.1007/978-3-030-72065-0_1 ◽

2021 ◽

pp. 1-18

Author(s):

Vera Van Zoest ◽

Xiuming Liu ◽

Edith Ngai

Keyword(s):

Missing Data ◽

Data Quality ◽

Outlier Detection ◽

Quality Evaluation ◽

Smart Cities ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods

Download Full-text

A Comparison of Missing Data Imputation Methods in Within-Subject Repeated Measure Design

10.14416/j.kmutnb.2020.11.003 ◽

2020 ◽

Author(s):

Nalattaporn Roopmok ◽

Kamolchanok Panishkan

Keyword(s):

Missing Data ◽

Repeated Measure ◽

Data Imputation ◽

Repeated Measure Design ◽

Missing Data Imputation ◽

Imputation Methods ◽

Download Full-text

Evaluation of Missing Data Imputation Methods for an Enhanced Distributed PV Generation Prediction

Advances in Intelligent Systems and Computing - Proceedings of the Future Technologies Conference (FTC) 2019 ◽

10.1007/978-3-030-32520-6_43 ◽

2019 ◽

pp. 590-609 ◽

Author(s):

Aditya Sundararajan ◽

Arif I. Sarwat

Keyword(s):

Missing Data ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods ◽

Download Full-text

Use Case and Performance Analyses for Missing Data Imputation Methods in Big Data Analytics

Proceedings of 2020 the 6th International Conference on Computing and Data Engineering ◽

10.1145/3379247.3379270 ◽

2020 ◽

Author(s):

Lan Yang ◽

Jason Amaro Chiang

Keyword(s):

Big Data ◽

Missing Data ◽

Data Analytics ◽

Big Data Analytics ◽

Use Case ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods ◽

Performance Analyses ◽

And Performance

Download Full-text