Missing data imputation using Evolutionary k- Nearest neighbor algorithm for gene expression data

2016 Sixteenth International Conference on Advances in ICT for Emerging Regions (ICTer) ◽

10.1109/icter.2016.7829911 ◽

2016 ◽

Author(s):

Hiroshi de Silva ◽

A. Shehan Perera

Keyword(s):

Gene Expression ◽

Missing Data ◽

Gene Expression Data ◽

Nearest Neighbor ◽

Expression Data ◽

Data Imputation ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Missing Data Imputation ◽

K Nearest Neighbor Algorithm

Download Full-text

MODELLING OF MISSING DATA IMPUTATION METHODS ON GENE EXPRESSION DATA

10.21506/j.ponte.2017.4.33 ◽

2017 ◽

Vol 73 (4) ◽

Author(s):

V. Sujatha ◽

Shaheda Akthar

Keyword(s):

Gene Expression ◽

Missing Data ◽

Gene Expression Data ◽

Expression Data ◽

Data Imputation ◽

Missing Data Imputation ◽

Imputation Methods

Download Full-text

TOBMI: trans-omics block missing data imputation using a k-nearest neighbor weighted approach

Bioinformatics ◽

10.1093/bioinformatics/bty796 ◽

2018 ◽

Vol 35 (8) ◽

pp. 1278-1283 ◽

Author(s):

Xuesi Dong ◽

Lijuan Lin ◽

Ruyang Zhang ◽

Yang Zhao ◽

David C Christiani ◽

...

Keyword(s):

Missing Data ◽

Nearest Neighbor ◽

Data Imputation ◽

K Nearest Neighbor ◽

Missing Data Imputation

Download Full-text

K-Nearest Neighbor (K-NN) based Missing Data Imputation

2019 5th International Conference on Science in Information Technology (ICSITech) ◽

10.1109/icsitech46713.2019.8987530 ◽

2019 ◽

Author(s):

Della Murbarani Prawidya Murti ◽

Utomo Pujianto ◽

Aji Prasetya Wibawa ◽

Muhammad Iqbal Akbar

Keyword(s):

Missing Data ◽

Nearest Neighbor ◽

Data Imputation ◽

K Nearest Neighbor ◽

Missing Data Imputation

Download Full-text

Semi-supervised Naive Hubness Bayesian k-Nearest Neighbor for Gene Expression Data

Advances in Intelligent Systems and Computing - Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015 ◽

10.1007/978-3-319-26227-7_10 ◽

2016 ◽

pp. 101-110

Author(s):

Krisztian Buza

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Nearest Neighbor ◽

Expression Data ◽

K Nearest Neighbor

Download Full-text

ReliefSeq: A Gene-Wise Adaptive-K Nearest-Neighbor Feature Selection Tool for Finding Gene-Gene Interactions and Main Effects in mRNA-Seq Gene Expression Data

PLoS ONE ◽

10.1371/journal.pone.0081527 ◽

2013 ◽

Vol 8 (12) ◽

pp. e81527 ◽

Author(s):

Brett A. McKinney ◽

Bill C. White ◽

Diane E. Grill ◽

Peter W. Li ◽

Richard B. Kennedy ◽

...

Keyword(s):

Gene Expression ◽

Feature Selection ◽

Gene Expression Data ◽

Nearest Neighbor ◽

Gene Interactions ◽

Expression Data ◽

K Nearest Neighbor ◽

Selection Tool ◽

Download Full-text

Evolutionary k-nearest neighbor imputation algorithm for gene expression data

International Journal on Advances in ICT for Emerging Regions (ICTer) ◽

10.4038/icter.v10i1.7183 ◽

2018 ◽

Vol 10 (1) ◽

pp. 11

Author(s):

Hiroshi De Silva ◽

A. Shehan Perera

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Nearest Neighbor ◽

Expression Data ◽

K Nearest Neighbor ◽

Nearest Neighbor Imputation

Download Full-text

Gene Assessment and Sample Classification for Gene Expression Data Using a Genetic Algorithm / k-nearest Neighbor Method

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207013330733 ◽

2001 ◽

Vol 4 (8) ◽

pp. 727-739 ◽

Author(s):

Leping Li ◽

Thomas Darden ◽

Clarice Weingberg ◽

A. Levine ◽

Lee Pedersen

Keyword(s):

Gene Expression ◽

Genetic Algorithm ◽

Gene Expression Data ◽

Nearest Neighbor ◽

Expression Data ◽

K Nearest Neighbor ◽

Sample Classification

Download Full-text

Missing Data Imputation for Geolocation-based Price Prediction Using KNN–MCF Method

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9040227 ◽

2020 ◽

Vol 9 (4) ◽

pp. 227

Author(s):

Karshiev Sanjar ◽

Olimov Bekhzod ◽

Jaesoo Kim ◽

Anand Paul ◽

Jeonghong Kim

Keyword(s):

Missing Data ◽

Nearest Neighbor ◽

House Price ◽

Economic Policies ◽

K Nearest Neighbor ◽

Text Data ◽

Missing Data Imputation ◽

National Economic ◽

K Nearest Neighbor Algorithm ◽

Accurate house price forecasts are very important for formulating national economic policies. In this paper, we offer an effective method to predict houses’ sale prices. Our algorithm includes one-hot encoding to convert text data into numeric data, feature correlation to select only the most correlated variables, and a technique to overcome the missing data. Our approach is an effective way to handle missing data in large datasets with the K-nearest neighbor algorithm based on the most correlated features (KNN–MCF). As far as we are concerned, there has been no previous research that has focused on important features dealing with missing observations. Compared to the typical machine learning prediction algorithms, the prediction accuracy of the proposed method is 92.01% with the random forest algorithm, which is more efficient than the other methods.

Download Full-text

Data Imputation Methods for Missing Values in the Context of Clustering

Big Data and Knowledge Sharing in Virtual Organizations - Advances in Knowledge Acquisition, Transfer, and Management ◽

10.4018/978-1-5225-7519-1.ch011 ◽

2019 ◽

pp. 240-274

Author(s):

Mehmet S. Aktaş ◽

Sinan Kaplan ◽

Hasan Abacı ◽

Oya Kalipsiz ◽

Utku Ketenci ◽

...

Keyword(s):

Missing Data ◽

Expectation Maximization ◽

Missing Values ◽

Nearest Neighbor ◽

Real Life ◽

Data Imputation ◽

K Nearest Neighbor ◽

Missing Data Imputation ◽

Data Scarcity ◽

Imputation Methods

Missing data is a common problem for data clustering quality. Most real-life datasets have missing data, which in turn has some effect on clustering tasks. This chapter investigates the appropriate data treatment methods for varying missing data scarcity distributions including gamma, Gaussian, and beta distributions. The analyzed data imputation methods include mean, hot-deck, regression, k-nearest neighbor, expectation maximization, and multiple imputation. To reveal the proper methods to deal with missing data, data mining tasks such as clustering is utilized for evaluation. With the experimental studies, this chapter identifies the correlation between missing data imputation methods and missing data distributions for clustering tasks. The results of the experiments indicated that expectation maximization and k-nearest neighbor methods provide best results for varying missing data scarcity distributions.

Download Full-text

Grey Relational Analysis Based k Nearest Neighbor Missing Data Imputation for Software Quality Datasets

2016 IEEE International Conference on Software Quality, Reliability and Security (QRS) ◽

10.1109/qrs.2016.20 ◽

2016 ◽

Author(s):

Jianglin Huang ◽

Hongyi Sun

Keyword(s):

Missing Data ◽

Software Quality ◽

Grey Relational Analysis ◽

Nearest Neighbor ◽

Data Imputation ◽

K Nearest Neighbor ◽

Missing Data Imputation ◽

Relational Analysis ◽

Grey Relational

Download Full-text