Effective Collaborative Filtering Approaches Based on Missing Data Imputation

Collaborative filtering (CF) is a recommendation technique that analyzes the behavior of various users and recommends the items preferred by users with similar preferences. However, CF methods suffer from poor recommendation accuracy when the user preference data used in the recommendation process is sparse. Data imputation can alleviate the data sparsity problem by substituting a virtual part of the missing user preferences. In this paper, we propose a k-recursive reliability-based imputation (k-RRI) that first selects data with high reliability and then recursively imputes data with additional selection while gradually lowering the reliability criterion. We also propose a new similarity measure that weights common interests and indifferences between users and items. The proposed method can overcome disregarding the importance of missing data and resolve the problem of poor data imputation of existing methods. The experimental results demonstrate that the proposed approach significantly improves recommendation accuracy compared to those resulting from the state-of-the-art methods while demanding less computational complexity.

Download Full-text

An Improved Novel Index Measured Segmentation Based Imputation Algorithm for Missing Data Imputation

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse/v7i6/0217 ◽

2017 ◽

Vol 7 (6) ◽

pp. 283-286

Author(s):

Priyadharsini .C ◽

◽

Antony Selvadoss Thanamani ◽

Keyword(s):

Missing Data ◽

Data Imputation ◽

Missing Data Imputation

Download Full-text

A Two-stage Deep Autoencoder-based Missing Data Imputation Method for Wind Farm SCADA Data

IEEE Sensors Journal ◽

10.1109/jsen.2021.3061109 ◽

2021 ◽

pp. 1-1

Author(s):

Xin Liu ◽

Zijun Zhang

Keyword(s):

Missing Data ◽

Wind Farm ◽

Imputation Method ◽

Data Imputation ◽

Two Stage ◽

Missing Data Imputation

Download Full-text

Cooperative Clustering Missing Data Imputation

2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc42975.2020.9283484 ◽

2020 ◽

Author(s):

Daoming Wan ◽

Roozbeh Razavi-Far ◽

Mehrdad Saif

Keyword(s):

Missing Data ◽

Data Imputation ◽

Missing Data Imputation ◽

Cooperative Clustering

Download Full-text

Spatio-Temporal Missing Data Imputation for Smart Power Grids

Proceedings of the Twelfth ACM International Conference on Future Energy Systems ◽

10.1145/3447555.3466586 ◽

2021 ◽

Author(s):

Sanmukh R. Kuppannagari ◽

Yao Fu ◽

Chung Ming Chueng ◽

Viktor K. Prasanna

Keyword(s):

Missing Data ◽

Power Grids ◽

Data Imputation ◽

Smart Power ◽

Missing Data Imputation ◽

Smart Power Grids ◽

Spatio Temporal

Download Full-text

Missing Data Imputation on IoT Sensor Networks: Implications for on-site Sensor Calibration

IEEE Sensors Journal ◽

10.1109/jsen.2021.3105442 ◽

2021 ◽

pp. 1-1

Author(s):

Nwamaka U. Okafor ◽

Declan T. Delaney

Keyword(s):

Sensor Networks ◽

Missing Data ◽

Sensor Calibration ◽

Data Imputation ◽

Missing Data Imputation

Download Full-text

Continuous missing data imputation with incomplete dataset by generative adversarial networks–based unsupervised learning for long-term bridge health monitoring

Structural Health Monitoring ◽

10.1177/14759217211021942 ◽

2021 ◽

pp. 147592172110219

Author(s):

Huachen Jiang ◽

Chunfeng Wan ◽

Kang Yang ◽

Youliang Ding ◽

Songtao Xue

Keyword(s):

Missing Data ◽

Health Monitoring ◽

Signal Transmission ◽

Imputation Accuracy ◽

Generative Adversarial Networks ◽

Data Imputation ◽

Sensor Failure ◽

Generative Adversarial Network ◽

Missing Data Imputation ◽

Adversarial Network

Wireless sensors are the key components of structural health monitoring systems. During the signal transmission, sensor failure is inevitable, among which, data loss is the most common type. Missing data problem poses a huge challenge to the consequent damage detection and condition assessment, and therefore, great importance should be attached. Conventional missing data imputation basically adopts the correlation-based method, especially for strain monitoring data. However, such methods often require delicate model selection, and the correlations for vehicle-induced strains are much harder to be captured compared with temperature-induced strains. In this article, a novel data-driven generative adversarial network (GAN) for imputing missing strain response is proposed. As opposed to traditional ways where correlations for inter-strains are explicitly modeled, the proposed method directly imputes the missing data considering the spatial–temporal relationships with other strain sensors based on the remaining observed data. Furthermore, the intact and complete dataset is not even necessary during the training process, which shows another great superiority over the model-based imputation method. The proposed method is implemented and verified on a real concrete bridge. In order to demonstrate the applicability and robustness of the GAN, imputation for single and multiple sensors is studied. Results show the proposed method provides an excellent performance of imputation accuracy and efficiency.

Download Full-text

Kernel weighted least square approach for imputing missing values of metabolomics data

Scientific Reports ◽

10.1038/s41598-021-90654-0 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Nishith Kumar ◽

Md. Aminul Hoque ◽

Masahiro Sugimoto

Keyword(s):

Missing Data ◽

Large Scale ◽

Missing Values ◽

Kernel Weight ◽

Least Square ◽

Data Matrix ◽

Data Imputation ◽

Metabolomics Data ◽

Missing Value ◽

Missing Data Imputation

AbstractMass spectrometry is a modern and sophisticated high-throughput analytical technique that enables large-scale metabolomic analyses. It yields a high-dimensional large-scale matrix (samples × metabolites) of quantified data that often contain missing cells in the data matrix as well as outliers that originate for several reasons, including technical and biological sources. Although several missing data imputation techniques are described in the literature, all conventional existing techniques only solve the missing value problems. They do not relieve the problems of outliers. Therefore, outliers in the dataset decrease the accuracy of the imputation. We developed a new kernel weight function-based proposed missing data imputation technique that resolves the problems of missing values and outliers. We evaluated the performance of the proposed method and other conventional and recently developed missing imputation techniques using both artificially generated data and experimentally measured data analysis in both the absence and presence of different rates of outliers. Performances based on both artificial data and real metabolomics data indicate the superiority of our proposed kernel weight-based missing data imputation technique to the existing alternatives. For user convenience, an R package of the proposed kernel weight-based missing value imputation technique was developed, which is available at https://github.com/NishithPaul/tWLSA.

Download Full-text

Missing data imputation in meteorological datasets with the GAIN method

2021 IEEE International Workshop on Metrology for Industry 4.0 & IoT (MetroInd4.0&IoT) ◽

10.1109/metroind4.0iot51437.2021.9488451 ◽

2021 ◽

Author(s):

Marina Popolizio ◽

Alberto Amato ◽

Tiziano Politi ◽

Roberto Calienno ◽

Vincenzo Di Lecce

Keyword(s):

Missing Data ◽

Data Imputation ◽

Missing Data Imputation

Download Full-text

Effective Collaborative Filtering Approaches Based on Missing Data Imputation

Boosting collaborative filtering based on missing data imputation using item's genre information

A Technique of Recursive Reliability-Based Missing Data Imputation for Collaborative Filtering

An Improved Novel Index Measured Segmentation Based Imputation Algorithm for Missing Data Imputation

A Two-stage Deep Autoencoder-based Missing Data Imputation Method for Wind Farm SCADA Data

Cooperative Clustering Missing Data Imputation

Spatio-Temporal Missing Data Imputation for Smart Power Grids

Missing Data Imputation on IoT Sensor Networks: Implications for on-site Sensor Calibration

Continuous missing data imputation with incomplete dataset by generative adversarial networks–based unsupervised learning for long-term bridge health monitoring

Kernel weighted least square approach for imputing missing values of metabolomics data

Missing data imputation in meteorological datasets with the GAIN method

Export Citation Format