Improved Nonnegative Matrix Factorization Based Feature Selection for High Dimensional Data Analysis

Feature selection has become the focus of research areas of applications with high dimensional data. Nonnegative matrix factorization (NMF) is a good method for dimensionality reduction but it cant select the optimal feature subset for its a feature extraction method. In this paper, a two-step strategy method based on improved NMF is proposed.The first step is to get the basis of each catagory in the dataset by NMF. Added constrains can guarantee these basises are sparse and mostly distinguish from each other which can contribute to classfication. An auxiliary function is used to prove the algorithm convergent.The classic ReliefF algorithm is used to weight each feature by all the basis vectors and choose the optimal feature subset in the second step.The experimental results revealed that the proposed method can select a representive and relevant feature subset which is effective in improving the performance of the classifier.

Download Full-text

Spectral clustering of high-dimensional data via Nonnegative Matrix Factorization

2015 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2015.7280465 ◽

2015 ◽

Cited By ~ 3

Author(s):

Shulin Wang ◽

Fang Chen ◽

Jianwen Fang

Keyword(s):

Matrix Factorization ◽

Spectral Clustering ◽

Nonnegative Matrix Factorization ◽

High Dimensional Data ◽

Nonnegative Matrix ◽

High Dimensional

Download Full-text

A Comparison between Nonnegative Matrix Factorization and Feature Selection for Machine Learning of Personalized Recommender System

Journal of Digital Contents Society ◽

10.9728/dcs.2020.21.4.793 ◽

2020 ◽

Vol 21 (4) ◽

pp. 793-798

Author(s):

Chan-Woo Yoo ◽

Hee-Chern Kim

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Recommender System ◽

Matrix Factorization ◽

Nonnegative Matrix Factorization ◽

Nonnegative Matrix ◽

Selection For ◽

Personalized Recommender System

Download Full-text

Large Sample Covariance Matrices and High-Dimensional Data Analysis

10.1017/cbo9781107588080 ◽

2015 ◽

Cited By ~ 26

Author(s):

Jianfeng Yao ◽

Shurong Zheng ◽

Zhidong Bai

Keyword(s):

Data Analysis ◽

High Dimensional Data ◽

Covariance Matrices ◽

High Dimensional ◽

Large Sample ◽

Sample Covariance Matrices ◽

Sample Covariance ◽

High Dimensional Data Analysis

Download Full-text

Transfer Learning via Feature Selection Based Nonnegative Matrix Factorization

Web Information Systems Engineering – WISE 2019 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-34223-4_6 ◽

2019 ◽

pp. 82-97

Author(s):

Thirunavukarasu Balasubramaniam ◽

Richi Nayak ◽

Chau Yuen

Keyword(s):

Feature Selection ◽

Transfer Learning ◽

Matrix Factorization ◽

Nonnegative Matrix Factorization ◽

Nonnegative Matrix

Download Full-text

Machine Learning and High-Dimensional Data Analysis

Principles of Clinical Cancer Research ◽

10.1891/9781617052392.0017 ◽

2018 ◽

Author(s):

Sanjay Aneja ◽

James B. Yu

Keyword(s):

Machine Learning ◽

Data Analysis ◽

High Dimensional Data ◽

High Dimensional ◽

High Dimensional Data Analysis

Download Full-text

From Data to the Physics Using Ultrametrics: New Results in High Dimensional Data Analysis

10.1063/1.2193119 ◽

2006 ◽

Cited By ~ 4

Author(s):

Fionn Murtagh

Keyword(s):

Data Analysis ◽

High Dimensional Data ◽

High Dimensional ◽

High Dimensional Data Analysis

Download Full-text

Fast and Positive Definite Estimation of Large Covariance Matrix for High-Dimensional Data Analysis

IEEE Transactions on Big Data ◽

10.1109/tbdata.2019.2937785 ◽

2019 ◽

pp. 1-1 ◽

Cited By ~ 2

Author(s):

Fei Wen ◽

Lei Chu ◽

Rendong Ying ◽

Peilin Liu

Keyword(s):

Data Analysis ◽

Covariance Matrix ◽

High Dimensional Data ◽

Positive Definite ◽

High Dimensional ◽

High Dimensional Data Analysis

Download Full-text

MHSNMF: multi-view hessian regularization based symmetric nonnegative matrix factorization for microbiome data analysis

BMC Bioinformatics ◽

10.1186/s12859-020-03555-w ◽

2020 ◽

Vol 21 (S6) ◽

Author(s):

Yuanyuan Ma ◽

Junmin Zhao ◽

Yingjun Ma

Keyword(s):

Data Analysis ◽

Matrix Factorization ◽

Nonnegative Matrix Factorization ◽

Rapid Development ◽

Nonnegative Matrix ◽

Metabolomics Data ◽

Normalized Mutual Information ◽

Microbiome Data ◽

Symmetric Nonnegative Matrix Factorization ◽

Microbiome Data Analysis

Abstract Background With the rapid development of high-throughput technique, multiple heterogeneous omics data have been accumulated vastly (e.g., genomics, proteomics and metabolomics data). Integrating information from multiple sources or views is challenging to obtain a profound insight into the complicated relations among micro-organisms, nutrients and host environment. In this paper we propose a multi-view Hessian regularization based symmetric nonnegative matrix factorization algorithm (MHSNMF) for clustering heterogeneous microbiome data. Compared with many existing approaches, the advantages of MHSNMF lie in: (1) MHSNMF combines multiple Hessian regularization to leverage the high-order information from the same cohort of instances with multiple representations; (2) MHSNMF utilities the advantages of SNMF and naturally handles the complex relationship among microbiome samples; (3) uses the consensus matrix obtained by MHSNMF, we also design a novel approach to predict the classification of new microbiome samples. Results We conduct extensive experiments on two real-word datasets (Three-source dataset and Human Microbiome Plan dataset), the experimental results show that the proposed MHSNMF algorithm outperforms other baseline and state-of-the-art methods. Compared with other methods, MHSNMF achieves the best performance (accuracy: 95.28%, normalized mutual information: 91.79%) on microbiome data. It suggests the potential application of MHSNMF in microbiome data analysis. Conclusions Results show that the proposed MHSNMF algorithm can effectively combine the phylogenetic, transporter, and metabolic profiles into a unified paradigm to analyze the relationships among different microbiome samples. Furthermore, the proposed prediction method based on MHSNMF has been shown to be effective in judging the types of new microbiome samples.

Download Full-text