scholarly journals Hypergraph Regularized Discriminative Nonnegative Matrix Factorization on Sample Classification and Co-Differentially Expressed Gene Selection

Complexity ◽  
2019 ◽  
Vol 2019 ◽  
pp. 1-12 ◽  
Author(s):  
Yong-Jing Hao ◽  
Ying-Lian Gao ◽  
Mi-Xiao Hou ◽  
Ling-Yun Dai ◽  
Jin-Xing Liu

Nonnegative Matrix Factorization (NMF) is a significant big data analysis technique. However, standard NMF regularized by simple graph does not have discriminative function, and traditional graph models cannot accurately reflect the problem of multigeometry information between data. To solve the above problem, this paper proposed a new method called Hypergraph Regularized Discriminative Nonnegative Matrix Factorization (HDNMF), which captures intrinsic geometry by constructing hypergraphs rather than simple graphs. The introduction of the hypergraph method allows high-order relationships between samples to be considered, and the introduction of label information enables the method to have discriminative effect. Both the hypergraph Laplace and the discriminative label information are utilized together to learn the projection matrix in the standard method. In addition, we offered a corresponding multiplication update solution for the optimization. Experiments indicate that the method proposed is more effective by comparing with the earlier methods.

2017 ◽  
Vol 266 ◽  
pp. 91-100 ◽  
Author(s):  
Huirong Li ◽  
Jiangshe Zhang ◽  
Guang Shi ◽  
Junmin Liu

RSC Advances ◽  
2016 ◽  
Vol 6 (46) ◽  
pp. 39652-39656 ◽  
Author(s):  
Jing Chen ◽  
Qin Ma ◽  
Xiaoyan Hu ◽  
Miao Zhang ◽  
Dongdong Qin ◽  
...  

Cancer classification is a key problem for identifying the genomic biomarkers and treating cancerous tumors in clinical research.


2017 ◽  
Vol 29 (9) ◽  
pp. 2553-2579 ◽  
Author(s):  
Ronghua Shang ◽  
Chiyang Liu ◽  
Yang Meng ◽  
Licheng Jiao ◽  
Rustam Stolkin

Nonnegative matrix factorization (NMF) is well known to be an effective tool for dimensionality reduction in problems involving big data. For this reason, it frequently appears in many areas of scientific and engineering literature. This letter proposes a novel semisupervised NMF algorithm for overcoming a variety of problems associated with NMF algorithms, including poor use of prior information, negative impact on manifold structure of the sparse constraint, and inaccurate graph construction. Our proposed algorithm, nonnegative matrix factorization with rank regularization and hard constraint (NMFRC), incorporates label information into data representation as a hard constraint, which makes full use of prior information. NMFRC also measures pairwise similarity according to geodesic distance rather than Euclidean distance. This results in more accurate measurement of pairwise relationships, resulting in more effective manifold information. Furthermore, NMFRC adopts rank constraint instead of norm constraints for regularization to balance the sparseness and smoothness of data. In this way, the new data representation is more representative and has better interpretability. Experiments on real data sets suggest that NMFRC outperforms four other state-of-the-art algorithms in terms of clustering accuracy.


Complexity ◽  
2017 ◽  
Vol 2017 ◽  
pp. 1-11 ◽  
Author(s):  
Ling-Yun Dai ◽  
Chun-Mei Feng ◽  
Jin-Xing Liu ◽  
Chun-Hou Zheng ◽  
Jiguo Yu ◽  
...  

Differential expression plays an important role in cancer diagnosis and classification. In recent years, many methods have been used to identify differentially expressed genes. However, the recognition rate and reliability of gene selection still need to be improved. In this paper, a novel constrained method named robust nonnegative matrix factorization via joint graph Laplacian and discriminative information (GLD-RNMF) is proposed for identifying differentially expressed genes, in which manifold learning and the discriminative label information are incorporated into the traditional nonnegative matrix factorization model to train the objective matrix. Specifically,L2,1-norm minimization is enforced on both the error function and the regularization term which is robust to outliers and noise in gene data. Furthermore, the multiplicative update rules and the details of convergence proof are shown for the new model. The experimental results on two publicly available cancer datasets demonstrate that GLD-RNMF is an effective method for identifying differentially expressed genes.


Sign in / Sign up

Export Citation Format

Share Document