scholarly journals Robust Semisupervised Nonnegative Local Coordinate Factorization for Data Representation

Complexity ◽  
2018 ◽  
Vol 2018 ◽  
pp. 1-16
Author(s):  
Wei Jiang ◽  
Qian Lv ◽  
Chenggang Yan ◽  
Kewei Tang ◽  
Jie Zhang

Obtaining an optimum data representation is a challenging issue that arises in many intellectual data processing techniques such as data mining, pattern recognition, and gene clustering. Many existing methods formulate this problem as a nonnegative matrix factorization (NMF) approximation problem. The standard NMF uses the least square loss function, which is not robust to outlier points and noises and fails to utilize prior label information to enhance the discriminability of representations. In this study, we develop a novel matrix factorization method called robust semisupervised nonnegative local coordinate factorization by integrating robust NMF, a robust local coordinate constraint, and local spline regression into a unified framework. We use the l2,1 norm for the loss function of the NMF and a local coordinate constraint term to make our method insensitive to outlier points and noises. In addition, we exploit the local and global consistencies of sample labels to guarantee that data representation is compact and discriminative. An efficient multiplicative updating algorithm is deduced to solve the novel loss function, followed by a strict proof of the convergence. Several experiments conducted in this study on face and gene datasets clearly indicate that the proposed method is more effective and robust compared to the state-of-the-art methods.

2017 ◽  
Vol 29 (9) ◽  
pp. 2553-2579 ◽  
Author(s):  
Ronghua Shang ◽  
Chiyang Liu ◽  
Yang Meng ◽  
Licheng Jiao ◽  
Rustam Stolkin

Nonnegative matrix factorization (NMF) is well known to be an effective tool for dimensionality reduction in problems involving big data. For this reason, it frequently appears in many areas of scientific and engineering literature. This letter proposes a novel semisupervised NMF algorithm for overcoming a variety of problems associated with NMF algorithms, including poor use of prior information, negative impact on manifold structure of the sparse constraint, and inaccurate graph construction. Our proposed algorithm, nonnegative matrix factorization with rank regularization and hard constraint (NMFRC), incorporates label information into data representation as a hard constraint, which makes full use of prior information. NMFRC also measures pairwise similarity according to geodesic distance rather than Euclidean distance. This results in more accurate measurement of pairwise relationships, resulting in more effective manifold information. Furthermore, NMFRC adopts rank constraint instead of norm constraints for regularization to balance the sparseness and smoothness of data. In this way, the new data representation is more representative and has better interpretability. Experiments on real data sets suggest that NMFRC outperforms four other state-of-the-art algorithms in terms of clustering accuracy.


2019 ◽  
Vol 164 ◽  
pp. 29-37 ◽  
Author(s):  
Shudong Huang ◽  
Peng Zhao ◽  
Yazhou Ren ◽  
Tianrui Li ◽  
Zenglin Xu

Author(s):  
Siyuan Peng ◽  
Zhijing Yang ◽  
Bingo Wing-Kuen Ling ◽  
Badong Chen ◽  
Zhiping Lin

Sign in / Sign up

Export Citation Format

Share Document