Training Data Reduction and Classification Based on Greedy Kernel Principal Component Analysis and Fuzzy C-Means Algorithm

2013 ◽  
Vol 347-350 ◽  
pp. 2390-2394
Author(s):  
Xiao Fang Liu ◽  
Chun Yang

Nonlinear feature extraction with the standard Kernel Principal Component Analysis (KPCA) method requires large amounts of memory and has high computational complexity on large datasets. A Greedy Kernel Principal Component Analysis (GKPCA) method is applied to reduce the training data and to handle nonlinear feature extraction for large training sets in classification. First, a subset that approximates the original training data is selected from the full training data using the greedy technique of the GKPCA method. Then, the feature extraction model is trained on the subset instead of the full training data. Finally, the fuzzy c-means (FCM) algorithm classifies the features extracted by the GKPCA, KPCA and PCA methods, respectively. The simulation results indicate that both the GKPCA and KPCA methods outperform the PCA method in feature extraction. In addition to retaining the performance of the KPCA method, the GKPCA method reduces computational complexity because of the reduced training set.
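The three steps in this abstract (greedy subset selection, KPCA trained on the subset, FCM on the extracted features) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not give the greedy criterion, so pivoted incomplete Cholesky on an RBF kernel is used here as an assumed stand-in, and the function names, `gamma`, the subset size and the toy data are all hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def greedy_select(X, m, gamma=1.0):
    """Assumed greedy step: pivoted incomplete Cholesky. Each round takes
    the sample whose feature-space residual is currently largest."""
    n = X.shape[0]
    residual = np.ones(n)              # k(x, x) = 1 for the RBF kernel
    G = np.zeros((n, m))
    chosen = []
    for j in range(m):
        i = int(np.argmax(residual))
        if residual[i] <= 1e-12:       # subset already spans the data
            break
        col = rbf_kernel(X, X[i:i + 1], gamma)[:, 0]
        G[:, j] = (col - G[:, :j] @ G[i, :j]) / np.sqrt(residual[i])
        residual = np.maximum(residual - G[:, j] ** 2, 0.0)
        residual[i] = 0.0
        chosen.append(i)
    return np.array(chosen)

def fit_kpca(S, n_components, gamma=1.0):
    """Ordinary KPCA, but trained only on the selected subset S."""
    m = S.shape[0]
    K = rbf_kernel(S, S, gamma)
    J = np.eye(m) - np.ones((m, m)) / m            # centring matrix
    w, V = np.linalg.eigh(J @ K @ J)
    top = np.argsort(w)[::-1][:n_components]
    return {"S": S, "K": K, "gamma": gamma, "alphas": V[:, top] / np.sqrt(w[top])}

def transform(model, X):
    """Project rows of X onto the principal components of the subset model."""
    K = model["K"]
    Kx = rbf_kernel(X, model["S"], model["gamma"])
    Kx_c = Kx - Kx.mean(1, keepdims=True) - K.mean(0) + K.mean()
    return Kx_c @ model["alphas"]

def fuzzy_c_means(Z, c, fuzzifier=2.0, n_iter=100, seed=0):
    """Standard FCM: alternate centre and membership updates."""
    rng = np.random.default_rng(seed)
    U = rng.random((Z.shape[0], c))
    U /= U.sum(1, keepdims=True)
    for _ in range(n_iter):
        W = U ** fuzzifier
        centers = (W.T @ Z) / W.sum(0)[:, None]
        d = np.linalg.norm(Z[:, None, :] - centers[None, :, :], axis=2) + 1e-12
        inv = d ** (-2.0 / (fuzzifier - 1.0))
        U = inv / inv.sum(1, keepdims=True)
    return U, centers

# Toy usage: two Gaussian blobs, 15 of 300 samples kept for training.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.5, (150, 2)), rng.normal(4, 0.5, (150, 2))])
subset = greedy_select(X, 15, gamma=0.5)
model = fit_kpca(X[subset], n_components=2, gamma=0.5)
memberships, _ = fuzzy_c_means(transform(model, X), c=2)
labels = memberships.argmax(1)
```

The saving comes from the eigendecomposition: it is done on a 15 x 15 kernel matrix rather than a 300 x 300 one, while projecting a new sample needs only 15 kernel evaluations.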

2014 ◽  
Vol 1030-1032 ◽  
pp. 1822-1827
Author(s):  
Ning Lv ◽  
Guang Yuan Bai ◽  
Lu Qi Yan ◽  
Yuan Jian Fu

To overcome the limitations of principal component analysis (PCA) fault diagnosis models on nonlinear, time-varying processes, and to reduce the computational complexity of process monitoring based on nonlinear principal components, kernel transformation theory is introduced to extract data features in a nonlinear space, and a fault monitoring model based on kernel principal component analysis (KPCA) for constant value detection is proposed. With proper selection of the kernel function parameter values, the KPCA model can achieve constant value detection of process faults and has lower computational complexity than other nonlinear algorithms. A fault detection experiment on a beer fermentation process shows that the method detects process faults in a timely manner and has good real-time performance and accuracy for slowly time-varying batch processes.
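A KPCA-based monitor of this general kind might look like the sketch below. The abstract does not say which monitoring statistic or control limit the authors use, so the squared prediction error (SPE) in feature space with an empirical-quantile limit is an assumption here, as are the class name `KPCAMonitor`, the RBF kernel and every parameter value.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

class KPCAMonitor:
    """KPCA model of normal operation with an SPE control limit (assumed)."""

    def __init__(self, n_components=5, gamma=0.5, quantile=0.99):
        self.n_components = n_components
        self.gamma = gamma
        self.quantile = quantile

    def fit(self, X_normal):
        n = X_normal.shape[0]
        K = rbf_kernel(X_normal, X_normal, self.gamma)
        J = np.eye(n) - np.ones((n, n)) / n            # centring matrix
        w, V = np.linalg.eigh(J @ K @ J)
        top = np.argsort(w)[::-1][:self.n_components]
        self.alphas = V[:, top] / np.sqrt(w[top])      # unit-norm directions
        self.X, self.K = X_normal, K
        # Control limit: empirical quantile of SPE on the normal data.
        self.limit = np.quantile(self.spe(X_normal), self.quantile)
        return self

    def spe(self, X):
        """Squared feature-space distance to the principal subspace."""
        Kx = rbf_kernel(X, self.X, self.gamma)
        Kx_c = Kx - Kx.mean(1, keepdims=True) - self.K.mean(0) + self.K.mean()
        scores = Kx_c @ self.alphas
        # Squared norm of the centred feature vector; k(x, x) = 1 for RBF.
        norm2 = 1.0 - 2.0 * Kx.mean(1) + self.K.mean()
        return norm2 - (scores ** 2).sum(1)

    def is_fault(self, X):
        return self.spe(X) > self.limit

# Toy usage: fit on normal data, then score samples with a large offset.
rng = np.random.default_rng(0)
normal = rng.normal(size=(200, 3))
monitor = KPCAMonitor().fit(normal)
flags = monitor.is_fault(normal[:5] + 6.0)
```

The kernel width `gamma` plays the role of the kernel function parameter mentioned in the abstract: it decides how quickly a sample stops resembling the normal data, so in practice it would be tuned on the process at hand.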


2012 ◽  
Vol 572 ◽  
pp. 7-12
Author(s):  
Fei He ◽  
Quan Yang ◽  
Bao Jian Wang

As more and more process data are acquired from manufacturing processes, extracting useful information to build empirical models of past successful operations is urgently required to achieve higher product quality. Clustering is an important data mining method, and feature extraction is a significant factor in the accuracy of clustering and classification. As a common nonlinear feature extraction method, kernel principal component analysis (KPCA) uses variance as the information metric, but variance is not always effective. Since information entropy is nonlinear and can effectively represent the dependencies among features, the Renyi entropy is used as the information metric for feature extraction in this paper. Simulation data, Tennessee Eastman process data and hot rolling process data are used for model validation. The results show that the proposed method performs better at feature extraction than traditional KPCA.
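One known way to swap variance for Renyi entropy as the ranking metric is to order kernel eigen-directions by their contribution to the quadratic Renyi entropy estimate, in the style of kernel entropy component analysis. Whether the paper does exactly this is not stated in the abstract, so the sketch below is an assumption; `entropy_components`, the RBF kernel and `gamma` are illustrative names and values. With a Parzen window estimate, the information potential is V = (1/N^2) 1ᵀK1 = (1/N^2) Σᵢ λᵢ (1ᵀeᵢ)², and the entropy estimate is -log V up to a kernel normalisation constant.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def entropy_components(X, n_components, gamma=0.5):
    """Rank kernel eigen-directions by Renyi entropy contribution.

    Returns the projected scores, the indices of the selected directions
    (into the ascending eigenvalue order of eigh), and every direction's
    contribution to the information potential V.
    """
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)            # deliberately not centred
    w, E = np.linalg.eigh(K)
    w = np.maximum(w, 0.0)                 # clip round-off negatives
    # Contribution of direction i to V: lambda_i * (1^T e_i)^2 / N^2.
    contrib = w * E.sum(0) ** 2 / n ** 2
    order = np.argsort(contrib)[::-1][:n_components]
    scores = E[:, order] * np.sqrt(w[order])
    return scores, order, contrib

# Toy usage: compare the entropy ranking with the usual variance ranking.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.3, (50, 2)), rng.normal(3, 0.3, (50, 2))])
scores, by_entropy, contrib = entropy_components(X, n_components=2)
by_variance = np.argsort(np.linalg.eigvalsh(rbf_kernel(X, X, 0.5)))[::-1][:2]
```

The two rankings can disagree: a direction with a large eigenvalue whose eigenvector sums to nearly zero contributes almost nothing to the entropy estimate, so it is kept by variance-based KPCA but dropped by the entropy ranking.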

