Feature Extraction for Classification of Proteomic Profile
This work studies on feature extraction for classification of proteomic profile. We evaluated four methods, including principal component analysis (PCA), independent component analysis (ICA), locally linear embedding (LLE) and weighted maximum margin criterion (WMMC). PCA, ICA and LLE extract features based on traditional low-dimension map technique. Comparatively, WMMC extracts features according to classification goal. To study classification performance of PCA, ICA, LLE and WMMC in detail, we used two well known classification methods, support vector machine (SVM) and Fisher discriminant analysis (FDA), to classify profiles. The results show WMMC having relatively good performance due to its prediction accuracy, sensitivity and specificity for diagnosis; it can correctly identify features with high discrimination ability from high-dimensional proteomic profile. When feature set size was reduced less than 10, PCA, ICA and LLE lose a lot of classification information, and the prediction accuracies are less than 90%. However, WMMC can extract most classification information. Its prediction accuracies, sensitivities and specificities are more than 95%. Obviously, WMMC is more suitable to proteomic profile classification. For classifier, FDA is sensible to feature extraction.