Feature selection technique for classification of hyperspectral AVIRIS data

1991 ◽  
Author(s):  
Sylvia S. Shen ◽  
Bonnie Y. Trang

Microarray technology has been developed as one of the powerful tools that have attracted many researchers to analyze gene expression level for a given organism. It has been observed that gene expression data have very large (in terms of thousands) of features and less number of samples (in terms of hundreds). This characteristic makes difficult to do an analysis of gene expression data. Hence efficient feature selection technique must be applied before we go for any kind of analysis. Feature selection plays a vital role in the classification of gene expression data. There are several feature selection techniques have been induced in this field. But Support Vector Machine with Recursive Feature Elimination (SVM-RFE) has been proven as the promising feature selection methods among others. SVM-RFE ranks the genes (features) by training the SVM classification model and with the combination of RFE method key genes are selected. Huge time consumption is the main issue of SVM-RFE. We introduced an efficient implementation of linier SVM to overcome this problem and improved the RFE with variable step size. Then, combined method was used for selecting informative genes. Effective resampling method is proposed to preprocess the datasets. This is used to make the distribution of samples balanced, which gives more reliable classification results. In this paper, we have also studied the applicability of common classifiers. Detailed experiments are conducted on four commonly used microarray gene expression datasets. The results show that the proposed method comparable classification performance


2021 ◽  
Vol 26 (5) ◽  
pp. 437-444
Author(s):  
Akhilesh Kumar Shrivas ◽  
Amit Kumar Dewangan ◽  
Samrendra Mohan Ghosh

Healthcare diagnosis system is very important and critical task in medical science for doctors and medical students. Chronic kidney disease is a very serious and dangerous problem which is directly related to the human life. In this research work, we have used data mining and feature selection technique to develop the robust and computationally efficient model for classifying chronic and non chronic kidney disease. An ensemble model is constructing through combination of two more similar types of trained model which helps to improve the performance. Feature selection is frequently used in machine learning area to raise a model with a few numbers of features which increase the performance of classification accuracy. The proposed feature selection techniques principle of Genetic Search (GS) and Greedy Stepwise Search (GSW). This proposed technique called GS-NB utilizes a pursuit methodology which is embedded in the Genetic Algorithm to select the features based on natural selection, the procedure that drives biological evolution. Then proposed technique called GSW-NB utilizes a search strategy that is included in the Greedy Stepwise to search the relevant feature based on problem solving heuristic for settling the locally ideal decision at each stage. The performance of suggested technique were estimated on Chronic Kidney Disease (CKD) classification problems and compared with proposed feature selection method. The classification techniques namely the Single Rule Classification (SRC), Conditional Inference Tree (CIT) and their ensemble model (SRC, CIT) have used for classification of CKD. The proposed ensemble model have used stacking learning technique which combines multiple classifiers, hence we can improve the performance of classifiers. The classifier performance is measured with observed accuracy, sensitivity and specificity. The experimental results demonstrated that the ensemble model (SRC, CIT) with GS-NB and GSW-NB can recognized CKD better than existing model. The proposed model can be beneficial and useful in medical science for identifying and diagnosis of chronic kidney disease.


Sign in / Sign up

Export Citation Format

Share Document