RANDOM FOREST BASED CLASSIFICATION OF MEDICAL X-RAY IMAGES USING A GENETIC ALGORITHM FOR FEATURE SELECTION

Automated classification of medical images is an increasingly important tool for physicians in their daily activities. However, due to its computational complexity, this task is one of the major current challenges in the field of content-based image retrieval (CBIR). In this paper, a medical image classification approach is proposed. This method is composed of two main phases. The first step consists of a pre-processing, where a texture and shape based features vector is extracted. Also, a feature selection approach was applied by using a Genetic Algorithm (GA). The proposed GA uses a kNN based classification error as fitness function, which enables the GA to obtain a combinatorial set of feature giving rise to optimal accuracy. In the second phase, a classification process is achieved by using random Forest classifier and a supervised multi-class classifier based on the support vector machine (SVM) for classifying X-ray images.

Download Full-text

A Novel Hybrid Feature Selection Model for Classification of Neuromuscular Dystrophies Using Bhattacharyya Coefficient, Genetic Algorithm and Radial Basis Function Based Support Vector Machine

Interdisciplinary Sciences Computational Life Sciences ◽

10.1007/s12539-016-0183-6 ◽

2016 ◽

Vol 10 (2) ◽

pp. 244-250

Author(s):

Divya Anand ◽

Babita Pandey ◽

Devendra K. Pandey

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Feature Selection ◽

Radial Basis Function ◽

Basis Function ◽

Selection Model ◽

Support Vector ◽

Bhattacharyya Coefficient ◽

Radial Basis

Download Full-text

A genetic algorithm based wrapper feature selection method for classification of hyperspectral images using support vector machine

10.1117/12.813256 ◽

2008 ◽

Cited By ~ 36

Author(s):

Li Zhuo ◽

Jing Zheng ◽

Xia Li ◽

Fang Wang ◽

Bin Ai ◽

...

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Feature Selection ◽

Feature Selection Method ◽

Selection Method ◽

Hyperspectral Images ◽

Support Vector ◽

Wrapper Feature Selection

Download Full-text

A Composite Hybrid Feature Selection Learning-Based Optimization of Genetic Algorithm For Breast Cancer Detection

10.20944/preprints202003.0298.v1 ◽

2020 ◽

Author(s):

Ahmed Abdullah Farid ◽

Gamal Selim ◽

Hatem Khater

Keyword(s):

Breast Cancer ◽

Genetic Algorithm ◽

Feature Selection ◽

Early Stage ◽

Fitness Function ◽

Support Vector ◽

Initial Population ◽

Tree Classifier ◽

Selection Approach ◽

Feature Selection Approach

Breast cancer is a significant health issue across the world. Breast cancer is the most widely-diagnosed cancer in women; early-stage diagnosis of disease and therapies increase patient safety. This paper proposes a synthetic model set of features focused on the optimization of the genetic algorithm (CHFS-BOGA) to forecast breast cancer. This hybrid feature selection approach combines the advantages of three filter feature selection approaches with an optimize Genetic Algorithm (OGA) to select the best features to improve the performance of the classification process and scalability. We propose OGA by improving the initial population generating and genetic operators using the results of filter approaches as some prior information with using the C4.5 decision tree classifier as a fitness function instead of probability and random selection. The authors collected available updated data from Wisconsin UCI machine learning with a total of 569 rows and 32 columns. The dataset evaluated using an explorer set of weka data mining open-source software for the analysis purpose. The results show that the proposed hybrid feature selection approach significantly outperforms the single filter approaches and principal component analysis (PCA) for optimum feature selection. These characteristics are good indicators for the return prediction. The highest accuracy achieved with the proposed system before (CHFS-BOGA) using the support vector machine (SVM) classifiers was 97.3%. The highest accuracy after (CHFS-BOGA-SVM) was 98.25% on split 70.0% train, remainder test, and 100% on the full training set. Moreover, the receiver operating characteristic (ROC) curve was equal to 1.0. The results showed that the proposed (CHFS-BOGA-SVM) system was able to accurately classify the type of breast tumor, whether malignant or benign.

Download Full-text

A Genetic Algorithm Based Feature Selection for Classification of Brain MRI Scan Images Using Random Forest Classifier

International Journal of Advanced Engineering Research and Science ◽

10.22161/ijaers.4.5.21 ◽

2017 ◽

Vol 4 (5) ◽

pp. 131-136 ◽

Cited By ~ 1

Author(s):

Dr. S. Mary Joans ◽

J. Sandhiya

Keyword(s):

Genetic Algorithm ◽

Feature Selection ◽

Random Forest ◽

Brain Mri ◽

Random Forest Classifier ◽

Mri Scan ◽

Selection For

Download Full-text

Sparse Contribution Feature Selection and Classifiers Optimized by Concave-Convex Variation for HCC Image Recognition

BioMed Research International ◽

10.1155/2017/9718386 ◽

2017 ◽

Vol 2017 ◽

pp. 1-14 ◽

Cited By ~ 6

Author(s):

Wenbo Pang ◽

Huiyan Jiang ◽

Siqi Li

Keyword(s):

Feature Selection ◽

Random Forest ◽

Image Recognition ◽

Bilateral Filter ◽

Variation Method ◽

Support Vector ◽

Image Patches ◽

Learning Machine ◽

Hematoxylin Eosin

Accurate classification of hepatocellular carcinoma (HCC) image is of great importance in pathology diagnosis and treatment. This paper proposes a concave-convex variation (CCV) method to optimize three classifiers (random forest, support vector machine, and extreme learning machine) for the more accurate HCC image classification results. First, in preprocessing stage, hematoxylin-eosin (H&E) pathological images are enhanced using bilateral filter and each HCC image patch is obtained under the guidance of pathologists. Then, after extracting the complete features of each patch, a new sparse contribution (SC) feature selection model is established to select the beneficial features for each classifier. Finally, a concave-convex variation method is developed to improve the performance of classifiers. Experiments using 1260 HCC image patches demonstrate that our proposed CCV classifiers have improved greatly compared to each original classifier and CCV-random forest (CCV-RF) performs the best for HCC image recognition.

Download Full-text

A method for handling metabonomics data from liquid chromatography/mass spectrometry: combinational use of support vector machine recursive feature elimination, genetic algorithm and random forest for feature selection

Metabolomics ◽

10.1007/s11306-011-0274-7 ◽

2011 ◽

Vol 7 (4) ◽

pp. 549-558 ◽

Cited By ~ 40

Author(s):

Xiaohui Lin ◽

Quancai Wang ◽

Peiyuan Yin ◽

Liang Tang ◽

Yexiong Tan ◽

...

Keyword(s):

Mass Spectrometry ◽

Genetic Algorithm ◽

Support Vector Machine ◽

Feature Selection ◽

Liquid Chromatography ◽

Random Forest ◽

Recursive Feature Elimination ◽

Support Vector ◽

Liquid Chromatography Mass Spectrometry ◽

Chromatography Mass Spectrometry

Download Full-text

COMBAT GA-BASED GENE SELECTION FOR CLASSIFICATION OF MICROARRAY DATA

Biomedical Engineering Applications Basis and Communications ◽

10.4015/s1016237208000969 ◽

2008 ◽

Vol 20 (06) ◽

pp. 345-352

Author(s):

Li-Yeh Chuang ◽

Cheng-San Yang ◽

Jung-Chike Li ◽

Cheng-Hong Yang

Keyword(s):

Gene Expression ◽

Feature Selection ◽

Microarray Data ◽

Clinical Medicine ◽

Fitness Function ◽

Error Rates ◽

Experimental Results ◽

Classification Error ◽

Cancer Type

Microarray data can provide valuable results for a variety of gene expression profile problems and contribute to advances in clinical medicine. The application of microarray data on cancer-type classification has recently gained in popularity. The properties of microarray data contain a large number of features (genes) with high dimensions, and one in the multi-class category. These facts make testing and training of general classification methods difficult. Reducing the number of genes and achieving lower classification error rates are the main issues to be solved. The classification of microarray data samples can be regarded as a feature selection and classifier design problem. The goal of feature selection is to select those subsets of differentially expressed genes that are potentially relevant for distinguishing the sample classes. Classical genetic algorithms (GAs) may suffer from premature convergence and thus lead to poor experimental results. In this paper, combat genetic algorithm (CGA) is used to implement the feature selection, and a K-nearest neighbor with the leave-one-out cross-validation method serves as a classifier of the CGA fitness function for the classification problem. The proposed method was applied to 10 microarray data sets that were obtained from the literature. The experimental results show that the proposed method not only effectively reduced the number of gene expression levels but also achieved lower classification error rates.

Download Full-text

Detection of Amaranthus palmeri sp. Seedlings in Vegetable Farms Using Genetic Algorithm Optimized Support Vector Machine

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.666.267 ◽

2014 ◽

Vol 666 ◽

pp. 267-271 ◽

Cited By ~ 1

Author(s):

W.K Wong ◽

Muralindran Mariappan ◽

Ali Chekima ◽

Manimehala Nadarajan ◽

Brendan Khoo

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Feature Selection ◽

Fine Tuning ◽

Support Vector ◽

Amaranthus Palmeri ◽

Weed Species ◽

Classification Rate ◽

Vector Machines

This research is a part of a larger research scope to recognise individual weed species for weed scouting and spot weeding. Support Vector Machines are used to classify the presence of specified weeds(Amaranthus palmeri )by analysing the shape of the weeds. Weed leaves are extracted using image dilation and erosion methods. Several shape feature types were proposed and a total of 59 features were used as the feature pool. The feature selection and fine tuning of the Support Vector Machine are performed using Genetic Algorithm. The outcome is a generalised classifier that enables classification of weed leaves with an average of 90.5% classification rate.

Download Full-text

Classification of SSVEP-based BCIs using Genetic Algorithm

Journal Of Big Data ◽

10.1186/s40537-021-00478-y ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Hamideh Soltani ◽

Zahra Einalou ◽

Mehrdad Dadgostar ◽

Keivan Maghooli

Keyword(s):

Genetic Algorithm ◽

Dimension Reduction ◽

Classification Accuracy ◽

Bayesian Method ◽

Computer Interface ◽

Support Vector ◽

Svm Classifier ◽

Effective Dimension ◽

Effective Dimension Reduction

AbstractBrain computer interface (BCI) systems have been regarded as a new way of communication for humans. In this research, common methods such as wavelet transform are applied in order to extract features. However, genetic algorithm (GA), as an evolutionary method, is used to select features. Finally, classification was done using the two approaches support vector machine (SVM) and Bayesian method. Five features were selected and the accuracy of Bayesian classification was measured to be 80% with dimension reduction. Ultimately, the classification accuracy reached 90.4% using SVM classifier. The results of the study indicate a better feature selection and the effective dimension reduction of these features, as well as a higher percentage of classification accuracy in comparison with other studies.

Download Full-text

CLASSIFICATION OF HIGH-DIMENSIONAL MICROARRAY DATA WITH A TWO-STEP PROCEDURE VIA A WILCOXON CRITERION AND MULTILAYER PERCEPTRON

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026811002969 ◽

2011 ◽

Vol 10 (01) ◽

pp. 1-14

Author(s):

VLADIMIR NIKULIN ◽

TIAN-HSIANG HUANG ◽

GEOFFREY J. MCLACHLAN

Keyword(s):

Data Mining ◽

Feature Selection ◽

High Dimensional ◽

Second Step ◽

Support Vector ◽

Step Procedure ◽

Leave One Out ◽

Natural Combination ◽

Feature Selection Techniques

The method presented in this paper is novel as a natural combination of two mutually dependent steps. Feature selection is a key element (first step) in our classification system, which was employed during the 2010 International RSCTC data mining (bioinformatics) Challenge. The second step may be implemented using any suitable classifier such as linear regression, support vector machine or neural networks. We conducted leave-one-out (LOO) experiments with several feature selection techniques and classifiers. Based on the LOO evaluations, we decided to use feature selection with the separation type Wilcoxon-based criterion for all final submissions. The method presented in this paper was tested successfully during the RSCTC data mining Challenge, where we achieved the top score in the Basic track.

Download Full-text