Feature Selection by Fuzzy Inference and Its Application to Spam-Mail Filtering

Author(s):  
Jong-Wan Kim ◽  
Sin-Jae Kang
2020 ◽  
Author(s):  
Adel Bakhshipour ◽  
Hemad Zareiforoush

Abstract A combination of decision tree (DT) and fuzzy logic techniques was used to develop a fuzzy model for differentiating peanut plant from weeds. Color features and wavelet-based texture features were extracted from images of peanut plant and its three common weeds. Two feature selection techniques namely Principal Component Analysis (PCA) and Correlation-based Feature Selection (CFS) were applied on input dataset and three Decision Trees (DTs) including J48, Random Tree (RT), and Reduced Error Pruning (REP) were used to distinguish between different plants. In all cases, the best overall classification accuracies were achieved when CFS-selected features were used as input data. The obtained accuracies of J48-CFS, REP-CFS, and RT-CFS trees for classification of the four plant categories namely peanut plant, Velvetleaf, False daisy, and Nicandra, were 80.83%, 80.00% and 79.17% respectively. Along with these almost low accuracies, the structures of the decision trees were complex making them unsuitable for developing a fuzzy inference system. The classifiers were also used for differentiating peanut plant from the group of weeds. The overall accuracies on training and testing datasets were respectively 95.56% and 93.75% for J48-CFS; 92.78% and 91.67% for REP-CFS; and 93.33% and 92.59% for RT-CFS DTs. The results showed that the J48-CFS and REP-CFS were the most appropriate models to set the membership functions and rules of the fuzzy classifier system. Based on the results, it can be concluded that the developed DT-based fuzzy logic model can be used effectively to discriminate weeds from peanut plant in the form of machine vision-based cultivating systems.


2019 ◽  
Vol 12 (04) ◽  
pp. 1950039 ◽  
Author(s):  
Sarah M. Ayyad ◽  
Ahmed I. Saleh ◽  
Labib M. Labib

Classification of gene expression data is a pivotal research area that plays a substantial role in diagnosis and prediction of diseases. Generally, feature selection is one of the extensively used techniques in data mining approaches, especially in classification. Gene expression data are usually composed of dozens of samples characterized by thousands of genes. This increases the dimensionality coupled with the existence of irrelevant and redundant features. Accordingly, the selection of informative genes (features) becomes difficult, which badly affects the gene classification accuracy. In this paper, we consider the feature selection for classifying gene expression microarray datasets. The goal is to detect the most possibly cancer-related genes in a distributed manner, which helps in effectively classifying the samples. Initially, the available huge amount of considered features are subdivided and distributed among several processors. Then, a new filter selection method based on a fuzzy inference system is applied to each subset of the dataset. Finally, all the resulted features are ranked, then a wrapper-based selection method is applied. Experimental results showed that our proposed feature selection technique performs better than other techniques since it produces lower time latency and improves classification performance.


Sensors ◽  
2019 ◽  
Vol 19 (4) ◽  
pp. 943 ◽  
Author(s):  
Sadegh Arefnezhad ◽  
Sajjad Samiee ◽  
Arno Eichberger ◽  
Ali Nahvi

This paper presents a novel feature selection method to design a non-invasive driver drowsiness detection system based on steering wheel data. The proposed feature selector can select the most related features to the drowsiness level to improve the classification accuracy. This method is based on the combination of the filter and wrapper feature selection algorithms using adaptive neuro-fuzzy inference system (ANFIS). In this method firstly, four different filter indexes are applied on extracted features from steering wheel data. After that, output values of each filter index are imported as inputs to a fuzzy inference system to determine the importance degree of each feature and select the most important features. Then, the selected features are imported to a support vector machine (SVM) for binary classification to classify the driving conditions in two classes of drowsy and awake. Finally, the classifier accuracy is exploited to adjust parameters of an adaptive fuzzy system using a particle swarm optimization (PSO) algorithm. The experimental data were collected from about 20.5 h of driving in the simulator. The results show that the drowsiness detection system is working with a high accuracy and also confirm that this method is more accurate than the recent available algorithms.


2009 ◽  
Vol 16-19 ◽  
pp. 886-890 ◽  
Author(s):  
Wen Tao Sui ◽  
Dan Zhang

This paper presents a fault diagnosis method on roller bearings based on adaptive neuro-fuzzy inference system (ANFIS) in combination with feature selection. The class separability index was used as a feature selection criterion to select pertinent features from data set. An adaptive neural-fuzzy inference system was trained and used as a diagnostic classifier. For comparison purposes, the back propagation neural networks (BPN) method was also investigated. The results indicate that the ANFIS model has potential for fault diagnosis of roller bearings.


Author(s):  
Meir Perez ◽  
David M. Rubin ◽  
Tshilidzi Marwala ◽  
Lesley E. Scott ◽  
Jonathan Featherston ◽  
...  

2020 ◽  
Author(s):  
Adel Bakhshipour ◽  
Hemad Zareiforoush

Abstract A combination of decision tree (DT) and fuzzy logic techniques was used to develop a fuzzy model for differentiating peanut plant from weeds. Color features and wavelet-based texture features were extracted from images of peanut plant and its three common weeds. Two feature selection techniques namely Principal Component Analysis (PCA) and Correlation-based Feature Selection (CFS) were applied on input dataset and three Decision Trees (DTs) including J48, Random Tree (RT), and Reduced Error Pruning (REP) were used to distinguish between different plants. In all cases, the best overall classification accuracies were achieved when CFS-selected features were used as input data. The obtained accuracies of J48-CFS, REP-CFS, and RT-CFS trees for classification of the four plant categories namely peanut plant, Velvetleaf, False daisy, and Nicandra, were 80.83%, 80.00% and 79.17% respectively. Along with these almost low accuracies, the structures of the decision trees were complex making them unsuitable for developing a fuzzy inference system. The classifiers were also used for differentiating peanut plant from the group of weeds. The overall accuracies were 95.56% for J48-CFS, 92.78% for REP-CFS, and 93.33% for RT-CFS DTs. The results showed that the J48-CFS and REP-CFS were the most appropriate models to set the membership functions and rules of the fuzzy classifier system. Based on the results, it can be concluded that the developed DT-based fuzzy logic model can be used effectively to discriminate weeds from peanut plant in the form of machine vision-based cultivating systems.


Plant Methods ◽  
2020 ◽  
Vol 16 (1) ◽  
Author(s):  
Adel Bakhshipour ◽  
Hemad Zareiforoush

AbstractA combination of decision tree (DT) and fuzzy logic techniques was used to develop a fuzzy model for differentiating peanut plant from weeds. Color features and wavelet-based texture features were extracted from images of peanut plant and its three common weeds. Two feature selection techniques namely Principal Component Analysis (PCA) and Correlation-based Feature Selection (CFS) were applied on input dataset and three Decision Trees (DTs) including J48, Random Tree (RT), and Reduced Error Pruning (REP) were used to distinguish between different plants. In all cases, the best overall classification accuracies were achieved when CFS-selected features were used as input data. The obtained accuracies of J48-CFS, REP-CFS, and RT-CFS trees for classification of the four plant categories namely peanut plant, Velvetleaf, False daisy, and Nicandra, were 80.83%, 80.00% and 79.17% respectively. Along with these almost low accuracies, the structures of the decision trees were complex making them unsuitable for developing a fuzzy inference system. The classifiers were also used for differentiating peanut plant from the group of weeds. The overall accuracies on training and testing datasets were respectively 95.56% and 93.75% for J48-CFS; 92.78% and 91.67% for REP-CFS; and 93.33% and 92.59% for RT-CFS DTs. The results showed that the J48-CFS and REP-CFS were the most appropriate models to set the membership functions and rules of the fuzzy classifier system. Based on the results, it can be concluded that the developed DT-based fuzzy logic model can be used effectively to discriminate weeds from peanut plant in the form of machine vision-based cultivating systems.


Sign in / Sign up

Export Citation Format

Share Document