Scaling Up Feature Selection: A Distributed Filter Approach

Author(s):  
Verónica Bolón-Canedo ◽  
Noelia Sánchez-Maroño ◽  
Joana Cerviño-Rabuñal
Author(s):  
Ch. Sanjeev Kumar Dash ◽  
Ajit Kumar Behera ◽  
Sarat Chandra Nayak

This chapter presents a novel approach for classifying datasets by suitably tuning the parameters of radial basis function networks, with the additional cost of feature selection. Feeding an optimal, relevant set of features to a radial basis function network can greatly enhance its accuracy while reducing its size. The authors use information gain (a filter approach) to reduce the feature set and differential evolution to tune the centers and spreads of the radial basis functions. Emphasis is placed on different feature selection methods, on handling missing values, and on removing inconsistencies, all of which improve the classification accuracy of the proposed model. The approach is validated on several benchmark datasets, both highly skewed and balanced, retrieved from the University of California, Irvine (UCI) repository. The experimental results are encouraging enough to pursue further extensive research on highly skewed data.
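The information-gain filter this chapter uses for feature reduction can be sketched in a few lines of plain Python. The toy feature/label data below is illustrative, not taken from the chapter:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a label sequence, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(feature_values, labels):
    """Reduction in label entropy obtained by splitting on a discrete feature."""
    n = len(labels)
    remainder = 0.0
    for v in set(feature_values):
        subset = [y for x, y in zip(feature_values, labels) if x == v]
        remainder += len(subset) / n * entropy(subset)
    return entropy(labels) - remainder

# Toy data: feature A perfectly predicts the class, feature B is noise.
A = [0, 0, 1, 1]
B = [0, 1, 0, 1]
y = ['no', 'no', 'yes', 'yes']

print(information_gain(A, y))  # 1.0 -> keep
print(information_gain(B, y))  # 0.0 -> discard
```

A filter ranks every feature this way and keeps only the highest-scoring ones before the network (here, the RBF network) ever sees the data.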


Author(s):  
Barak Chizi ◽  
Lior Rokach ◽  
Oded Maimon

Dimensionality (i.e., the number of attributes, or groups of attributes, in a data set) constitutes a serious obstacle to the efficiency of most data mining algorithms (Maimon and Last, 2000), chiefly because these algorithms are computationally intensive. This obstacle is sometimes known as the "curse of dimensionality" (Bellman, 1961). The objective of feature selection is to identify the important features in a data set and discard the rest as irrelevant or redundant. Because feature selection reduces the dimensionality of the data, it allows data mining algorithms to operate faster and more effectively. In some cases classification performance also improves, mainly because of the more compact, easily interpreted representation of the target concept. There are three main approaches to feature selection: filter, wrapper, and embedded. The filter approach (Kohavi, 1995; Kohavi and John, 1996) operates independently of the data mining method employed subsequently: undesirable features are filtered out of the data before learning begins. Filter algorithms use heuristics based on general characteristics of the data to evaluate the merit of feature subsets. A sub-category of filter methods, referred to here as rankers, employs some criterion to score each feature and produce a ranking; from this ordering, several feature subsets can be chosen by manually setting a cut-off point. The wrapper approach (Kohavi, 1995; Kohavi and John, 1996) uses an inducer as a black box, together with a statistical re-sampling technique such as cross-validation, to select the best feature subset according to some predictive measure. The embedded approach (see, for instance, Guyon and Elisseeff, 2003) resembles the wrapper approach in that the features are selected for a specific inducer, but the selection takes place during the learning process itself.
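A ranker of the kind described above reduces to two steps: score each feature, then cut the ranking at a manually chosen point. A minimal sketch, with hypothetical merit scores (e.g. from information gain) and a hypothetical cut-off `k`:

```python
def rank_features(scores):
    """Ranker-style filter: order feature indices by a per-feature score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

def select_top_k(scores, k):
    """Choose a subset by cutting the ranking at a manually set threshold k."""
    return rank_features(scores)[:k]

# Hypothetical per-feature merit scores for four features.
scores = [0.02, 0.91, 0.40, 0.77]
print(rank_features(scores))    # [1, 3, 2, 0]
print(select_top_k(scores, 2))  # [1, 3]
```

The scoring criterion is what distinguishes one ranker from another; the selection mechanism itself is just this threshold on the ordering.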


Author(s):  
Mekour Norreddine

Feature selection is one of the key problems in analyzing gene expression data: the process of choosing which features are important for prediction. There are two general approaches to feature selection: the filter approach and the wrapper approach. In this chapter, the authors combine a filter approach, using information-gain ranking, with a wrapper approach that uses a genetic algorithm as its search method. They evaluate the approach on two gene expression data sets, Leukemia and Central Nervous System, using the decision tree classifier C4.5 to measure the improvement in classification performance.
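The wrapper step described above searches the space of feature subsets with a genetic algorithm, scoring each candidate subset by classifier accuracy (C4.5 in the chapter). The sketch below is a toy version under stated assumptions: the `fitness` function is a stand-in for real cross-validated classifier accuracy, and the "true" relevant-feature mask is invented for illustration:

```python
import random

def ga_feature_search(n_features, fitness, pop_size=8, generations=20, seed=0):
    """Toy genetic algorithm over feature-subset bitmasks (the wrapper's search).
    `fitness` stands in for classifier accuracy on the candidate subset."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[: pop_size // 2]          # elitist selection
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)       # two distinct parents
            cut = rng.randrange(1, n_features)
            child = a[:cut] + b[cut:]             # one-point crossover
            if rng.random() < 0.1:                # occasional bit-flip mutation
                j = rng.randrange(n_features)
                child[j] ^= 1
            children.append(child)
        pop = survivors + children
    return max(pop, key=fitness)

# Stand-in fitness: reward masks close to a hypothetical "true" relevant set.
target = [1, 0, 1, 1, 0, 0]
fitness = lambda mask: sum(m == t for m, t in zip(mask, target))
print(ga_feature_search(6, fitness))
```

In the hybrid design, the information-gain filter first prunes the thousands of genes down to a tractable shortlist, and only then does this (much more expensive) wrapper search run over the shortlist.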


2017 ◽  
Vol 72 ◽  
pp. 314-326 ◽  
Author(s):  
Saúl Solorio-Fernández ◽  
José Fco. Martínez-Trinidad ◽  
J. Ariel Carrasco-Ochoa

RSC Advances ◽  
2016 ◽  
Vol 6 (102) ◽  
pp. 99676-99684 ◽  
Author(s):  
Davor Antanasijević ◽  
Jelena Antanasijević ◽  
Viktor Pocajt ◽  
Gordana Ušćumlić

A QSPR study on the transition temperatures of five-ring bent-core liquid crystals (LCs) was performed using GMDH-type neural networks. A novel multi-filter approach, combining chi-square ranking, v-WSH, and the GMDH algorithm, was used to select the descriptors.

