Feature Subset Selection by Estimation of Distribution Algorithms

Estimation of Distribution Algorithms - Genetic Algorithms and Evolutionary Computation ◽

10.1007/978-1-4615-1539-5_13 ◽

2002 ◽

pp. 269-293 ◽

Cited By ~ 5

Author(s):

I. Inza ◽

P. Larrañaga ◽

B. Sierra

Keyword(s):

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Estimation Of Distribution Algorithms ◽

Estimation Of Distribution ◽

Distribution Algorithms

Download Full-text

Estimation of Distribution Algorithms for Feature Subset Selection in Large Dimensionality Domains

Data Mining ◽

10.4018/978-1-930708-25-9.ch005 ◽

2011 ◽

pp. 97-116 ◽

Cited By ~ 1

Author(s):

Inaki Inza ◽

Pedro Larranaga ◽

Basilio Sierra

Keyword(s):

Probabilistic Models ◽

Subset Selection ◽

Population Based ◽

Feature Subset Selection ◽

Feature Subset ◽

Estimation Of Distribution Algorithms ◽

Text Learning ◽

Estimation Of Distribution ◽

Selection Tasks ◽

Distribution Algorithms

Feature Subset Selection (FSS) is a well-known task of Machine Learning, Data Mining, Pattern Recognition or Text Learning paradigms. Genetic Algorithms (GAs) are possibly the most commonly used algorithms for Feature Subset Selection tasks. Although the FSS literature contains many papers, few of them tackle the task of FSS in domains with more than 50 features. In this chapter we present a novel search heuristic paradigm, called Estimation of Distribution Algorithms (EDAs), as an alternative to GAs, to perform a population-based and randomized search in datasets of a large dimensionality. The EDA paradigm avoids the use of genetic crossover and mutation operators to evolve the populations. In absence of these operators, the evolution is guaranteed by the factorization of the probability distribution of the best solutions found in a generation of the search and the subsequent simulation of this distribution to obtain a new pool of solutions. In this chapter we present four different probabilistic models to perform this factorization. In a comparison with two types of GAs in natural and artificial datasets of a large dimensionality, EDAbased approaches obtain encouraging results with regard to accuracy, and a fewer number of evaluations were needed than used in genetic approaches.

Download Full-text

Feature subset selection by genetic algorithms and estimation of distribution algorithms

Artificial Intelligence in Medicine ◽

10.1016/s0933-3657(01)00085-9 ◽

2001 ◽

Vol 23 (2) ◽

pp. 187-205 ◽

Cited By ~ 33

Author(s):

I. Inza ◽

M. Merino ◽

P. Larrañaga ◽

J. Quiroga ◽

B. Sierra ◽

...

Keyword(s):

Genetic Algorithms ◽

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Estimation Of Distribution Algorithms ◽

Estimation Of Distribution ◽

Distribution Algorithms

Download Full-text

Prototype Selection and Feature Subset Selection by Estimation of Distribution Algorithms. A Case Study in the Survival of Cirrhotic Patients Treated with TIPS

Artificial Intelligence in Medicine - Lecture Notes in Computer Science ◽

10.1007/3-540-48229-6_3 ◽

2001 ◽

pp. 20-29 ◽

Cited By ~ 12

Author(s):

B. Sierra ◽

E. Lazkano ◽

I. Inza ◽

M. Merino ◽

P. Larrañaga ◽

...

Keyword(s):

Subset Selection ◽

Feature Subset Selection ◽

Feature Subset ◽

Estimation Of Distribution Algorithms ◽

Prototype Selection ◽

Estimation Of Distribution ◽

Cirrhotic Patients ◽

Distribution Algorithms

Download Full-text

Classifier Subset Selection to construct multi-classifiers by means of estimation of distribution algorithms

Neurocomputing ◽

10.1016/j.neucom.2015.01.036 ◽

2015 ◽

Vol 157 ◽

pp. 46-60 ◽

Cited By ~ 24

Author(s):

Iñigo Mendialdua ◽

Andoni Arruti ◽

Ekaitz Jauregi ◽

Elena Lazkano ◽

Basilio Sierra

Keyword(s):

Subset Selection ◽

Estimation Of Distribution Algorithms ◽

Estimation Of Distribution ◽

Distribution Algorithms

Download Full-text

GENE SELECTION FOR CANCER CLASSIFICATION USING WRAPPER APPROACHES

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001404003800 ◽

2004 ◽

Vol 18 (08) ◽

pp. 1373-1390 ◽

Cited By ~ 46

Author(s):

ROSA BLANCO ◽

PEDRO LARRAÑAGA ◽

IÑAKI INZA ◽

BASILIO SIERRA

Keyword(s):

Gene Expression ◽

Gene Selection ◽

Cancer Classification ◽

Feature Subset Selection ◽

Feature Subset ◽

Estimation Of Distribution ◽

Distribution Algorithms ◽

Selection Algorithms ◽

Selection Of ◽

General Method

Despite the fact that cancer classification has considerably improved, nowadays a general method that classifies known types of cancer has not yet been developed. In this work, we propose the use of supervised classification techniques, coupled with feature subset selection algorithms, to automatically perform this classification in gene expression datasets. Due to the large number of features of gene expression datasets, the search of a highly accurate combination of features is done by means of the new Estimation of Distribution Algorithms paradigm. In order to assess the accuracy level of the proposed approach, the naïve-Bayes classification algorithm is employed in a wrapper form. Promising results are achieved, in addition to a considerable reduction in the number of genes. Stating the optimal selection of genes as a search task, an automatic and robust choice in the genes finally selected is performed, in contrast to previous works that research the same types of problems.

Download Full-text