Automatic Design of Decision-Tree Algorithms with Evolutionary Algorithms

This study reports the empirical analysis of a hyper-heuristic evolutionary algorithm that is capable of automatically designing top-down decision-tree induction algorithms. Top-down decision-tree algorithms are of great importance, considering their ability to provide an intuitive and accurate knowledge representation for classification problems. The automatic design of these algorithms seems timely, given the large literature accumulated over more than 40 years of research in the manual design of decision-tree induction algorithms. The proposed hyper-heuristic evolutionary algorithm, HEAD-DT, is extensively tested using 20 public UCI datasets and 10 microarray gene expression datasets. The algorithms automatically designed by HEAD-DT are compared with traditional decision-tree induction algorithms, such as C4.5 and CART. Experimental results show that HEAD-DT is capable of generating algorithms which are significantly more accurate than C4.5 and CART.

Download Full-text

A branch & bound algorithm to determine optimal bivariate splits for oblique decision tree induction

Applied Intelligence ◽

10.1007/s10489-021-02281-x ◽

2021 ◽

Author(s):

Ferdinand Bollwein ◽

Stephan Westphal

Keyword(s):

Decision Tree ◽

Feature Space ◽

Classification Problems ◽

Decision Tree Induction ◽

Single Attribute ◽

Global Optimal ◽

The Individual ◽

Tree Building ◽

Very High ◽

Multiclass Classification Problems

AbstractUnivariate decision tree induction methods for multiclass classification problems such as CART, C4.5 and ID3 continue to be very popular in the context of machine learning due to their major benefit of being easy to interpret. However, as these trees only consider a single attribute per node, they often get quite large which lowers their explanatory value. Oblique decision tree building algorithms, which divide the feature space by multidimensional hyperplanes, often produce much smaller trees but the individual splits are hard to interpret. Moreover, the effort of finding optimal oblique splits is very high such that heuristics have to be applied to determine local optimal solutions. In this work, we introduce an effective branch and bound procedure to determine global optimal bivariate oblique splits for concave impurity measures. Decision trees based on these bivariate oblique splits remain fairly interpretable due to the restriction to two attributes per split. The resulting trees are significantly smaller and more accurate than their univariate counterparts due to their ability of adapting better to the underlying data and capturing interactions of attribute pairs. Moreover, our evaluation shows that our algorithm even outperforms algorithms based on heuristically obtained multivariate oblique splits despite the fact that we are focusing on two attributes only.

Download Full-text

LEARNING HYPERPLANES THAT CAPTURES THE GEOMETRIC STRUCTURE OF CLASS REGIONS

Graduate Research in Engineering and Technology ◽

10.47893/gret.2013.1003 ◽

2013 ◽

pp. 7-12

Author(s):

PRAMOD PATIL ◽

ALKA LONDHE ◽

PARAG KULKARNI

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Geometric Structure ◽

Gini Index ◽

Decision Tree Algorithm ◽

Top Down ◽

Angle Bisector ◽

Eigen Value ◽

Tree Algorithms ◽

Left And Right

Most of the decision tree algorithms rely on impurity measures to evaluate the goodness of hyperplanes at each node while learning a decision tree in a top-down fashion. These impurity measures are not differentiable with relation to the hyperplane parameters. Therefore the algorithms for decision tree learning using impurity measures need to use some search techniques for finding the best hyperplane at every node. These impurity measures don’t properly capture the geometric structures of the data. In this paper a Two-Class algorithm for learning oblique decision trees is proposed. Aggravated by this, the algorithm uses a strategy, to evaluate the hyperplanes in such a way that the (linear) geometric structure in the data is taken into consideration. At each node of the decision tree, algorithm finds the clustering hyperplanes for both the classes. The clustering hyperplanes are obtained by solving the generalized Eigen-value problem. Then the data is splitted based on angle bisector and recursively learn the left and right sub-trees of the node. Since, in general, there will be two angle bisectors; one is selected which is better based on an impurity measure gini index. Thus the algorithm combines the ideas of linear tendencies in data and purity of nodes to find better decision trees. This idea leads to small decision trees and better performance.

Download Full-text

Evolutionary Algorithms for Global Decision Tree Induction

Encyclopedia of Information Science and Technology, Fourth Edition ◽

10.4018/978-1-5225-2255-3.ch185 ◽

2018 ◽

pp. 2132-2141 ◽

Cited By ~ 1

Author(s):

Marek Kretowski ◽

Marcin Czajkowski

Keyword(s):

Evolutionary Algorithms ◽

Decision Tree ◽

Knowledge Discovery ◽

Decision Trees ◽

Optimal Solution ◽

Top Down ◽

Local Optima ◽

Optimal Decisions ◽

Decision Tree Induction ◽

Regression Functions

Decision trees represent one of the main predictive techniques in knowledge discovery. This chapter describes evolutionary induced trees, which are emerging alternatives to the greedy top-down solutions. Most typical tree-based system searches only for locally optimal decisions at each node and do not guarantee the optimal solution. Application of evolutionary algorithms to the problem of decision tree induction allows searching for the structure of the tree, tests in internal nodes and regression functions in the leaves (for model trees) at the same time. As a result, such globally induced decision tree is able to avoid local optima and usually leads to better prediction than the greedy counterparts.

Download Full-text

On the automatic design of decision-tree induction algorithms

10.11606/t.55.2013.tde-21032014-144814 ◽

2013 ◽

Author(s):

Rodrigo Coelho Barros

Keyword(s):

Decision Tree ◽

Automatic Design ◽

Decision Tree Induction

Download Full-text

An Evolutionary Algorithm for Oblique Decision Tree Induction

Lecture Notes in Computer Science - Artificial Intelligence and Soft Computing - ICAISC 2004 ◽

10.1007/978-3-540-24844-6_63 ◽

2004 ◽

pp. 432-437 ◽

Cited By ~ 20

Author(s):

Marek Krȩtowski

Keyword(s):

Decision Tree ◽

Evolutionary Algorithm ◽

Decision Tree Induction

Download Full-text

Evolutionary Algorithms for Global Decision Tree Induction

Advanced Methodologies and Technologies in Business Operations and Management - Advances in Logistics, Operations, and Management Science ◽

10.4018/978-1-5225-7362-3.ch050 ◽

2019 ◽

pp. 668-679

Author(s):

Marek Kretowski ◽

Marcin Czajkowski

Keyword(s):

Evolutionary Algorithms ◽

Decision Tree ◽

Knowledge Discovery ◽

Decision Trees ◽

Optimal Solution ◽

Top Down ◽

Local Optima ◽

Optimal Decisions ◽

Decision Tree Induction ◽

Regression Functions

Decision trees represent one of the main predictive techniques in knowledge discovery. This chapter describes evolutionary-induced trees, which are emerging alternatives to the greedy top-down solutions. Most typical tree-based systems search only for locally optimal decisions at each node and do not guarantee the optimal solution. Application of evolutionary algorithms to the problem of decision tree induction allows searching for the structure of the tree, tests in internal nodes, and regression functions in the leaves (for model trees) at the same time. As a result, such globally induced decision trees are able to avoid local optima and usually lead to better prediction than the greedy counterparts.

Download Full-text