Differential Evolution and Perceptron Decision Trees for Classification Tasks

PyDL8.5: a Library for Learning Optimal Decision Trees

Decision Trees (DTs) are widely used Machine Learning (ML) models with a broad range of applications. Interest in these models has grown further in the context of Explainable AI (XAI), because decision trees of limited depth are highly interpretable. However, traditional algorithms for learning DTs are heuristic in nature and may produce trees of suboptimal quality under depth constraints. We introduce PyDL8.5, a Python library to infer depth-constrained Optimal Decision Trees (ODTs). PyDL8.5 provides an interface for DL8.5, an efficient algorithm for inferring depth-constrained ODTs. The library provides an easy-to-use scikit-learn compatible interface. It can be used not only for classification tasks but also for regression, clustering, and other tasks. We introduce an interface that allows users to easily implement these other learning tasks. We provide a number of examples of how to use this library.
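The gap between heuristic and optimal tree learning under a depth limit can be illustrated with a minimal sketch. The code below is not DL8.5 and does not use the PyDL8.5 API; it is a brute-force search over binary features, written only to show the idea. The dataset, the function names (`greedy_error`, `optimal_error`), and the use of misclassification count as the split criterion are all assumptions made for this illustration.

```python
# Toy rows: ((f0, f1, f2), label). The label is f0 XOR f1; f2 is a distractor
# that agrees with the label on 6 of 8 rows. All values are invented.
DATA = [
    ((0, 0, 0), 0), ((0, 0, 0), 0),
    ((0, 1, 1), 1), ((0, 1, 1), 1),
    ((1, 0, 1), 1), ((1, 0, 0), 1),
    ((1, 1, 0), 0), ((1, 1, 1), 0),
]
N_FEATURES = 3


def leaf_error(rows):
    """Misclassifications when a leaf predicts its majority class."""
    labels = [y for _, y in rows]
    return len(labels) - max(labels.count(0), labels.count(1))


def split(rows, feature):
    """Partition rows by the value (0 or 1) of one binary feature."""
    left = [r for r in rows if r[0][feature] == 0]
    right = [r for r in rows if r[0][feature] == 1]
    return left, right


def greedy_error(rows, depth):
    """Training error of a tree grown top-down, one locally best split at a time."""
    err = leaf_error(rows)
    if depth == 0 or err == 0:
        return err

    def one_step(feature):
        left, right = split(rows, feature)
        return leaf_error(left) + leaf_error(right)

    best_feature = min(range(N_FEATURES), key=one_step)
    left, right = split(rows, best_feature)
    return greedy_error(left, depth - 1) + greedy_error(right, depth - 1)


def optimal_error(rows, depth):
    """Smallest training error of any tree within the depth limit (exhaustive)."""
    best = leaf_error(rows)
    if depth == 0 or best == 0:
        return best
    for feature in range(N_FEATURES):
        left, right = split(rows, feature)
        candidate = optimal_error(left, depth - 1) + optimal_error(right, depth - 1)
        best = min(best, candidate)
    return best


if __name__ == "__main__":
    print("greedy depth-2 errors: ", greedy_error(DATA, 2))
    print("optimal depth-2 errors:", optimal_error(DATA, 2))
```

On this toy data the greedy procedure splits first on the distractor `f2` and ends with 2 of 8 rows misclassified, whereas the exhaustive search finds a depth-2 tree with no training errors. Exhaustive search scales poorly, which is why dedicated algorithms such as DL8.5 are of interest.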

OC1-DE: A Differential Evolution Based Approach for Inducing Oblique Decision Trees

Artificial Intelligence and Soft Computing - Lecture Notes in Computer Science, 2017, pp. 427-438

DOI: 10.1007/978-3-319-59063-9_38

Cited By ~ 3

Author(s): Rafael Rivera-Lopez, Juana Canul-Reich, José A. Gámez, José M. Puerta

Keyword(s): Differential Evolution, Decision Trees

Simplifying decision trees: A survey

The Knowledge Engineering Review, 1997, Vol 12 (01), pp. 1-40

DOI: 10.1017/s0269888997000015

Cited By ~ 142

Author(s): Leonard A. Breslow, David W. Aha

Keyword(s): Decision Trees, Data Structures, Classification Accuracy, Case Based Reasoning, Reasoning Systems, Tree Generation, Good Classification, Classification Tasks, Insight Into, Case Based

Induced decision trees are an extensively researched solution to classification tasks. For many practical tasks, the trees produced by tree-generation algorithms are not comprehensible to users because of their size and complexity. Although many tree induction algorithms have been shown to produce simpler, more comprehensible trees (or data structures derived from trees) with good classification accuracy, tree simplification has usually been of secondary concern relative to accuracy, and no attempt has been made to survey the literature from the perspective of simplification. We present a framework that organizes the approaches to tree simplification, and we summarize and critique the approaches within this framework. The purpose of this survey is to provide researchers and practitioners with a concise overview of tree-simplification approaches and insight into their relative capabilities. In our final discussion, we briefly describe some empirical findings and discuss the application of tree induction algorithms to case retrieval in case-based reasoning systems.

Artificial Intelligence Techniques for Unbalanced Datasets in Real World Classification Tasks

Machine Learning, 2012, pp. 414-427

DOI: 10.4018/978-1-60960-818-7.ch304

Cited By ~ 1

Author(s): Marco Vannucci, Valentina Colla, Silvia Cateni, Mirko Sgarbi

Keyword(s): Artificial Intelligence, Support Vector Machines, Decision Trees, Real World, Support Vector, Artificial Intelligence Techniques, Vector Machines, Classification Tasks

This chapter presents a survey of the problem of classification in unbalanced datasets. The effect of an imbalanced distribution of target classes is analyzed with respect to the performance of standard classifiers such as decision trees and support vector machines, and the main approaches for improving the generally unsatisfactory results obtained by such methods are described. Finally, two typical real-world applications are introduced, and the techniques employed for the related classification tasks are shown in practice.
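Why standard classifiers can look better than they are on unbalanced data can be seen with a small worked example. The sketch below is not taken from the chapter; the 95/5 class split, the always-majority predictor, and the helper names (`accuracy`, `balanced_accuracy`) are assumptions chosen for illustration. It compares plain accuracy with balanced accuracy, defined here as the mean of the per-class recalls.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recalls, so each class counts equally."""
    recalls = []
    for cls in sorted(set(y_true)):
        # Predictions made on the rows whose true label is this class.
        preds_for_cls = [p for t, p in zip(y_true, y_pred) if t == cls]
        hits = sum(1 for p in preds_for_cls if p == cls)
        recalls.append(hits / len(preds_for_cls))
    return sum(recalls) / len(recalls)


# Hypothetical unbalanced dataset: 95 majority-class rows, 5 minority-class rows.
y_true = [0] * 95 + [1] * 5
# A degenerate classifier that always predicts the majority class.
y_pred = [0] * 100

if __name__ == "__main__":
    print("accuracy:         ", accuracy(y_true, y_pred))
    print("balanced accuracy:", balanced_accuracy(y_true, y_pred))
```

The degenerate classifier scores high on plain accuracy while detecting no minority-class cases at all, which balanced accuracy exposes by dropping to chance level.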

Enhancing the Interpretability of Deep Models in Healthcare Through Attention: Application to Glucose Forecasting for Diabetic People

International Journal of Pattern Recognition and Artificial Intelligence, 2021, pp. 2160006

DOI: 10.1142/s0218001421600065

Author(s): Maxime De Bois, Mounîm A. El Yacoubi, Mehdi Ammi

Keyword(s): Neural Network, Deep Learning, Decision Trees, Recurrent Neural Network, Black Box, Good Compromise, Time Prediction, Classification Tasks

The adoption of deep learning in healthcare is hindered by its “black box” nature. In this paper, we explore the RETAIN architecture for the task of glucose forecasting for diabetic people. By using a two-level attention mechanism, the recurrent-neural-network-based RETAIN model is interpretable. We evaluate the RETAIN model on the type-2 IDIAB and the type-1 OhioT1DM datasets by comparing its statistical and clinical performances against two deep models and three models based on decision trees. We show that the RETAIN model offers a very good compromise between accuracy and interpretability, being almost as accurate as the LSTM and FCN models while remaining interpretable. We show the usefulness of its interpretable nature by analyzing the contribution of each variable to the final prediction. This analysis revealed that signal values older than 1 h are not used by the RETAIN model for 30 min ahead-of-time prediction of glucose. We also show how the RETAIN model changes its behavior upon the arrival of an event such as a carbohydrate intake or an insulin infusion. In particular, the patient’s state before the event proved particularly important for the prediction. Overall, the RETAIN model, thanks to its interpretability, seems to be a very promising model for regression or classification tasks in healthcare.

Classification of Tumor Samples from Expression Data Using Decision Trunks

Cancer Informatics, 2013, Vol 12, pp. CIN.S10356

DOI: 10.4137/cin.s10356

Cited By ~ 3

Author(s): Benjamin Ulfenborg, Karin Klinga-Levan, Björn Olsson

Keyword(s): Decision Trees, Comprehensive Evaluation, Expression Data, Current State, Wide Range, Machine Learning Approach, Human Decision, Classification Tasks, Testing Practices

We present a novel machine learning approach for the classification of cancer samples using expression data. We refer to the method as “decision trunks,” since it is loosely based on decision trees but contains several modifications designed to achieve an algorithm that: (1) produces smaller and more easily interpretable classifiers than decision trees; (2) is more robust in varying application scenarios; and (3) achieves higher classification accuracy. The decision trunk algorithm has been implemented and tested on 26 classification tasks, covering a wide range of cancer forms, experimental methods, and classification scenarios. This comprehensive evaluation indicates that the proposed algorithm performs at least as well as current state-of-the-art algorithms in terms of accuracy, while producing classifiers that include on average only 2–3 markers. We suggest that the resulting decision trunks have clear advantages over other classifiers due to their transparency, their interpretability, and their correspondence with human decision-making and clinical testing practices.

A hybrid scheme-based one-vs-all decision trees for multi-class classification tasks

Knowledge-Based Systems, 2020, Vol 198, pp. 105922

DOI: 10.1016/j.knosys.2020.105922

Cited By ~ 4

Author(s): Jianjian Yan, Zhongnan Zhang, Kunhui Lin, Fan Yang, Xiongbiao Luo

Keyword(s): Decision Trees, Hybrid Scheme, Classification Tasks, Multi Class Classification

Artificial Intelligence Techniques for Unbalanced Datasets in Real World Classification Tasks

Computational Modeling and Simulation of Intellect, 2011, pp. 551-565

DOI: 10.4018/978-1-60960-551-3.ch021

Cited By ~ 4

Author(s): Marco Vannucci, Valentina Colla, Silvia Cateni, Mirko Sgarbi

Keyword(s): Artificial Intelligence, Support Vector Machines, Decision Trees, Real World, Support Vector, Artificial Intelligence Techniques, Vector Machines, Classification Tasks

This chapter presents a survey of the problem of classification in unbalanced datasets. The effect of an imbalanced distribution of target classes is analyzed with respect to the performance of standard classifiers such as decision trees and support vector machines, and the main approaches for improving the generally unsatisfactory results obtained by such methods are described. Finally, two typical real-world applications are introduced, and the techniques employed for the related classification tasks are shown in practice.

Differential Evolution and Perceptron Decision Trees for Fault Detection in Power Transformers

Advances in Intelligent Systems and Computing - Soft Computing Models in Industrial and Environmental Applications, 2013, pp. 143-152

DOI: 10.1007/978-3-642-32922-7_15

Cited By ~ 2

Author(s): A. R. R. Freitas, R. C. Pedrosa Silva, F. G. Guimarães

Keyword(s): Fault Detection, Differential Evolution, Decision Trees, Power Transformers
