Explaining Image Classifiers Generating Exemplars and Counter-Exemplars from Latent Representations

We present an approach to explain the decisions of black box image classifiers through synthetic exemplar and counter-exemplar learnt in the latent feature space. Our explanation method exploits the latent representations learned through an adversarial autoencoder for generating a synthetic neighborhood of the image for which an explanation is required. A decision tree is trained on a set of images represented in the latent space, and its decision rules are used to generate exemplar images showing how the original image can be modified to stay within its class. Counterfactual rules are used to generate counter-exemplars showing how the original image can “morph” into another class. The explanation also comprehends a saliency map highlighting the areas that contribute to its classification, and areas that push it into another class. A wide and deep experimental evaluation proves that the proposed method outperforms existing explainers in terms of fidelity, relevance, coherence, and stability, besides providing the most useful and interpretable explanations.

Download Full-text

Decision Tree Integration Using Dynamic Regions of Competence

Entropy ◽

10.3390/e22101129 ◽

2020 ◽

Vol 22 (10) ◽

pp. 1129

Author(s):

Jędrzej Biedrzycki ◽

Robert Burduk

Keyword(s):

Decision Tree ◽

Decision Rules ◽

Feature Space ◽

Majority Voting ◽

Classification Model ◽

Training Dataset ◽

Multiple Classifier Systems ◽

Classifier Systems ◽

Voting Rule ◽

Multiple Classifier

A vital aspect of the Multiple Classifier Systems construction process is the base model integration. For example, the Random Forest approach used the majority voting rule to fuse the base classifiers obtained by bagging the training dataset. In this paper we propose the algorithm that uses partitioning the feature space whose split is determined by the decision rules of each decision tree node which is the base classification model. After dividing the feature space, the centroid of each new subspace is determined. This centroids are used in order to determine the weights needed in the integration phase based on the weighted majority voting rule. The proposal was compared with other Multiple Classifier Systems approaches. The experiments regarding multiple open-source benchmarking datasets demonstrate the effectiveness of our method. To discuss the results of our experiments, we use micro and macro-average classification performance measures.

Download Full-text

Explaining Sentiment Classification with Synthetic Exemplars and Counter-Exemplars

Discovery Science - Lecture Notes in Computer Science ◽

10.1007/978-3-030-61527-7_24 ◽

2020 ◽

pp. 357-373

Author(s):

Orestis Lampridis ◽

Riccardo Guidotti ◽

Salvatore Ruggieri

Keyword(s):

Decision Tree ◽

Black Box ◽

Sentiment Classification ◽

Box Model ◽

Local Approach ◽

Latent Space ◽

Selection Of

Abstract We present xspells, a model-agnostic local approach for explaining the decisions of a black box model for sentiment classification of short texts. The explanations provided consist of a set of exemplar sentences and a set of counter-exemplar sentences. The former are examples classified by the black box with the same label as the text to explain. The latter are examples classified with a different label (a form of counter-factuals). Both are close in meaning to the text to explain, and both are meaningful sentences – albeit they are synthetically generated. xspells generates neighbors of the text to explain in a latent space using Variational Autoencoders for encoding text and decoding latent instances. A decision tree is learned from randomly generated neighbors, and used to drive the selection of the exemplars and counter-exemplars. We report experiments on two datasets showing that xspells outperforms the well-known lime method in terms of quality of explanations, fidelity, and usefulness, and that is comparable to it in terms of stability.

Download Full-text

A branch & bound algorithm to determine optimal bivariate splits for oblique decision tree induction

Applied Intelligence ◽

10.1007/s10489-021-02281-x ◽

2021 ◽

Author(s):

Ferdinand Bollwein ◽

Stephan Westphal

Keyword(s):

Decision Tree ◽

Feature Space ◽

Classification Problems ◽

Decision Tree Induction ◽

Single Attribute ◽

Global Optimal ◽

The Individual ◽

Tree Building ◽

Very High ◽

Multiclass Classification Problems

AbstractUnivariate decision tree induction methods for multiclass classification problems such as CART, C4.5 and ID3 continue to be very popular in the context of machine learning due to their major benefit of being easy to interpret. However, as these trees only consider a single attribute per node, they often get quite large which lowers their explanatory value. Oblique decision tree building algorithms, which divide the feature space by multidimensional hyperplanes, often produce much smaller trees but the individual splits are hard to interpret. Moreover, the effort of finding optimal oblique splits is very high such that heuristics have to be applied to determine local optimal solutions. In this work, we introduce an effective branch and bound procedure to determine global optimal bivariate oblique splits for concave impurity measures. Decision trees based on these bivariate oblique splits remain fairly interpretable due to the restriction to two attributes per split. The resulting trees are significantly smaller and more accurate than their univariate counterparts due to their ability of adapting better to the underlying data and capturing interactions of attribute pairs. Moreover, our evaluation shows that our algorithm even outperforms algorithms based on heuristically obtained multivariate oblique splits despite the fact that we are focusing on two attributes only.

Download Full-text

Discovery of novel chemical reactions by deep generative recurrent neural network

Scientific Reports ◽

10.1038/s41598-021-81889-y ◽

2021 ◽

Vol 11 (1) ◽

Cited By ~ 1

Author(s):

William Bort ◽

Igor I. Baskin ◽

Timur Gimadiev ◽

Artem Mukanov ◽

Ramil Nugmanov ◽

...

Keyword(s):

Chemical Reactions ◽

Short Term Memory ◽

De Novo ◽

Molecular Structures ◽

Topographic Map ◽

Short Term ◽

Suzuki Reactions ◽

Class A ◽

Latent Space ◽

Long Short Term Memory

AbstractThe “creativity” of Artificial Intelligence (AI) in terms of generating de novo molecular structures opened a novel paradigm in compound design, weaknesses (stability & feasibility issues of such structures) notwithstanding. Here we show that “creative” AI may be as successfully taught to enumerate novel chemical reactions that are stoichiometrically coherent. Furthermore, when coupled to reaction space cartography, de novo reaction design may be focused on the desired reaction class. A sequence-to-sequence autoencoder with bidirectional Long Short-Term Memory layers was trained on on-purpose developed “SMILES/CGR” strings, encoding reactions of the USPTO database. The autoencoder latent space was visualized on a generative topographic map. Novel latent space points were sampled around a map area populated by Suzuki reactions and decoded to corresponding reactions. These can be critically analyzed by the expert, cleaned of irrelevant functional groups and eventually experimentally attempted, herewith enlarging the synthetic purpose of popular synthetic pathways.

Download Full-text

A Black-Box Adversarial Attack via Deep Reinforcement Learning on the Feature Space

2021 IEEE Conference on Dependable and Secure Computing (DSC) ◽

10.1109/dsc49826.2021.9346264 ◽

2021 ◽

Author(s):

Lyue Li ◽

Amir Rezapour ◽

Wen-Guey Tzeng

Keyword(s):

Reinforcement Learning ◽

Feature Space ◽

Black Box ◽

Adversarial Attack

Download Full-text

Predicting 30-day Hospital Readmission with Publicly Available Administrative Database

Methods of Information in Medicine ◽

10.3414/me14-02-0017 ◽

2015 ◽

Vol 54 (06) ◽

pp. 560-567 ◽

Cited By ~ 11

Author(s):

K. Zhu ◽

Z. Lou ◽

J. Zhou ◽

N. Ballester ◽

P. Parikh ◽

...

Keyword(s):

Heart Failure ◽

Logistic Regression ◽

Decision Tree ◽

Ad Hoc ◽

Prediction Models ◽

Conditional Logistic Regression ◽

Hospital Readmissions ◽

Decision Rules ◽

Classification Models ◽

Standard Classification

SummaryIntroduction: This article is part of the Focus Theme of Methods of Information in Medicine on “Big Data and Analytics in Healthcare”.Background: Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners.Objectives: Explore the use of conditional logistic regression to increase the prediction accuracy.Methods: We analyzed an HCUP statewide in-patient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models.Results: The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 – 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures.Conclusions: It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.

Download Full-text

treeheatr: an R package for interpretable decision tree visualizations

10.1101/2020.07.10.196352 ◽

2020 ◽

Author(s):

Trang T. Le ◽

Jason H. Moore

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Feature Space ◽

R Package ◽

Tree Structure ◽

Decision Tree Model ◽

Teaching Tool ◽

Tree Model ◽

Machine Learning Methods ◽

Link Type

AbstractSummarytreeheatr is an R package for creating interpretable decision tree visualizations with the data represented as a heatmap at the tree’s leaf nodes. The integrated presentation of the tree structure along with an overview of the data efficiently illustrates how the tree nodes split up the feature space and how well the tree model performs. This visualization can also be examined in depth to uncover the correlation structure in the data and importance of each feature in predicting the outcome. Implemented in an easily installed package with a detailed vignette, treeheatr can be a useful teaching tool to enhance students’ understanding of a simple decision tree model before diving into more complex tree-based machine learning methods.AvailabilityThe treeheatr package is freely available under the permissive MIT license at https://trang1618.github.io/treeheatr and https://cran.r-project.org/package=treeheatr. It comes with a detailed vignette that is automatically built with GitHub Actions continuous [email protected]

Download Full-text

Experimental Evaluation of Time-Series Decision Tree

Lecture Notes in Computer Science - Active Mining ◽

10.1007/11423270_11 ◽

2005 ◽

pp. 190-209 ◽

Cited By ~ 2

Author(s):

Yuu Yamada ◽

Einoshin Suzuki ◽

Hideto Yokoi ◽

Katsuhiko Takabayashi

Keyword(s):

Time Series ◽

Decision Tree ◽

Experimental Evaluation

Download Full-text

Sparse reduced-order modelling: sensor-based dynamics to full-state estimation

Journal of Fluid Mechanics ◽

10.1017/jfm.2018.147 ◽

2018 ◽

Vol 844 ◽

pp. 459-490 ◽

Cited By ~ 44

Author(s):

Jean-Christophe Loiseau ◽

Bernd R. Noack ◽

Steven L. Brunton

Keyword(s):

Coherent Structures ◽

Feature Space ◽

Black Box ◽

Box Model ◽

Sensor Data ◽

Dynamic Feature ◽

Time Resolved ◽

Reduced Order ◽

Reduced Order Modelling ◽

Full State

We propose a general dynamic reduced-order modelling framework for typical experimental data: time-resolved sensor data and optional non-time-resolved particle image velocimetry (PIV) snapshots. This framework can be decomposed into four building blocks. First, the sensor signals are lifted to a dynamic feature space without false neighbours. Second, we identify a sparse human-interpretable nonlinear dynamical system for the feature state based on the sparse identification of nonlinear dynamics (SINDy). Third, if PIV snapshots are available, a local linear mapping from the feature state to the velocity field is performed to reconstruct the full state of the system. Fourth, a generalized feature-based modal decomposition identifies coherent structures that are most dynamically correlated with the linear and nonlinear interaction terms in the sparse model, adding interpretability. Steps 1 and 2 define a black-box model. Optional steps 3 and 4 lift the black-box dynamics to a grey-box model in terms of the identified coherent structures, if non-time-resolved full-state data are available. This grey-box modelling strategy is successfully applied to the transient and post-transient laminar cylinder wake, and compares favourably with a proper orthogonal decomposition model. We foresee numerous applications of this highly flexible modelling strategy, including estimation, prediction and control. Moreover, the feature space may be based on intrinsic coordinates, which are unaffected by a key challenge of modal expansion: the slow change of low-dimensional coherent structures with changing geometry and varying parameters.

Download Full-text

Quality of Life Modeling at the Regional Level

Regional Development ◽

10.4018/978-1-4666-0882-5.ch111 ◽

2012 ◽

pp. 163-186

Author(s):

Jirí Krupka ◽

Miloslava Kašparová ◽

Pavel Jirava ◽

Jan Mandys

Keyword(s):

Quality Of Life ◽

Czech Republic ◽

Decision Tree ◽

Decision Rules ◽

Real Data ◽

Classification Model ◽

Data Sets ◽

The Czech Republic ◽

First Case

The chapter presents the problem of quality of life modeling in the Czech Republic based on classification methods. It concerns a comparison of methodological approaches; in the first case the approach of the Institute of Sociology of the Academy of Sciences of the Czech Republic was used, the second case is concerning a project of the civic association Team Initiative for Local Sustainable Development. On the basis of real data sets from the institute and team initiative the authors synthesized and analyzed quality of life classification models. They used decision tree classification algorithms for generating transparent decision rules and compare the classification results of decision tree. The classifier models on the basis of C5.0, CHAID, C&RT and C5.0 boosting algorithms were proposed and analyzed. The designed classification model was created in Clementine.

Download Full-text