Mapping Ensembles of Trees to Sparse, Interpretable Multilayer Perceptron Networks

2020 · Vol 1 (5)
Author(s): Dalia Rodríguez-Salas, Nina Mürschberger, Nishant Ravikumar, Mathias Seuret, Andreas Maier

Abstract Tree-based classifiers provide easy-to-understand outputs. Artificial neural networks (ANNs) commonly outperform tree-based classifiers; nevertheless, understanding their outputs requires specialized knowledge in most cases. The highly redundant architecture of an ANN is typically designed through an expensive trial-and-error scheme. We aim to (1) investigate whether using ensembles of decision trees to design the architecture of low-redundancy, sparse ANNs yields better-performing networks, and (2) evaluate whether such trees can provide human-understandable explanations for the networks' outputs. From each branch in an ensemble of trees, we gather information about the hierarchy of the features and how well they separate subsets of samples among the classes. This information is used to design the architecture of a sparse multilayer perceptron network. Networks built with our method are called ForestNets. Tree branches corresponding to highly activated neurons are used to explain the networks' outputs. ForestNets can handle both low- and high-dimensional data, as we show in an evaluation on four datasets. Our networks consistently outperformed their respective ensembles of trees and matched the performance of their fully connected counterparts with a significant reduction in connections. Furthermore, our interpretation method appears to support the ForestNet outputs. Although ForestNet architectures do not yet capture the intrinsic variability of visual data well, they show very promising results, removing more than 98% of connections on such visual tasks. Structural similarities between ForestNets and their respective tree ensembles provide a means to interpret their outputs.
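The core idea of deriving a sparse architecture from tree structure can be illustrated with a minimal sketch. The code below is not the authors' implementation: it maps a single fitted decision tree to one hidden layer, where each internal split node becomes a neuron connected only to the feature it tests; the dataset, depth, and class names are illustrative assumptions.

```python
# Minimal sketch (not the ForestNet code): build a sparse MLP whose
# first-layer connectivity mirrors the split structure of one tree.
import numpy as np
import torch
import torch.nn as nn
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

t = tree.tree_
internal = np.where(t.children_left != -1)[0]   # internal (split) nodes
n_features, n_hidden = X.shape[1], len(internal)

# Connectivity mask: hidden neuron j sees only its node's split feature.
mask = torch.zeros(n_hidden, n_features)
for j, node in enumerate(internal):
    mask[j, t.feature[node]] = 1.0

class SparseLinear(nn.Linear):
    """Linear layer whose weights are gated by a fixed binary mask."""
    def __init__(self, mask):
        super().__init__(mask.shape[1], mask.shape[0])
        self.register_buffer("mask", mask)

    def forward(self, x):
        return nn.functional.linear(x, self.weight * self.mask, self.bias)

net = nn.Sequential(SparseLinear(mask), nn.ReLU(), nn.Linear(n_hidden, 3))
print(f"active input connections: {int(mask.sum())} / {n_hidden * n_features}")
```

In the paper's setting, information is aggregated from every branch of an ensemble rather than a single tree, but the masking mechanism above shows how tree-derived connectivity yields the reported reduction in connections.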

Author(s): Julio Fernández-Ceniceros, Andrés Sanz-García, Fernando Antoñanzas-Torres, F. Javier Martínez-de-Pisón-Ascacibar

2015 · Vol 23 (2) · pp. 1634-1641
Author(s): Hamza Abderrahim, Mohammed Reda Chellali, Ahmed Hamou

Entropy · 2020 · Vol 22 (7) · pp. 727
Author(s): Hlynur Jónsson, Giovanni Cherubini, Evangelos Eleftheriou

Information theory concepts are leveraged with the goal of better understanding and improving Deep Neural Networks (DNNs). The information plane of a neural network describes the behavior, during training, of the mutual information between the input/output and the hidden-layer variables at various depths. Previous analyses revealed that, in networks where the finiteness of the mutual information can be established, most of the training epochs are spent on compressing the input. However, estimating mutual information is nontrivial for high-dimensional continuous random variables. Therefore, computing the mutual information for DNNs and visualizing it on the information plane has mostly been limited to low-complexity fully connected networks; indeed, even the existence of the compression phase in complex DNNs has been questioned and viewed as an open problem. In this paper, we present the convergence of mutual information on the information plane for a high-dimensional VGG-16 Convolutional Neural Network (CNN) by resorting to Mutual Information Neural Estimation (MINE), thus confirming and extending the results obtained with low-dimensional fully connected networks. Furthermore, we demonstrate the benefits of regularizing a network, especially for a large number of training epochs, by adopting mutual information estimates as additional terms in the network's loss function. Experimental results show that the regularization stabilizes the test accuracy and significantly reduces its variance.
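MINE trains a small "statistics network" T to maximize the Donsker-Varadhan lower bound I(X; Z) >= E_{p(x,z)}[T(x,z)] - log E_{p(x)p(z)}[exp(T(x,z))]. The sketch below shows this estimator in a generic form; the network size, learning rate, and toy Gaussian data are assumptions for illustration, not the paper's VGG-16 setup.

```python
# Minimal sketch of a MINE-style estimator (Belghazi et al., 2018).
import math
import torch
import torch.nn as nn

class StatisticsNetwork(nn.Module):
    """Scores (x, z) pairs; higher under the joint than the product."""
    def __init__(self, x_dim, z_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(x_dim + z_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, x, z):
        return self.net(torch.cat([x, z], dim=1)).squeeze(-1)

def mine_lower_bound(T, x, z):
    joint = T(x, z).mean()                   # expectation under p(x, z)
    z_perm = z[torch.randperm(z.size(0))]    # break pairing -> p(x)p(z)
    marginal = torch.logsumexp(T(x, z_perm), dim=0) - math.log(z.size(0))
    return joint - marginal                  # lower bound on I(X; Z)

# Toy usage: estimate I between correlated Gaussian variables.
x = torch.randn(512, 4)
z = x + 0.1 * torch.randn(512, 4)
T = StatisticsNetwork(4, 4)
opt = torch.optim.Adam(T.parameters(), lr=1e-3)
for _ in range(200):
    opt.zero_grad()
    loss = -mine_lower_bound(T, x, z)        # maximize the bound
    loss.backward()
    opt.step()
print(f"estimated I(X; Z) >= {-loss.item():.3f} nats")
```

For the regularization described in the abstract, such an estimate would be added as an extra term to the classification loss; the weighting of that term is a hyperparameter not specified here.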


2019 · Vol 9 (1)
Author(s): Laura Gagliano, Elie Bou Assi, Dang K. Nguyen, Mohamad Sawan

Abstract This work proposes a novel approach for classifying interictal and preictal brain states based on bispectrum analysis and recurrent Long Short-Term Memory (LSTM) neural networks. Two features were first extracted from bilateral intracranial electroencephalography (iEEG) recordings of dogs with naturally occurring focal epilepsy. Single-layer LSTM networks were trained to classify 5-min-long feature vectors as preictal or interictal. Classification performance was compared to previous work on the same dataset that used multilayer perceptron networks with higher-order spectral (HOS) features. The proposed LSTM network proved superior to the multilayer perceptron network, achieving an average classification accuracy of 86.29% on held-out data. These results suggest the feasibility of forecasting epileptic seizures using recurrent neural networks with minimal feature extraction.
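A minimal sketch of this kind of single-layer LSTM classifier is shown below. It is not the study's exact architecture: the hidden size, the 300-step window (assuming one feature vector per second over 5 minutes), and the two-feature input dimension are assumed values chosen for illustration.

```python
# Minimal sketch: single-layer LSTM classifying a sequence of
# bispectrum-derived feature vectors as preictal (1) or interictal (0).
import torch
import torch.nn as nn

class SeizureStateLSTM(nn.Module):
    def __init__(self, n_features=2, hidden=32):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden,
                            num_layers=1, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):              # x: (batch, time, n_features)
        _, (h_n, _) = self.lstm(x)     # h_n: (1, batch, hidden)
        return self.head(h_n[-1])      # one logit per sequence

model = SeizureStateLSTM()
criterion = nn.BCEWithLogitsLoss()

# Toy batch: 8 five-minute windows, 300 steps, 2 HOS-derived features.
x = torch.randn(8, 300, 2)
y = torch.randint(0, 2, (8, 1)).float()
loss = criterion(model(x), y)
loss.backward()
```

The final hidden state summarizes the whole window, so a single linear head suffices for the binary preictal/interictal decision.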

