Reverse Engineering the Neural Networks for Rule Extraction in Classification Problems

In machine learning, ensembles of models based on Multi-Layer Perceptrons (MLPs) or decision trees are considered successful models. However, explaining their responses is a complex problem that requires the creation of new methods of interpretation. A natural way to explain the classifications of the models is to transform them into propositional rules. In this work, we focus on random forests and gradient-boosted trees. Specifically, these models are converted into an ensemble of interpretable MLPs from which propositional rules are produced. The rule extraction method presented here allows one to precisely locate the discriminating hyperplanes that constitute the antecedents of the rules. In experiments based on eight classification problems, we compared our rule extraction technique to “Skope-Rules” and other state-of-the-art techniques. Experiments were performed with ten-fold cross-validation trials, with propositional rules that were also generated from ensembles of interpretable MLPs. By evaluating the characteristics of the extracted rules in terms of complexity, fidelity, and accuracy, the results obtained showed that our rule extraction technique is competitive. To the best of our knowledge, this is one of the few works showing a rule extraction technique that has been applied to both ensembles of decision trees and neural networks.

Download Full-text

A Comparative Analysis of a Novel Anomaly Detection Algorithm with Neural Networks

Deep Learning and Neural Networks ◽

10.4018/978-1-7998-0414-7.ch004 ◽

2020 ◽

pp. 52-68

Author(s):

Srijan Das ◽

Arpita Dutta ◽

Saurav Sharma ◽

Sangharatna Godboley

Keyword(s):

Neural Networks ◽

Comparative Analysis ◽

Anomaly Detection ◽

Supervised Classification ◽

Detection Algorithm ◽

Classification Problems ◽

Anomalous Data ◽

Criterion Functions ◽

The Neural Networks ◽

Data Objects

Anomaly Detection is an important research domain of Pattern Recognition due to its effects of classification and clustering problems. In this paper, an anomaly detection algorithm is proposed using different primitive cost functions such as Normal Perceptron, Relaxation Criterion, Mean Square Error (MSE) and Ho-Kashyap. These criterion functions are minimized to locate the decision boundary in the data space so as to classify the normal data objects and the anomalous data objects. The authors proposed algorithm uses the concept of supervised classification, though it is very different from solving normal supervised classification problems. This proposed algorithm using different criterion functions has been compared with the accuracy of the Neural Networks (NN) in order to bring out a comparative analysis between them and discuss some advantages.

Download Full-text

A Comparative Analysis of a Novel Anomaly Detection Algorithm with Neural Networks

International Journal of Rough Sets and Data Analysis ◽

10.4018/ijrsda.2017100101 ◽

2017 ◽

Vol 4 (4) ◽

pp. 1-16

Author(s):

Srijan Das ◽

Arpita Dutta ◽

Saurav Sharma ◽

Sangharatna Godboley

Keyword(s):

Neural Networks ◽

Comparative Analysis ◽

Anomaly Detection ◽

Supervised Classification ◽

Detection Algorithm ◽

Classification Problems ◽

Anomalous Data ◽

Criterion Functions ◽

The Neural Networks ◽

Data Objects

Anomaly Detection is an important research domain of Pattern Recognition due to its effects of classification and clustering problems. In this paper, an anomaly detection algorithm is proposed using different primitive cost functions such as Normal Perceptron, Relaxation Criterion, Mean Square Error (MSE) and Ho-Kashyap. These criterion functions are minimized to locate the decision boundary in the data space so as to classify the normal data objects and the anomalous data objects. The authors proposed algorithm uses the concept of supervised classification, though it is very different from solving normal supervised classification problems. This proposed algorithm using different criterion functions has been compared with the accuracy of the Neural Networks (NN) in order to bring out a comparative analysis between them and discuss some advantages.

Download Full-text

BIBLIOMETRIC MAPPING OF THE COMPUTATIONAL INTELLIGENCE FIELD

International Journal of Uncertainty Fuzziness and Knowledge-Based Systems ◽

10.1142/s0218488507004911 ◽

2007 ◽

Vol 15 (05) ◽

pp. 625-645 ◽

Cited By ~ 95

Author(s):

NEES JAN VAN ECK ◽

LUDO WALTMAN

Keyword(s):

Neural Networks ◽

Evolutionary Computation ◽

Computational Intelligence ◽

Fuzzy Systems ◽

Optimization Problems ◽

Control Problems ◽

Classification Problems ◽

Current Structure ◽

The Neural Networks ◽

Regression Problems

In this paper, a bibliometric study of the computational intelligence field is presented. Bibliometric maps showing the associations between the main concepts in the field are provided for the periods 1996–2000 and 2001–2005. Both the current structure of the field and the evolution of the field over the last decade are analyzed. In addition, a number of emerging areas in the field are identified. It turns out that computational intelligence can best be seen as a field that is structured around four important types of problems, namely control problems, classification problems, regression problems, and optimization problems. Within the computational intelligence field, the neural networks and fuzzy systems subfields are fairly intertwined, whereas the evolutionary computation subfield has a relatively independent position.

Download Full-text

Using Sample Selection to Improve Accuracy and Simplicity of Rules Extracted from Neural Networks for Credit Scoring Applications

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026815500212 ◽

2015 ◽

Vol 14 (04) ◽

pp. 1550021 ◽

Cited By ~ 5

Author(s):

Rudy Setiono ◽

Arnulfo Azcarraga ◽

Yoichi Hayashi

Keyword(s):

Neural Networks ◽

Predictive Accuracy ◽

Credit Scoring ◽

Sample Selection ◽

Rule Extraction ◽

Training Data ◽

Training Dataset ◽

Original Dataset ◽

Extraction Algorithm ◽

The Neural Networks

In this paper, we present an approach for sample selection using an ensemble of neural networks for credit scoring. The ensemble determines samples that can be considered outliers by checking the classification accuracy of the neural networks on the original training data samples. Those samples that are consistently misclassified by the neural networks in the ensemble are removed from the training dataset. The remaining data samples are then used to train and prune another neural network for rule extraction. Our experimental results on publicly available benchmark credit scoring datasets show that by eliminating the outliers, we obtain neural networks with higher predictive accuracy and simpler in structure compared to the networks that are trained with the original dataset. A rule extraction algorithm is applied to generate comprehensible rules from the neural networks. The extracted rules are more concise than the rules generated from networks that have been trained using the original datasets.

Download Full-text

Tax Fraud Detection through Neural Networks: An Application Using a Sample of Personal Income Taxpayers

Future Internet ◽

10.3390/fi11040086 ◽

2019 ◽

Vol 11 (4) ◽

pp. 86 ◽

Cited By ~ 2

Author(s):

César Pérez López ◽

María Delgado Rodríguez ◽

Sonia de Lucas Santos

Keyword(s):

Neural Network ◽

Neural Networks ◽

Income Tax ◽

Fraud Detection ◽

Personal Income ◽

Personal Income Tax ◽

Tax Returns ◽

The Neural Networks ◽

Tax Fraud ◽

Efficiency Rate

The goal of the present research is to contribute to the detection of tax fraud concerning personal income tax returns (IRPF, in Spanish) filed in Spain, through the use of Machine Learning advanced predictive tools, by applying Multilayer Perceptron neural network (MLP) models. The possibilities springing from these techniques have been applied to a broad range of personal income return data supplied by the Institute of Fiscal Studies (IEF). The use of the neural networks enabled taxpayer segmentation as well as calculation of the probability concerning an individual taxpayer’s propensity to attempt to evade taxes. The results showed that the selected model has an efficiency rate of 84.3%, implying an improvement in relation to other models utilized in tax fraud detection. The proposal can be generalized to quantify an individual’s propensity to commit fraud with regards to other kinds of taxes. These models will support tax offices to help them arrive at the best decisions regarding action plans to combat tax fraud.

Download Full-text

Neural networks trained with high-dimensional functions approximation data in high-dimensional space

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211417 ◽

2021 ◽

pp. 1-12

Author(s):

Jian Zheng ◽

Jianfeng Wang ◽

Yanping Chen ◽

Shuping Chen ◽

Jingjin Chen ◽

...

Keyword(s):

Neural Networks ◽

Dimensional Space ◽

Data Distribution ◽

High Dimensional ◽

Sufficient Information ◽

Sufficient Data ◽

High Dimensional Space ◽

Positive Effects ◽

The Neural Networks ◽

Using Data

Neural networks can approximate data because of owning many compact non-linear layers. In high-dimensional space, due to the curse of dimensionality, data distribution becomes sparse, causing that it is difficulty to provide sufficient information. Hence, the task becomes even harder if neural networks approximate data in high-dimensional space. To address this issue, according to the Lipschitz condition, the two deviations, i.e., the deviation of the neural networks trained using high-dimensional functions, and the deviation of high-dimensional functions approximation data, are derived. This purpose of doing this is to improve the ability of approximation high-dimensional space using neural networks. Experimental results show that the neural networks trained using high-dimensional functions outperforms that of using data in the capability of approximation data in high-dimensional space. We find that the neural networks trained using high-dimensional functions more suitable for high-dimensional space than that of using data, so that there is no need to retain sufficient data for neural networks training. Our findings suggests that in high-dimensional space, by tuning hidden layers of neural networks, this is hard to have substantial positive effects on improving precision of approximation data.

Download Full-text

Reverse engineering imperceptible backdoor attacks on deep neural networks for detection and training set cleansing

Computers & Security ◽

10.1016/j.cose.2021.102280 ◽

2021 ◽

Vol 106 ◽

pp. 102280

Author(s):

Zhen Xiang ◽

David J. Miller ◽

George Kesidis

Keyword(s):

Neural Networks ◽

Reverse Engineering ◽

Deep Neural Networks ◽

Training Set ◽

And Training

Download Full-text

Classification and Recognition of Plant Leaf Based on Neural Networks

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.464.38 ◽

2011 ◽

Vol 464 ◽

pp. 38-42 ◽

Cited By ~ 5

Author(s):

Ping Ye ◽

Gui Rong Weng

Keyword(s):

Neural Networks ◽

Fractal Dimension ◽

Recognition Rate ◽

Image Binarization ◽

Characteristic Parameters ◽

Moment Invariant ◽

Plant Leaf ◽

The Neural Networks ◽

Novel Method ◽

The Moment

This paper proposed a novel method for leaf classification and recognition. In the method, the moment invariant and fractal dimension were regarded as the characteristic parameters of the plant leaf. In order to extract the representative characteristic parameters, pretreatment of the leaf images, including RGB-gray converting, image binarization and leafstalk removing. The extracted leaf characteristic parameters were further utilized as training sets to train the neural networks. The proposed method was proved effectively to reach a recognition rate about 92% for most of the testing leaf samples

Download Full-text

Localization of Scattering Objects Using Neural Networks

Sensors ◽

10.3390/s21010011 ◽

2020 ◽

Vol 21 (1) ◽

pp. 11

Author(s):

Domonkos Haffner ◽

Ferenc Izsák

Keyword(s):

Neural Networks ◽

Plane Waves ◽

Measurement Data ◽

Scattered Wave ◽

Training Data ◽

Data Set ◽

Main Compound ◽

Incident Plane ◽

The Neural Networks ◽

Simulation Package

The localization of multiple scattering objects is performed while using scattered waves. An up-to-date approach: neural networks are used to estimate the corresponding locations. In the scattering phenomenon under investigation, we assume known incident plane waves, fully reflecting balls with known diameters and measurement data of the scattered wave on one fixed segment. The training data are constructed while using the simulation package μ-diff in Matlab. The structure of the neural networks, which are widely used for similar purposes, is further developed. A complex locally connected layer is the main compound of the proposed setup. With this and an appropriate preprocessing of the training data set, the number of parameters can be kept at a relatively low level. As a result, using a relatively large training data set, the unknown locations of the objects can be estimated effectively.

Download Full-text