Deep Neural Networks Constrained by Decision Rules

Author(s):  
Yuzuru Okajima ◽  
Kunihiko Sadamasa

Deep neural networks achieve high predictive accuracy by learning latent representations of complex data. However, the reasoning behind their decisions is difficult for humans to understand. Rule-based approaches, on the other hand, can justify their decisions by showing the decision rules that lead to them, but they have relatively low accuracy. To improve the interpretability of neural networks, several techniques provide post-hoc explanations of decisions made by neural networks, but they cannot guarantee that decisions are always explained in a simple form like decision rules, because the explanations are generated after the decisions are made. In this paper, to balance the accuracy of neural networks with the interpretability of decision rules, we propose a hybrid technique called rule-constrained networks, namely, neural networks that make decisions by selecting decision rules from a given ruleset. Because the networks are forced to base their decisions on decision rules, every decision is guaranteed to be supported by a decision rule. Furthermore, we propose a technique to jointly optimize the neural network and the ruleset from which the network selects rules. The log-likelihood of correct classifications is maximized under a model with hyperparameters governing the ruleset size and the prior probabilities of rules being selected. This makes it possible to limit the ruleset size or to prioritize human-made rules over automatically acquired rules, promoting the interpretability of the output. Experiments on time-series and sentiment classification datasets showed that rule-constrained networks achieve accuracy as high as that of the original neural networks and significantly higher than that of existing rule-based models, while presenting the decision rules supporting each decision.
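A minimal sketch of the rule-selection idea described above (our simplified formulation, not the authors' exact model): a network scores candidate rules for each input and predicts the class of the highest-scoring rule whose antecedent holds, so every prediction is traceable to a rule. Names and the toy ruleset are illustrative.

```python
# Hedged sketch: a network that selects a decision rule per input.
import torch
import torch.nn as nn

class RuleConstrainedNet(nn.Module):
    def __init__(self, in_dim, n_rules, rule_classes):
        super().__init__()
        self.scorer = nn.Sequential(
            nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, n_rules))
        # rule_classes[j] = class predicted by rule j (fixed and human-readable)
        self.register_buffer("rule_classes", rule_classes)

    def forward(self, x, applicable):
        # applicable: (batch, n_rules) boolean mask of rules whose antecedent
        # holds for each input; inapplicable rules get a score of -inf.
        scores = self.scorer(x).masked_fill(~applicable, float("-inf"))
        probs = torch.softmax(scores, dim=-1)   # rule-selection distribution
        chosen = probs.argmax(dim=-1)           # index of the selected rule
        return self.rule_classes[chosen], chosen, probs

# toy usage: 3 rules predicting classes [0, 1, 1]
net = RuleConstrainedNet(in_dim=5, n_rules=3,
                         rule_classes=torch.tensor([0, 1, 1]))
x = torch.randn(2, 5)
applicable = torch.tensor([[True, True, False], [False, True, True]])
pred, rule_idx, probs = net(x, applicable)
print(pred, rule_idx)  # every prediction is backed by the rule at rule_idx
```

In the paper's joint optimization, the selection distribution would be trained by maximizing the log-likelihood of correct classifications under the stated hyperparameters; here only the constrained forward pass is sketched.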

2020 ◽  
Vol 92 (1) ◽  
pp. 388-395
Author(s):  
Lisa Linville ◽  
Dylan Anderson ◽  
Joshua Michalenko ◽  
Jennifer Galasso ◽  
Timothy Draelos

Abstract The impressive performance that deep neural networks demonstrate on a range of seismic monitoring tasks depends largely on the availability of event catalogs that have been manually curated over many years or decades. However, the quality, duration, and availability of seismic event catalogs vary significantly across the range of monitoring operations, regions, and objectives. Semisupervised learning (SSL) enables learning from both labeled and unlabeled data and provides a framework to leverage the abundance of unreviewed seismic data for training deep neural networks on a variety of target tasks. We apply two SSL algorithms (mean-teacher and virtual adversarial training) as well as a novel hybrid technique (exponential average adversarial training) to seismic event classification to examine how unlabeled data with SSL can enhance model performance. In general, we find that SSL can perform as well as supervised learning with fewer labels. We also observe in some scenarios that almost half of the benefits of SSL are the result of the meaningful regularization enforced through SSL techniques and may not be attributable to unlabeled data directly. Lastly, the benefits from unlabeled data scale with the difficulty of the predictive task when we evaluate the use of unlabeled data to characterize sources in new geographic regions. In geographic areas where supervised model performance is low, SSL significantly increases the accuracy of source-type classification using unlabeled data.
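As a reference point for one of the SSL baselines above, here is a hedged sketch of the mean-teacher idea: the teacher is an exponential moving average (EMA) of the student, and unlabeled data contribute only through a consistency term. Model sizes, the decay rate, and the loss weighting are illustrative, not the paper's configuration.

```python
# Hedged sketch of a mean-teacher training step (not the paper's code).
import copy
import torch
import torch.nn.functional as F

def ema_update(teacher, student, decay=0.99):
    with torch.no_grad():
        for t_p, s_p in zip(teacher.parameters(), student.parameters()):
            t_p.mul_(decay).add_(s_p, alpha=1.0 - decay)

def mean_teacher_step(student, teacher, opt, x_lab, y_lab, x_unlab, lam=1.0):
    sup = F.cross_entropy(student(x_lab), y_lab)           # supervised loss
    with torch.no_grad():
        teacher_probs = F.softmax(teacher(x_unlab), dim=-1)
    student_log_probs = F.log_softmax(student(x_unlab), dim=-1)
    cons = F.kl_div(student_log_probs, teacher_probs, reduction="batchmean")
    loss = sup + lam * cons                                 # consistency on unlabeled data
    opt.zero_grad(); loss.backward(); opt.step()
    ema_update(teacher, student)                            # teacher trails the student
    return loss.item()

# the teacher starts as a copy of the student and is never trained directly
student = torch.nn.Sequential(torch.nn.Linear(40, 64), torch.nn.ReLU(),
                              torch.nn.Linear(64, 3))
teacher = copy.deepcopy(student)
for p in teacher.parameters():
    p.requires_grad_(False)
opt = torch.optim.Adam(student.parameters(), lr=1e-3)
x_lab, y_lab, x_unlab = torch.randn(16, 40), torch.randint(0, 3, (16,)), torch.randn(64, 40)
print(mean_teacher_step(student, teacher, opt, x_lab, y_lab, x_unlab))
```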


Sensors ◽  
2019 ◽  
Vol 19 (16) ◽  
pp. 3530
Author(s):  
Juan Parras ◽  
Santiago Zazo ◽  
Iván A. Pérez-Álvarez ◽  
José Luis Sanz González

In recent years, there has been a significant effort towards developing localization systems for the underwater medium, with current methods relying on anchor nodes, explicit modeling of the underwater channel, or cooperation from the target. Lately, there has also been some work on using the approximation capabilities of Deep Neural Networks to address this problem. In this work, we study how the localization precision of Deep Neural Networks is affected by the variability of the channel, the noise level at the receiver, the number of neurons in the neural network, and whether the power or the covariance of the received acoustic signals is used as input. Our study shows that using deep neural networks is a valid approach when the channel variability is low, which opens the door to further research on such localization methods for the underwater environment.
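The two input choices compared in the study can be illustrated as follows; this is a sketch with our own naming and a toy network, not the paper's architecture: either the per-sensor received power or the flattened sample covariance of the array signals is fed to a small fully connected network that regresses the source position.

```python
# Illustrative sketch of power vs. covariance features for DNN localization.
import numpy as np
import torch
import torch.nn as nn

def power_features(x):                  # x: (n_sensors, n_samples)
    return np.mean(np.abs(x) ** 2, axis=1)         # per-sensor received power

def covariance_features(x):
    c = np.cov(x)                                   # (n_sensors, n_sensors) sample covariance
    return c[np.triu_indices_from(c)]               # upper triangle, flattened

def make_localizer(in_dim, hidden=128):
    return nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU(),
                         nn.Linear(hidden, hidden), nn.ReLU(),
                         nn.Linear(hidden, 2))      # (x, y) position estimate

# toy usage: 8 sensors, 1024 samples of synthetic noise
sig = np.random.randn(8, 1024)
feat = covariance_features(sig)
net = make_localizer(in_dim=feat.size)
pos = net(torch.tensor(feat, dtype=torch.float32))
print(pos.detach().numpy())
```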


Author(s):  
V. N. Gridin ◽  
I. A. Evdokimov ◽  
B. R. Salem ◽  
V. I. Solodovnikov

We analyze the key stages, implementation features, and operating principles of neural networks, including deep neural networks. The problems of choosing the number of hidden elements, selecting the internal topology, and setting parameters are considered. It is shown that the capacity of a neural network can be controlled during training and validation, and that the qualitative characteristics of the constructed model can be evaluated. The automation of the construction process and the optimization of hyperparameters of neural network structures are considered, depending on the user's tasks and the available source data. A number of approaches based on probabilistic programming, evolutionary algorithms, and recurrent neural networks are presented.
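As a minimal illustration of one of the approaches mentioned (evolutionary search over network structure), the sketch below mutates candidate hidden-layer configurations and keeps the fittest; the fitness function is a placeholder where one would train and validate a network, and the search space is our own assumption.

```python
# Hedged sketch of an evolutionary search over hidden-layer sizes.
import random

def fitness(layers):
    # placeholder: in practice, train a network with these hidden sizes
    # and return its validation accuracy
    return -abs(sum(layers) - 200) / 200.0

def evolve(pop_size=10, generations=20):
    pop = [[random.choice([16, 32, 64, 128]) for _ in range(random.randint(1, 3))]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[: pop_size // 2]                 # keep the best half
        children = [[max(8, h + random.choice([-16, 0, 16])) for h in parent]
                    for parent in survivors]             # mutate survivors
        pop = survivors + children
    return max(pop, key=fitness)

print(evolve())   # e.g. [64, 128] -- a candidate hidden-layer configuration
```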


2021 ◽  
Vol 3 (4) ◽  
pp. 966-989
Author(s):  
Vanessa Buhrmester ◽  
David Münch ◽  
Michael Arens

Deep Learning is a state-of-the-art technique for making inferences on extensive or complex data. Because of their multilayer nonlinear structure, Deep Neural Networks act as black-box models and are often criticized for being non-transparent and for producing predictions that humans cannot trace. Furthermore, the models learn from artificially generated datasets, which often do not reflect reality. By basing decision-making algorithms on Deep Neural Networks, prejudice and unfairness may be promoted unknowingly due to this lack of transparency. Hence, several so-called explanators, or explainers, have been developed. Explainers try to give insight into the inner structure of machine learning black boxes by analyzing the connection between the input and the output. In this survey, we present the mechanisms and properties of explaining systems for Deep Neural Networks applied to Computer Vision tasks. We give a comprehensive overview of the taxonomy of related studies and compare several survey papers that deal with explainability in general. We work out the drawbacks and gaps and summarize further research ideas.


2021 ◽  
Vol 13 (15) ◽  
pp. 2908
Author(s):  
Do-Hyung Kim ◽  
Guzmán López ◽  
Diego Kiedanski ◽  
Iyke Maduako ◽  
Braulio Ríos ◽  
...  

Understanding the biases in Deep Neural Network (DNN) based algorithms is gaining paramount importance due to their increasing application to many real-world problems. The known tendency of DNNs to penalize underrepresented populations could undermine the efficacy of development projects that depend on data produced by DNN-based models. In spite of this, biases in DNNs for Land Use and Land Cover Classification (LULCC) have not been the subject of many studies. In this study, we explore ways to quantify biases in DNNs for land use, with the example of identifying school buildings in Colombia from satellite imagery. We implement a DNN-based model by fine-tuning an existing pre-trained model for school building identification. The model achieved an overall accuracy of 84%. We then used socioeconomic covariates to analyze possible biases in the learned representation. The retrained deep neural network was used to extract visual features (embeddings) from satellite image tiles, the embeddings were clustered into four subtypes of schools, and the accuracy of the neural network model was assessed for each cluster. The distributions of various socioeconomic covariates across clusters were analyzed to identify links between the model accuracy and these covariates. Our results indicate that the model accuracy is lowest (57%) where the characteristics of the landscape are predominantly related to poverty and remoteness, which confirms our original assumption about the heterogeneous performance of Artificial Intelligence (AI) algorithms and their biases. Based on our findings, we identify possible sources of bias and present suggestions on how to prepare a balanced training dataset that would result in less biased AI algorithms. The framework used in our study to better understand biases in DNN models would be useful when Machine Learning (ML) techniques are adopted in lieu of ground-based data collection for international development programs. Because such programs aim to solve issues of social inequality, ML techniques are only applicable when they are transparent and accountable.
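The clustering-based bias analysis described above can be sketched as follows; variable names and the toy data are illustrative, and the paper's embeddings come from the retrained network rather than random vectors: cluster the image-tile embeddings into four subtypes and report model accuracy per cluster, then compare socioeconomic covariates across clusters.

```python
# Hedged sketch of per-cluster accuracy over embedding clusters.
import numpy as np
from sklearn.cluster import KMeans

def per_cluster_accuracy(embeddings, y_true, y_pred, n_clusters=4, seed=0):
    labels = KMeans(n_clusters=n_clusters, random_state=seed,
                    n_init=10).fit_predict(embeddings)
    acc = {}
    for c in range(n_clusters):
        mask = labels == c
        acc[c] = float(np.mean(y_true[mask] == y_pred[mask])) if mask.any() else float("nan")
    return labels, acc

# toy usage: 100 image tiles with 32-dimensional embeddings
emb = np.random.randn(100, 32)
y_true = np.random.randint(0, 2, 100)
y_pred = np.random.randint(0, 2, 100)
clusters, acc = per_cluster_accuracy(emb, y_true, y_pred)
print(acc)   # low-accuracy clusters flag where the model may be biased
```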


Author(s):  
Kazuma Matsumoto ◽  
Takato Tatsumi ◽  
Hiroyuki Sato ◽  
Tim Kovacs ◽  
Keiki Takadama ◽  
...  

Deep learning improves the classification accuracy of neural networks, and in some fields its accuracy exceeds that of humans. This paper proposes a hybrid system of a neural network and a Learning Classifier System (LCS). An LCS is an evolutionary rule-based machine learning method that uses reinforcement learning. To increase classification accuracy, we combine the neural network and the LCS. We conducted benchmark experiments to verify the proposed system. The experiments revealed that: 1) the classification accuracy of the proposed system is higher than that of the conventional LCS (XCSR) and of a normal neural network; and 2) the covering mechanism of XCSR raises the classification accuracy of the proposed system.
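A heavily hedged illustration of a neural network / rule-based hybrid follows; it shows the covering idea in a generic form (create a rule when no existing rule matches, otherwise let a matching rule decide), not the XCSR mechanics or the paper's actual system, and all names are ours.

```python
# Generic sketch of a rule population with covering plus a neural fallback.
import numpy as np

class HybridClassifier:
    def __init__(self, nn_predict):
        self.nn_predict = nn_predict     # callable: feature vector -> class
        self.rules = []                  # list of (lower, upper, cls) interval rules

    def covering(self, x, cls, margin=0.1):
        # create a rule whose interval condition covers x (covering idea)
        self.rules.append((x - margin, x + margin, cls))

    def predict(self, x):
        for lo, hi, cls in self.rules:
            if np.all(x >= lo) and np.all(x <= hi):
                return cls               # decision justified by a matching rule
        cls = self.nn_predict(x)         # no rule matches: fall back to the network
        self.covering(x, cls)            # and cover this region for next time
        return cls

# toy usage with a trivial stand-in for the trained network
clf = HybridClassifier(nn_predict=lambda x: int(x.sum() > 0))
print(clf.predict(np.array([0.3, -0.1])), len(clf.rules))
```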


Sensors ◽  
2021 ◽  
Vol 21 (22) ◽  
pp. 7521
Author(s):  
Agnieszka Stankiewicz ◽  
Tomasz Marciniak ◽  
Adam Dabrowski ◽  
Marcin Stopa ◽  
Elzbieta Marciniak ◽  
...  

This paper proposes an efficient segmentation of the preretinal area between the inner limiting membrane (ILM) and the posterior cortical vitreous (PCV) of the human eye in images obtained with optical coherence tomography (OCT). The research was carried out using a database of three-dimensional OCT scans obtained with the Optovue RTVue XR Avanti device. Various types of neural networks (UNet, Attention UNet, ReLayNet, LFUNet) were tested for semantic segmentation; their effectiveness was assessed using the Dice coefficient and compared to graph-theory techniques. Segmentation efficiency was improved through the use of relative distance maps. We also show that selecting a larger kernel size for the convolutional layers can improve segmentation quality, depending on the neural network model. For the PCV, we obtain an effectiveness of up to 96.35%. The proposed solution can be widely used to diagnose vitreomacular traction changes, which is not yet available in scientific or commercial OCT imaging solutions.
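For reference, the evaluation metric used above is the Dice coefficient between a predicted and a reference segmentation mask; the small sketch below computes it on binary masks (our toy data, not the OCT scans).

```python
# Minimal sketch of the Dice coefficient for binary segmentation masks.
import numpy as np

def dice_coefficient(pred, target, eps=1e-7):
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

# toy usage on two small binary masks
pred = np.array([[1, 1, 0], [0, 1, 0]])
ref  = np.array([[1, 0, 0], [0, 1, 1]])
print(dice_coefficient(pred, ref))   # 2*2 / (3 + 3) ≈ 0.667
```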


Author(s):  
Jiaqi Guan ◽  
Yang Liu ◽  
Qiang Liu ◽  
Jian Peng

Deep neural networks have been remarkably successful in various AI tasks but often incur high computation and energy costs, which is problematic for energy-constrained applications such as mobile sensing. We address this problem by proposing a novel framework that optimizes prediction accuracy and energy cost simultaneously, thus enabling an effective cost-accuracy trade-off at test time. In our framework, each data instance is pushed into a cascade of deep neural networks of increasing size, and a selection module sequentially determines when a sufficiently accurate classifier can be used for this data instance. The cascade of neural networks and the selection module are jointly trained in an end-to-end fashion with the REINFORCE algorithm to optimize a trade-off between computational cost and predictive accuracy. Our method simultaneously improves accuracy and efficiency by learning to assign easy instances to fast yet sufficiently accurate classifiers to save computation and energy, while assigning harder instances to deeper and more powerful classifiers to ensure satisfactory accuracy. Moreover, we demonstrate the method's effectiveness with extensive experiments on CIFAR-10/100, ImageNet32x32, and the original ImageNet dataset.
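The test-time cascade can be sketched as below. For clarity, the learned selection module (trained with REINFORCE in the paper) is replaced here by a simple confidence threshold; the cascade structure and early exit are the point, and the models are toy stand-ins.

```python
# Hedged sketch of cascade inference with an early-exit rule.
import torch
import torch.nn.functional as F

def cascade_predict(models, x, threshold=0.9):
    """models: list of classifiers ordered from cheapest to most expensive."""
    for i, model in enumerate(models):
        probs = F.softmax(model(x), dim=-1)
        conf, pred = probs.max(dim=-1)
        if conf.item() >= threshold or i == len(models) - 1:
            return pred.item(), i   # class and index of the model that decided
    # unreachable: the last model always returns

# toy usage: three fully connected "networks of increasing size"
models = [torch.nn.Sequential(torch.nn.Linear(8, h), torch.nn.ReLU(),
                              torch.nn.Linear(h, 4))
          for h in (16, 64, 256)]
x = torch.randn(1, 8)
pred, used = cascade_predict(models, x)
print(f"class {pred} decided by model {used}")
```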

