A Black-Box Approach to Generate Adversarial Examples Against Deep Neural Networks for High Dimensional Input

Deep models have advanced prediction in many domains, but their lack of interpretability remains a key barrier to the adoption in many real world applications. There exists a large body of work aiming to help humans understand these black box functions to varying levels of granularity – for example, through distillation, gradients, or adversarial examples. These methods however, all tackle interpretability as a separate process after training. In this work, we take a different approach and explicitly regularize deep models so that they are well-approximated by processes that humans can step through in little time. Specifically, we train several families of deep neural networks to resemble compact, axis-aligned decision trees without significant compromises in accuracy. The resulting axis-aligned decision functions uniquely make tree regularized models easy for humans to interpret. Moreover, for situations in which a single, global tree is a poor estimator, we introduce a regional tree regularizer that encourages the deep model to resemble a compact, axis-aligned decision tree in predefined, human-interpretable contexts. Using intuitive toy examples, benchmark image datasets, and medical tasks for patients in critical care and with HIV, we demonstrate that this new family of tree regularizers yield models that are easier for humans to simulate than L1 or L2 penalties without sacrificing predictive power.

Download Full-text

AdverseGen: A Practical Tool for Generating Adversarial Examples to Deep Neural Networks Using Black-Box Approaches

Lecture Notes in Computer Science - Artificial Intelligence XXXVIII ◽

10.1007/978-3-030-91100-3_25 ◽

2021 ◽

pp. 313-326

Author(s):

Keyuan Zhang ◽

Kaiyue Wu ◽

Siyu Chen ◽

Yunce Zhao ◽

Xin Yao

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Black Box ◽

Adversarial Examples ◽

Practical Tool

Download Full-text

Representation of white- and black-box adversarial examples in deep neural networks and humans: A functional magnetic resonance imaging study

2019 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2019.8851763 ◽

2019 ◽

Author(s):

Chihye Han ◽

Wonjun Yoon ◽

Gihyun Kwon ◽

Daeshik Kim ◽

Seungkyu Nam

Keyword(s):

Magnetic Resonance Imaging ◽

Neural Networks ◽

Magnetic Resonance ◽

Deep Neural Networks ◽

Imaging Study ◽

Black Box ◽

Magnetic Resonance Imaging Study ◽

Functional Magnetic Resonance ◽

Resonance Imaging ◽

Adversarial Examples

Download Full-text

Boosting Targeted Black-Box Attacks via Ensemble Substitute Training and Linear Augmentation

Applied Sciences ◽

10.3390/app9112286 ◽

2019 ◽

Vol 9 (11) ◽

pp. 2286 ◽

Cited By ~ 6

Author(s):

Xianfeng Gao ◽

Yu-an Tan ◽

Hongwei Jiang ◽

Quanxin Zhang ◽

Xiaohui Kuang

Keyword(s):

Neural Networks ◽

Image Classification ◽

Deep Neural Networks ◽

Black Box ◽

Decision Boundary ◽

Success Rates ◽

Small Perturbations ◽

Targeted Attacks ◽

Adversarial Examples ◽

Effectiveness And Efficiency

These years, Deep Neural Networks (DNNs) have shown unprecedented performance in many areas. However, some recent studies revealed their vulnerability to small perturbations added on source inputs. Furthermore, we call the ways to generate these perturbations’ adversarial attacks, which contain two types, black-box and white-box attacks, according to the adversaries’ access to target models. In order to overcome the problem of black-box attackers’ unreachabilities to the internals of target DNN, many researchers put forward a series of strategies. Previous works include a method of training a local substitute model for the target black-box model via Jacobian-based augmentation and then use the substitute model to craft adversarial examples using white-box methods. In this work, we improve the dataset augmentation to make the substitute models better fit the decision boundary of the target model. Unlike the previous work that just performed the non-targeted attack, we make it first to generate targeted adversarial examples via training substitute models. Moreover, to boost the targeted attacks, we apply the idea of ensemble attacks to the substitute training. Experiments on MNIST and GTSRB, two common datasets for image classification, demonstrate our effectiveness and efficiency of boosting a targeted black-box attack, and we finally attack the MNIST and GTSRB classifiers with the success rates of 97.7% and 92.8%.

Download Full-text

Natural Scene Statistics for Detecting Adversarial Examples in Deep Neural Networks

2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP) ◽

10.1109/mmsp48831.2020.9287056 ◽

2020 ◽

Author(s):

Anouar Kherchouche ◽

Sid Ahmed Fezza ◽

Wassim Hamidouche ◽

Olivier Deforges

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Natural Scene ◽

Natural Scene Statistics ◽

Adversarial Examples

Download Full-text

Efficient approximation of solutions of parametric linear transport equations by ReLU DNNs

Advances in Computational Mathematics ◽

10.1007/s10444-020-09834-7 ◽

2021 ◽

Vol 47 (1) ◽

Author(s):

Fabian Laakmann ◽

Philipp Petersen

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Initial Conditions ◽

Activation Function ◽

Transport Equations ◽

High Dimensional ◽

Linear Transport ◽

Approximation Rates ◽

Curse Of Dimension ◽

Efficient Approximation

AbstractWe demonstrate that deep neural networks with the ReLU activation function can efficiently approximate the solutions of various types of parametric linear transport equations. For non-smooth initial conditions, the solutions of these PDEs are high-dimensional and non-smooth. Therefore, approximation of these functions suffers from a curse of dimension. We demonstrate that through their inherent compositionality deep neural networks can resolve the characteristic flow underlying the transport equations and thereby allow approximation rates independent of the parameter dimension.

Download Full-text

Hybrid deep neural networks to infer state models of black-box systems

Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering ◽

10.1145/3324884.3416559 ◽

2020 ◽

Author(s):

Mohammad Jafar Mashhadi ◽

Hadi Hemmati

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Black Box ◽

State Models

Download Full-text

Diversity Adversarial Training against Adversarial Attack on Deep Neural Networks

Symmetry ◽

10.3390/sym13030428 ◽

2021 ◽

Vol 13 (3) ◽

pp. 428

Author(s):

Hyun Kwon ◽

Jun Lee

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Diversity Training ◽

Original Data ◽

Training Method ◽

Learning Framework ◽

Adversarial Examples ◽

Adversarial Training ◽

Adversarial Attack ◽

Accuracy Rates

This paper presents research focusing on visualization and pattern recognition based on computer science. Although deep neural networks demonstrate satisfactory performance regarding image and voice recognition, as well as pattern analysis and intrusion detection, they exhibit inferior performance towards adversarial examples. Noise introduction, to some degree, to the original data could lead adversarial examples to be misclassified by deep neural networks, even though they can still be deemed as normal by humans. In this paper, a robust diversity adversarial training method against adversarial attacks was demonstrated. In this approach, the target model is more robust to unknown adversarial examples, as it trains various adversarial samples. During the experiment, Tensorflow was employed as our deep learning framework, while MNIST and Fashion-MNIST were used as experimental datasets. Results revealed that the diversity training method has lowered the attack success rate by an average of 27.2 and 24.3% for various adversarial examples, while maintaining the 98.7 and 91.5% accuracy rates regarding the original data of MNIST and Fashion-MNIST.

Download Full-text

Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units

Communications in Computational Physics ◽

10.4208/cicp.oa-2019-0168 ◽

2020 ◽

Vol 27 (2) ◽

pp. 379-411 ◽

Cited By ~ 1

Author(s):

Bo Li

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

High Dimensional ◽

Smooth Functions

Download Full-text

SADA: Semantic Adversarial Diagnostic Attacks for Autonomous Applications

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6722 ◽

2020 ◽

Vol 34 (07) ◽

pp. 10901-10908 ◽

Cited By ~ 2

Author(s):

Abdullah Hamdi ◽

Matthias Mueller ◽

Bernard Ghanem

Keyword(s):

Neural Networks ◽

Recent Work ◽

Autonomous Navigation ◽

General Framework ◽

Deep Neural Networks ◽

Autonomous Driving ◽

Black Box ◽

Semantic Meaning ◽

Safety Critical ◽

Adversarial Attack

One major factor impeding more widespread adoption of deep neural networks (DNNs) is their lack of robustness, which is essential for safety-critical applications such as autonomous driving. This has motivated much recent work on adversarial attacks for DNNs, which mostly focus on pixel-level perturbations void of semantic meaning. In contrast, we present a general framework for adversarial attacks on trained agents, which covers semantic perturbations to the environment of the agent performing the task as well as pixel-level attacks. To do this, we re-frame the adversarial attack problem as learning a distribution of parameters that always fools the agent. In the semantic case, our proposed adversary (denoted as BBGAN) is trained to sample parameters that describe the environment with which the black-box agent interacts, such that the agent performs its dedicated task poorly in this environment. We apply BBGAN on three different tasks, primarily targeting aspects of autonomous navigation: object detection, self-driving, and autonomous UAV racing. On these tasks, BBGAN can generate failure cases that consistently fool a trained agent.

Download Full-text