Bayesian Adversarial Attack on Graph Neural Networks (Student Abstract)

2020 ◽  
Vol 34 (10) ◽  
pp. 13867-13868
Author(s):  
Xiao Liu ◽  
Jing Zhao ◽  
Shiliang Sun

Adversarial attack on graph neural networks (GNNs) is distinctive in that it often jointly exploits the available nodes to generate a graph as an adversarial example. Existing attack approaches usually assume that the entire training set is available, which may be impractical. In this paper, we propose a novel Bayesian adversarial attack approach based on projected gradient descent optimization, called the Bayesian PGD attack, which yields more general adversarial examples than deterministic attack approaches. Using the same partial dataset as deterministic attack approaches, the adversarial examples generated by our approach cause the GNN to have a higher misclassification rate on graph node classification. Specifically, in our approach the edge perturbation Z used for generating adversarial examples is viewed as a random variable with a scale constraint, and the optimization target is to maximize the KL divergence between its true posterior distribution p(Z|D) and its approximate variational distribution qθ(Z). We experimentally find that attack performance decreases as the number of available nodes is reduced, and that the effect of attacking with different nodes varies greatly, especially when the number of nodes is small. Through experimental comparison with state-of-the-art attack approaches on GNNs, our approach is demonstrated to have better and more robust attack performance.
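The scale-constrained perturbation update at the heart of PGD-style attacks can be sketched as follows. This is an illustrative toy, not the paper's implementation; the function names and the L-infinity form of the scale constraint are assumptions.

```python
import numpy as np

def project_budget(z, eps):
    """Project the perturbation z back onto the L-infinity ball of
    radius eps (one concrete form of a scale constraint)."""
    return np.clip(z, -eps, eps)

def pgd_step(z, grad, lr, eps):
    """One gradient-ascent step on the attack objective, followed by
    projection onto the perturbation budget."""
    return project_budget(z + lr * grad, eps)

# Toy check: an oversized gradient step is pulled back inside the budget.
z = np.zeros(4)
grad = np.array([10.0, -10.0, 0.5, -0.5])
z = pgd_step(z, grad, lr=1.0, eps=1.0)
print(z)  # -> [ 1.  -1.   0.5 -0.5]
```

In the Bayesian variant described above, z would be a sample from the variational distribution qθ(Z) rather than a single deterministic tensor, but the project-after-step structure is the same.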

Complexity ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-10
Author(s):  
Shize Huang ◽  
Xiaowen Liu ◽  
Xiaolu Yang ◽  
Zhaoxin Zhang ◽  
Lingyu Yang

Trams have increasingly deployed object detectors to perceive running conditions, and deep learning networks have been widely adopted by those detectors. The growth of neural networks has brought severe attacks such as adversarial example attacks, posing threats to tram safety. Only if adversarial attacks are studied thoroughly can researchers devise better defence methods against them. However, most existing methods of generating adversarial examples have been devoted to classification, and none of them target tram environment perception systems. In this paper, we propose an improved projected gradient descent (PGD) algorithm and an improved Carlini and Wagner (C&W) algorithm to generate adversarial examples against Faster R-CNN object detectors. Experiments verify that both algorithms can successfully conduct nontargeted and targeted white-box digital attacks while trams are running. We also compare the performance of the two methods, including attack effects, similarity to clean images, and generation time. The results show that both algorithms can generate adversarial examples within 220 seconds, a much shorter time, without a decrease in success rate.
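A generic L-infinity PGD loop of the kind this abstract builds on can be sketched as below. The gradient oracle `grad_fn` stands in for backpropagation through a detector's loss; the step sizes and budget are conventional example values, not the paper's settings.

```python
import numpy as np

def pgd_attack(image, grad_fn, eps=8 / 255, alpha=2 / 255, steps=10):
    """Iterative sign-gradient ascent on the attack loss, keeping the
    total perturbation within eps and pixels in the valid [0, 1] range.
    grad_fn(x) returns the gradient of the attack loss w.r.t. x."""
    adv = image.copy()
    for _ in range(steps):
        adv = adv + alpha * np.sign(grad_fn(adv))
        adv = np.clip(adv, image - eps, image + eps)  # perturbation budget
        adv = np.clip(adv, 0.0, 1.0)                  # valid pixel range
    return adv

# Toy check with a surrogate loss whose gradient is constant:
# the perturbation saturates exactly at the eps budget.
img = np.full(5, 0.5)
adv = pgd_attack(img, lambda x: np.ones_like(x))
print(np.max(np.abs(adv - img)))  # -> 0.0313725... (= 8/255)
```

For a targeted attack the same loop descends (rather than ascends) a loss defined on the target class; against a detector, the loss is summed over region proposals instead of a single logit.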


2020 ◽  
Vol 34 (04) ◽  
pp. 3486-3494
Author(s):  
Jinghui Chen ◽  
Dongruo Zhou ◽  
Jinfeng Yi ◽  
Quanquan Gu

Depending on how much information an adversary can access, adversarial attacks can be classified as white-box attacks and black-box attacks. For white-box attacks, optimization-based attack algorithms such as projected gradient descent (PGD) can achieve relatively high attack success rates within a moderate number of iterations. However, they tend to generate adversarial examples near or on the boundary of the perturbation set, resulting in large distortion. Furthermore, their corresponding black-box attack algorithms suffer from high query complexity, limiting their practical usefulness. In this paper, we focus on developing efficient and effective optimization-based adversarial attack algorithms. In particular, we propose a novel adversarial attack framework for both white-box and black-box settings based on a variant of the Frank-Wolfe algorithm. We show in theory that the proposed attack algorithms are efficient with an O(1/√T) convergence rate. Empirical results of attacking the ImageNet and MNIST datasets also verify the efficiency and effectiveness of the proposed algorithms. More specifically, our proposed algorithms attain the best attack performance in both white-box and black-box attacks among all baselines, and are more time- and query-efficient than the state of the art.
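The contrast with PGD can be made concrete with a vanilla Frank-Wolfe step: instead of projecting after each gradient step, each iterate is a convex combination of feasible points, so it stays strictly inside the perturbation ball rather than on its boundary. This is a textbook sketch under an assumed L-infinity constraint, not the paper's exact variant.

```python
import numpy as np

def frank_wolfe_attack(x0, grad_fn, eps, steps=20):
    """Projection-free attack over the L-infinity ball around x0.
    The linear maximization oracle returns a ball vertex; the iterate
    moves toward it with the classic 2/(t+2) step size."""
    x = x0.copy()
    for t in range(steps):
        g = grad_fn(x)
        v = x0 + eps * np.sign(g)   # vertex maximizing the linearized loss
        gamma = 2.0 / (t + 2.0)     # standard Frank-Wolfe step size
        x = x + gamma * (v - x)     # convex combination: no projection
    return x

x0 = np.zeros(3)
adv = frank_wolfe_attack(x0, lambda x: np.array([1.0, -1.0, 1.0]), eps=0.1)
print(np.max(np.abs(adv - x0)) <= 0.1)  # -> True: feasible by construction
```

Because feasibility holds by construction, no projection step is needed; this is the property that lets the iterates avoid the large boundary distortions noted above.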


2021 ◽  
Author(s):  
Yinghui Zhu ◽  
Yuzhen Jiang

Adversarial examples are artificially crafted to mislead deep learning systems into making wrong decisions. In the research on attack algorithms against multi-class image classifiers, an improved strategy that applies category explanation to control the generation of targeted adversarial examples is proposed to reduce perturbation noise and improve adversarial robustness. Building on the C&W adversarial attack algorithm, the method uses Grad-CAM, a category visualization explanation algorithm for CNNs, to dynamically obtain the salient regions according to the signal features of the source and target categories during the iterative generation process. An adversarial example with non-global perturbation is finally achieved by gradually shielding the non-salient regions and fine-tuning the perturbation signals. Compared with similar algorithms under the same conditions, the method enhances the effect of the original image's category signal on the perturbation position. Experimental results show that the improved adversarial examples have higher PSNR. In addition, across a variety of defense processing tests, the examples maintain high adversarial performance and show strong attack robustness.
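The "shield the non-salient regions" idea amounts to masking the perturbation with a saliency map before applying it. A minimal sketch, assuming a Grad-CAM-like heat map is already available as an array (the function name and the quantile-based thresholding are illustrative choices):

```python
import numpy as np

def mask_perturbation(delta, saliency, keep_ratio=0.3):
    """Keep the perturbation only where the saliency map (e.g. a
    Grad-CAM heat map) is strongest, zeroing it elsewhere so the
    noise stays non-global and less visible."""
    thresh = np.quantile(saliency, 1.0 - keep_ratio)
    mask = (saliency >= thresh).astype(delta.dtype)
    return delta * mask

delta = np.ones((4, 4))                              # uniform toy perturbation
saliency = np.arange(16, dtype=float).reshape(4, 4)  # stand-in for a CAM map
masked = mask_perturbation(delta, saliency, keep_ratio=0.25)
print(int(masked.sum()))  # -> 4 (only the top 25% most salient pixels keep noise)
```

In the iterative scheme described above, the mask would be recomputed from the source and target categories' activations at each C&W iteration rather than fixed once.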


2020 ◽  
Vol 34 (04) ◽  
pp. 4908-4915 ◽  
Author(s):  
Xiaolei Liu ◽  
Kun Wan ◽  
Yufei Ding ◽  
Xiaosong Zhang ◽  
Qingxin Zhu

Recent studies have highlighted audio adversarial examples as a ubiquitous threat to state-of-the-art automatic speech recognition systems. Thorough studies on how to effectively generate adversarial examples are essential to prevent potential attacks. Despite much research on this topic, the efficiency and robustness of existing works are not yet satisfactory. In this paper, we propose weighted-sampling audio adversarial examples, focusing on the number and the weight of distortions to reinforce the attack. Further, we apply a denoising method in the loss function to make the adversarial attack more imperceptible. Experiments show that our method is the first in the field to generate audio adversarial examples with low noise and high audio robustness at the minute level of time consumption.
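The "number and weight of distortions" idea can be illustrated by perturbing only a sampled subset of audio positions with a controllable magnitude, rather than distorting every sample. A hypothetical sketch of the sampling step only (names and parameters are assumptions, not the paper's algorithm):

```python
import numpy as np

def weighted_sample_perturbation(audio, grad, num_points, weight):
    """Apply a sign-gradient distortion of magnitude `weight` at
    `num_points` randomly sampled positions, leaving the rest of the
    waveform untouched."""
    rng = np.random.default_rng(0)
    idx = rng.choice(audio.size, size=num_points, replace=False)
    delta = np.zeros_like(audio)
    delta[idx] = weight * np.sign(grad[idx])
    return audio + delta

audio = np.zeros(1000)
grad = np.ones(1000)
adv = weighted_sample_perturbation(audio, grad, num_points=50, weight=0.01)
print(int(np.count_nonzero(adv - audio)))  # -> 50
```

Sparser, smaller distortions trade attack strength for imperceptibility; the denoising term in the loss mentioned above pushes the optimization in the same direction.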


2020 ◽  
Vol 34 (07) ◽  
pp. 12338-12345 ◽  
Author(s):  
Zhipeng Wei ◽  
Jingjing Chen ◽  
Xingxing Wei ◽  
Linxi Jiang ◽  
Tat-Seng Chua ◽  
...  

We study the problem of attacking video recognition models in the black-box setting, where the model information is unknown and the adversary can only make queries to detect the predicted top-1 class and its probability. Compared with black-box attacks on images, attacking videos is more challenging because the computation cost of searching for adversarial perturbations on a video is much higher due to its high dimensionality. To overcome this challenge, we propose a heuristic black-box attack model that generates adversarial perturbations only on selected frames and regions. More specifically, a heuristic-based algorithm is proposed to measure the importance of each frame in the video towards generating the adversarial example. Based on the frames' importance, the proposed algorithm heuristically searches for a subset of frames where the generated adversarial example has strong attack ability while keeping the perturbations below the given bound. In addition, to further boost attack efficiency, we propose to generate the perturbations only on the salient regions of the selected frames. In this way, the generated perturbations are sparse in both the temporal and spatial domains. Experimental results of attacking two mainstream video recognition methods on the UCF-101 and HMDB-51 datasets demonstrate that the proposed heuristic black-box adversarial attack method can significantly reduce the computation cost, with more than a 28% reduction in query numbers for the untargeted attack on both datasets.
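Once a per-frame importance score is available (for example, the drop in top-1 probability when that frame alone is perturbed), the frame-selection step reduces to a greedy top-k pick. A minimal sketch with hypothetical names, not the paper's exact heuristic:

```python
import numpy as np

def select_frames(frame_scores, budget):
    """Greedily pick the `budget` most important frames according to a
    precomputed importance score, returning their indices in order."""
    order = np.argsort(frame_scores)[::-1]  # most important first
    return sorted(order[:budget].tolist())

scores = np.array([0.1, 0.9, 0.3, 0.8, 0.05])  # toy importance of 5 frames
print(select_frames(scores, budget=2))  # -> [1, 3]
```

Restricting the perturbation search to these frames (and, per the abstract, to their salient regions) shrinks the search space, which is what drives the reported query savings.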


Sensors ◽  
2021 ◽  
Vol 21 (11) ◽  
pp. 3922
Author(s):  
Sheeba Lal ◽  
Saeed Ur Rehman ◽  
Jamal Hussain Shah ◽  
Talha Meraj ◽  
Hafiz Tayyab Rauf ◽  
...  

Due to the rapid growth of artificial intelligence (AI) and deep learning (DL) approaches, the security and robustness of deployed algorithms need to be guaranteed. The susceptibility of DL algorithms to adversarial examples has been widely acknowledged: artificially crafted examples lead DL models to misclassify instances that humans would consider benign, and their effects appear in practical applications in real physical scenarios. Thus, adversarial attacks and defenses, including their implications for machine learning reliability, have drawn growing interest and have been a hot topic of research in recent years. We introduce a framework that provides a defensive model against the adversarial speckle-noise attack, combining adversarial training with a feature fusion strategy that preserves classification with correct labelling. We evaluate and analyze adversarial attacks and defenses on retinal fundus images for the Diabetic Retinopathy recognition problem, a state-of-the-art endeavor. Results obtained on the retinal fundus images, which are prone to adversarial attacks, reach 99% accuracy and show that the proposed defensive model is robust.
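Speckle noise, the attack this defense targets, is multiplicative: each pixel is scaled by a random factor rather than shifted by one. A minimal sketch of how such corrupted images could be generated for adversarial training (sigma and the Gaussian model are assumptions for illustration):

```python
import numpy as np

def speckle_noise(image, sigma=0.1, seed=0):
    """Multiplicative speckle corruption: each pixel is scaled by
    (1 + n) with n ~ N(0, sigma^2), then clipped to valid range.
    Training on such images is one way to harden a classifier."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, sigma, size=image.shape)
    return np.clip(image * (1.0 + noise), 0.0, 1.0)

clean = np.full((8, 8), 0.5)  # toy stand-in for a fundus image patch
noisy = speckle_noise(clean)
print(noisy.shape == clean.shape)  # -> True
```

Because the noise scales with pixel intensity, bright retinal regions are perturbed more than dark background, which is what distinguishes this attack from additive Gaussian noise.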


Electronics ◽  
2020 ◽  
Vol 10 (1) ◽  
pp. 52
Author(s):  
Richard Evan Sutanto ◽  
Sukho Lee

Several recent studies have shown that artificial intelligence (AI) systems can malfunction due to intentionally manipulated data coming through normal channels. Such manipulated data are called adversarial examples. Adversarial examples can pose a major threat to an AI-led society when an attacker uses them as a means to attack an AI system, which is called an adversarial attack. Therefore, major IT companies such as Google are now studying ways to build AI systems that are robust against adversarial attacks by developing effective defense methods. However, one reason it is difficult to establish an effective defense system is that it is hard to know in advance what kind of adversarial attack method the opponent is using. Therefore, in this paper, we propose a method to detect adversarial noise without knowledge of the kind of adversarial noise used by the attacker. To this end, we propose a blurring network that is trained only on normal images and use it as the initial condition of the Deep Image Prior (DIP) network. This is in contrast to other neural-network-based detection methods, which require many adversarial noisy images to train the neural network. Experimental results indicate the validity of the proposed method.
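The underlying intuition, that adversarial noise leaves a larger high-frequency residual after smoothing than a natural image does, can be shown with a crude stand-in. Here a simple box blur replaces the trained blurring network and a fixed threshold replaces the learned decision rule; both are illustrative assumptions, not the paper's method.

```python
import numpy as np

def box_blur(image, k=3):
    """Naive box blur as a stand-in for the trained blurring network."""
    pad = k // 2
    padded = np.pad(image, pad, mode="edge")
    out = np.zeros_like(image)
    h, w = image.shape
    for i in range(h):
        for j in range(w):
            out[i, j] = padded[i:i + k, j:j + k].mean()
    return out

def is_adversarial(image, threshold):
    """Flag an image whose high-frequency residual after blurring is
    unusually large, as adversarial noise tends to be."""
    residual = np.abs(image - box_blur(image)).mean()
    return residual > threshold

smooth = np.full((16, 16), 0.5)
rng = np.random.default_rng(0)
noisy = np.clip(smooth + rng.normal(0, 0.2, smooth.shape), 0, 1)
print(is_adversarial(smooth, 0.05), is_adversarial(noisy, 0.05))  # -> False True
```

The DIP step described above refines this idea: starting reconstruction from a blurring network trained only on clean images means the detector never needs examples of the attacker's noise.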


Symmetry ◽  
2021 ◽  
Vol 13 (3) ◽  
pp. 428
Author(s):  
Hyun Kwon ◽  
Jun Lee

This paper presents research focusing on visualization and pattern recognition based on computer science. Although deep neural networks demonstrate satisfactory performance in image and voice recognition, as well as pattern analysis and intrusion detection, they perform poorly on adversarial examples. Introducing a certain degree of noise to the original data can cause adversarial examples to be misclassified by deep neural networks, even though humans still deem them normal. In this paper, a robust diversity adversarial training method against adversarial attacks is demonstrated. In this approach, the target model becomes more robust to unknown adversarial examples because it is trained on various adversarial samples. In the experiments, TensorFlow was employed as the deep learning framework, while MNIST and Fashion-MNIST were used as the datasets. Results revealed that the diversity training method lowered the attack success rate by an average of 27.2% and 24.3% for various adversarial examples, while maintaining accuracy rates of 98.7% and 91.5% on the original MNIST and Fashion-MNIST data, respectively.
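The core of diversity adversarial training is building batches that mix clean data with adversarial variants produced by several different attacks. A minimal sketch with toy attack functions standing in for real attack algorithms (all names and magnitudes here are illustrative assumptions):

```python
import numpy as np

def diversity_batch(clean, attacks, rng):
    """Concatenate the clean batch with one adversarial variant per
    attack function, so each training step sees diverse attacks."""
    batch = [clean]
    for attack in attacks:
        batch.append(attack(clean, rng))
    return np.concatenate(batch, axis=0)

# Toy stand-ins for two distinct attack styles.
sign_like = lambda x, rng: np.clip(x + 0.1 * np.sign(rng.normal(size=x.shape)), 0, 1)
noise_like = lambda x, rng: np.clip(x + rng.uniform(-0.1, 0.1, x.shape), 0, 1)

clean = np.full((4, 28 * 28), 0.5)  # 4 flattened MNIST-sized toy images
batch = diversity_batch(clean, [sign_like, noise_like], np.random.default_rng(0))
print(batch.shape)  # -> (12, 784)
```

Training the target model on such mixed batches, rather than on a single attack type, is what the abstract credits for the improved robustness to unknown adversarial examples.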

