A New Ensemble Adversarial Attack Powered by Long-Term Gradient Memories

Zhaohui Che; Ali Borji; Guangtao Zhai; Suiyi Ling; Jing Li; Patrick Le Callet

doi:10.1609/aaai.v34i04.5743

A New Ensemble Adversarial Attack Powered by Long-Term Gradient Memories

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5743 ◽

2020 ◽

Vol 34 (04) ◽

pp. 3405-3413

Author(s):

Zhaohui Che ◽

Ali Borji ◽

Guangtao Zhai ◽

Suiyi Ling ◽

Jing Li ◽

...

Keyword(s):

Broad Class ◽

Black Box ◽

Security Threat ◽

Source Models ◽

Adversarial Examples ◽

Adversarial Attack ◽

Prediction Systems ◽

Attack And Defense ◽

Decision Boundaries

Deep neural networks are vulnerable to adversarial attacks. More importantly, some adversarial examples crafted against an ensemble of pre-trained source models can transfer to other new target models, thus pose a security threat to black-box applications (when the attackers have no access to the target models). Despite adopting diverse architectures and parameters, source and target models often share similar decision boundaries. Therefore, if an adversary is capable of fooling several source models concurrently, it can potentially capture intrinsic transferable adversarial information that may allow it to fool a broad class of other black-box target models. Current ensemble attacks, however, only consider a limited number of source models to craft an adversary, and obtain poor transferability. In this paper, we propose a novel black-box attack, dubbed Serial-Mini-Batch-Ensemble-Attack (SMBEA). SMBEA divides a large number of pre-trained source models into several mini-batches. For each single batch, we design 3 new ensemble strategies to improve the intra-batch transferability. Besides, we propose a new algorithm that recursively accumulates the “long-term” gradient memories of the previous batch to the following batch. This way, the learned adversarial information can be preserved and the inter-batch transferability can be improved. Experiments indicate that our method outperforms state-of-the-art ensemble attacks over multiple pixel-to-pixel vision tasks including image translation and salient region prediction. Our method successfully fools two online black-box saliency prediction systems including DeepGaze-II (Kummerer 2017) and SALICON (Huang et al. 2017). Finally, we also contribute a new repository to promote the research on adversarial attack and defense over pixel-to-pixel tasks: https://github.com/CZHQuality/AAA-Pix2pix.

Download Full-text

Cycle-Consistent Adversarial GAN: The Integration of Adversarial Attack and Defense

Security and Communication Networks ◽

10.1155/2020/3608173 ◽

2020 ◽

Vol 2020 ◽

pp. 1-9 ◽

Cited By ~ 1

Author(s):

Lingyun Jiang ◽

Kai Qiao ◽

Ruoxi Qin ◽

Linyuan Wang ◽

Wanting Yu ◽

...

Keyword(s):

Deep Learning ◽

Deep Neural Networks ◽

State Of The Art ◽

Small Magnitude ◽

Defense Strategies ◽

Adversarial Examples ◽

Adversarial Attack ◽

Public Datasets ◽

Attack And Defense

In image classification of deep learning, adversarial examples where input is intended to add small magnitude perturbations may mislead deep neural networks (DNNs) to incorrect results, which means DNNs are vulnerable to them. Different attack and defense strategies have been proposed to better research the mechanism of deep learning. However, those researches in these networks are only for one aspect, either an attack or a defense. There is in the improvement of offensive and defensive performance, and it is difficult to promote each other in the same framework. In this paper, we propose Cycle-Consistent Adversarial GAN (CycleAdvGAN) to generate adversarial examples, which can learn and approximate the distribution of the original instances and adversarial examples, especially promoting attackers and defenders to confront each other and improve their ability. For CycleAdvGAN, once the GeneratorA and D are trained, GA can generate adversarial perturbations efﬁciently for any instance, improving the performance of the existing attack methods, and GD can generate recovery adversarial examples to clean instances, defending against existing attack methods. We apply CycleAdvGAN under semiwhite-box and black-box settings on two public datasets MNIST and CIFAR10. Using the extensive experiments, we show that our method has achieved the state-of-the-art adversarial attack method and also has efficiently improved the defense ability, which made the integration of adversarial attack and defense come true. In addition, it has improved the attack effect only trained on the adversarial dataset generated by any kind of adversarial attack.

Download Full-text

Adversarial Attack and Defense on Deep Neural Network-Based Voice Processing Systems: An Overview

Applied Sciences ◽

10.3390/app11188450 ◽

2021 ◽

Vol 11 (18) ◽

pp. 8450

Author(s):

Xiaojiao Chen ◽

Sheng Li ◽

Hao Huang

Keyword(s):

Deep Neural Networks ◽

Daily Lives ◽

Systematic Classification ◽

Online Purchases ◽

Adversarial Examples ◽

Adversarial Attack ◽

Attack And Defense ◽

Voice Processing ◽

Significant Attention

Voice Processing Systems (VPSes), now widely deployed, have become deeply involved in people’s daily lives, helping drive the car, unlock the smartphone, make online purchases, etc. Unfortunately, recent research has shown that those systems based on deep neural networks are vulnerable to adversarial examples, which attract significant attention to VPS security. This review presents a detailed introduction to the background knowledge of adversarial attacks, including the generation of adversarial examples, psychoacoustic models, and evaluation indicators. Then we provide a concise introduction to defense methods against adversarial attacks. Finally, we propose a systematic classification of adversarial attacks and defense methods, with which we hope to provide a better understanding of the classification and structure for beginners in this field.

Download Full-text

A Survey on Universal Adversarial Attack

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/635 ◽

2021 ◽

Author(s):

Chaoning Zhang ◽

Philipp Benz ◽

Chenguo Lin ◽

Adil Karjauv ◽

Jing Wu ◽

...

Keyword(s):

Machine Learning ◽

Recent Progress ◽

New Findings ◽

Wide Range ◽

Adversarial Examples ◽

Adversarial Attack ◽

Attack And Defense ◽

Audio Video ◽

Significant Attention

The intriguing phenomenon of adversarial examples has attracted significant attention in machine learning and what might be more surprising to the community is the existence of universal adversarial perturbations (UAPs), i.e. a single perturbation to fool the target DNN for most images. With the focus on UAP against deep classifiers, this survey summarizes the recent progress on universal adversarial attacks, discussing the challenges from both the attack and defense sides, as well as the reason for the existence of UAP. We aim to extend this work as a dynamic survey that will regularly update its content to follow new works regarding UAP or universal attack in a wide range of domains, such as image, audio, video, text, etc. Relevant updates will be discussed at: https://bit.ly/2SbQlLG. We welcome authors of future works in this field to contact us for including your new findings.

Download Full-text

Adv-Makeup: A New Imperceptible and Transferable Attack on Face Recognition

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/173 ◽

2021 ◽

Author(s):

Bangjie Yin ◽

Wenxuan Wang ◽

Taiping Yao ◽

Junfeng Guo ◽

Zelun Kong ◽

...

Keyword(s):

Face Recognition ◽

Black Box ◽

Fine Grained ◽

Box Models ◽

Meta Learning ◽

Adversarial Examples ◽

Adversarial Attack ◽

Face Generation ◽

Orbital Region ◽

Black Box Models

Deep neural networks, particularly face recognition models, have been shown to be vulnerable to both digital and physical adversarial examples. However, existing adversarial examples against face recognition systems either lack transferability to black-box models, or fail to be implemented in practice. In this paper, we propose a unified adversarial face generation method - Adv-Makeup, which can realize imperceptible and transferable attack under the black-box setting. Adv-Makeup develops a task-driven makeup generation method with the blending module to synthesize imperceptible eye shadow over the orbital region on faces. And to achieve transferability, Adv-Makeup implements a fine-grained meta-learning based adversarial attack strategy to learn more vulnerable or sensitive features from various models. Compared to existing techniques, sufficient visualization results demonstrate that Adv-Makeup is capable to generate much more imperceptible attacks under both digital and physical scenarios. Meanwhile, extensive quantitative experiments show that Adv-Makeup can significantly improve the attack success rate under black-box setting, even attacking commercial systems.

Download Full-text

Generating adversarial examples without specifying a target model

PeerJ Computer Science ◽

10.7717/peerj-cs.702 ◽

2021 ◽

Vol 7 ◽

pp. e702

Author(s):

Gaoming Yang ◽

Mingwei Li ◽

Xianjing Fang ◽

Ji Zhang ◽

Xingzhu Liang

Keyword(s):

Deep Learning ◽

Success Rate ◽

Black Box ◽

Time Cost ◽

Learning Models ◽

Security Threat ◽

Practical Situation ◽

Data Set ◽

Target Model ◽

Adversarial Examples

Adversarial examples are regarded as a security threat to deep learning models, and there are many ways to generate them. However, most existing methods require the query authority of the target during their work. In a more practical situation, the attacker will be easily detected because of too many queries, and this problem is especially obvious under the black-box setting. To solve the problem, we propose the Attack Without a Target Model (AWTM). Our algorithm does not specify any target model in generating adversarial examples, so it does not need to query the target. Experimental results show that it achieved a maximum attack success rate of 81.78% in the MNIST data set and 87.99% in the CIFAR-10 data set. In addition, it has a low time cost because it is a GAN-based method.

Download Full-text

A Hybrid Adversarial Attack for Different Application Scenarios

Applied Sciences ◽

10.3390/app10103559 ◽

2020 ◽

Vol 10 (10) ◽

pp. 3559 ◽

Cited By ~ 1

Author(s):

Xiaohu Du ◽

Jie Yu ◽

Zibo Yi ◽

Shasha Li ◽

Jun Ma ◽

...

Keyword(s):

Deep Learning ◽

Success Rate ◽

Black Box ◽

De Algorithm ◽

Word Level ◽

Text Readability ◽

Gradient Based ◽

Adversarial Examples ◽

Adversarial Attack ◽

Cosine Distance

Adversarial attack against natural language has been a hot topic in the field of artificial intelligence security in recent years. It is mainly to study the methods and implementation of generating adversarial examples. The purpose is to better deal with the vulnerability and security of deep learning systems. According to whether the attacker understands the deep learning model structure, the adversarial attack is divided into black-box attack and white-box attack. In this paper, we propose a hybrid adversarial attack for different application scenarios. Firstly, we propose a novel black-box attack method of generating adversarial examples to trick the word-level sentiment classifier, which is based on differential evolution (DE) algorithm to generate semantically and syntactically similar adversarial examples. Compared with existing genetic algorithm based adversarial attacks, our algorithm can achieve a higher attack success rate while maintaining a lower word replacement rate. At the 10% word substitution threshold, we have increased the attack success rate from 58.5% to 63%. Secondly, when we understand the model architecture and parameters, etc., we propose a white-box attack with gradient-based perturbation against the same sentiment classifier. In this attack, we use a Euclidean distance and cosine distance combined metric to find the most semantically and syntactically similar substitution, and we introduce the coefficient of variation (CV) factor to control the dispersion of the modified words in the adversarial examples. More dispersed modifications can increase human imperceptibility and text readability. Compared with the existing global attack, our attack can increase the attack success rate and make modification positions in generated examples more dispersed. We’ve increased the global search success rate from 75.8% to 85.8%. Finally, we can deal with different application scenarios by using these two attack methods, that is, whether we understand the internal structure and parameters of the model, we can all generate good adversarial examples.

Download Full-text

A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5753 ◽

2020 ◽

Vol 34 (04) ◽

pp. 3486-3494

Author(s):

Jinghui Chen ◽

Dongruo Zhou ◽

Jinfeng Yi ◽

Quanquan Gu

Keyword(s):

Gradient Descent ◽

State Of The Art ◽

Black Box ◽

Success Rates ◽

Practical Usefulness ◽

Efficiency And Effectiveness ◽

Large Distortion ◽

Adversarial Examples ◽

Adversarial Attack ◽

Projected Gradient Descent

Depending on how much information an adversary can access to, adversarial attacks can be classified as white-box attack and black-box attack. For white-box attack, optimization-based attack algorithms such as projected gradient descent (PGD) can achieve relatively high attack success rates within moderate iterates. However, they tend to generate adversarial examples near or upon the boundary of the perturbation set, resulting in large distortion. Furthermore, their corresponding black-box attack algorithms also suffer from high query complexities, thereby limiting their practical usefulness. In this paper, we focus on the problem of developing efficient and effective optimization-based adversarial attack algorithms. In particular, we propose a novel adversarial attack framework for both white-box and black-box settings based on a variant of Frank-Wolfe algorithm. We show in theory that the proposed attack algorithms are efficient with an O(1/√T) convergence rate. The empirical results of attacking the ImageNet and MNIST datasets also verify the efficiency and effectiveness of the proposed algorithms. More specifically, our proposed algorithms attain the best attack performances in both white-box and black-box attacks among all baselines, and are more time and query efficient than the state-of-the-art.

Download Full-text

Heuristic Black-Box Adversarial Attacks on Video Recognition Models

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6918 ◽

2020 ◽

Vol 34 (07) ◽

pp. 12338-12345 ◽

Cited By ~ 1

Author(s):

Zhipeng Wei ◽

Jingjing Chen ◽

Xingxing Wei ◽

Linxi Jiang ◽

Tat-Seng Chua ◽

...

Keyword(s):

Black Box ◽

Computation Cost ◽

Attack Model ◽

Video Recognition ◽

Spatial Domains ◽

Adversarial Examples ◽

Salient Regions ◽

Adversarial Attack ◽

Adversarial Example ◽

The Given

We study the problem of attacking video recognition models in the black-box setting, where the model information is unknown and the adversary can only make queries to detect the predicted top-1 class and its probability. Compared with the black-box attack on images, attacking videos is more challenging as the computation cost for searching the adversarial perturbations on a video is much higher due to its high dimensionality. To overcome this challenge, we propose a heuristic black-box attack model that generates adversarial perturbations only on the selected frames and regions. More specifically, a heuristic-based algorithm is proposed to measure the importance of each frame in the video towards generating the adversarial examples. Based on the frames' importance, the proposed algorithm heuristically searches a subset of frames where the generated adversarial example has strong adversarial attack ability while keeps the perturbations lower than the given bound. Besides, to further boost the attack efficiency, we propose to generate the perturbations only on the salient regions of the selected frames. In this way, the generated perturbations are sparse in both temporal and spatial domains. Experimental results of attacking two mainstream video recognition methods on the UCF-101 dataset and the HMDB-51 dataset demonstrate that the proposed heuristic black-box adversarial attack method can significantly reduce the computation cost and lead to more than 28% reduction in query numbers for the untargeted attack on both datasets.

Download Full-text

Unfolding the case of returnees: How the European Union and its member States are addressing the return of foreign fighters and their families

International Review of the Red Cross ◽

10.1017/s1816383121000217 ◽

2021 ◽

pp. 1-23

Author(s):

Carlota Rigotti ◽

Júlia Zomignani Barboza

Keyword(s):

European Union ◽

Perceived Threat ◽

The European Union ◽

Security Threat ◽

Short Term ◽

Member States ◽

Foreign Fighters ◽

Term Response

Abstract The return of foreign fighters and their families to the European Union has mostly been considered a security threat by member States, which consequently adopt repressive measures aimed at providing an immediate, short-term response to this perceived threat. In addition to this strong-arm approach, reintegration strategies have also been used to prevent returnees from falling back into terrorism and to break down barriers of hostility between citizens in the long term. Amidst these different strategies, this paper seeks to identify which methods are most desirable for handling returnees.

Download Full-text

A Black-Box Adversarial Attack via Deep Reinforcement Learning on the Feature Space

2021 IEEE Conference on Dependable and Secure Computing (DSC) ◽

10.1109/dsc49826.2021.9346264 ◽

2021 ◽

Author(s):

Lyue Li ◽

Amir Rezapour ◽

Wen-Guey Tzeng

Keyword(s):

Reinforcement Learning ◽

Feature Space ◽

Black Box ◽

Adversarial Attack

Download Full-text