Digital Watermark Perturbation for Adversarial Examples to Fool Deep Neural Networks

Author(s):  
Shiyu Feng ◽  
Feng Feng ◽  
Xiao Xu ◽  
Zheng Wang ◽  
Yining Hu ◽  
...  


Symmetry ◽  
2021 ◽  
Vol 13 (3) ◽  
pp. 428
Author(s):  
Hyun Kwon ◽  
Jun Lee

Although deep neural networks perform well on image and voice recognition, pattern analysis, and intrusion detection, they are vulnerable to adversarial examples: inputs created by adding a small amount of noise to the original data, which are misclassified by deep neural networks even though humans still perceive them as normal. In this paper, a robust diversity adversarial training method against adversarial attacks is presented. Because the target model is trained on a variety of adversarial samples, it becomes more robust to unknown adversarial examples. In the experiments, TensorFlow was used as the deep learning framework, with MNIST and Fashion-MNIST as the datasets. The results show that the diversity training method lowers the attack success rate by an average of 27.2% and 24.3% for various adversarial examples, while maintaining accuracy rates of 98.7% and 91.5% on the original MNIST and Fashion-MNIST data, respectively.
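The abstract describes training on a mix of clean data and adversarial examples generated in several different ways. Below is a minimal TensorFlow sketch of that idea (not the authors' code): adversarial variants are produced by FGSM at several perturbation budgets as a stand-in for the paper's attack mix, and the model is updated on the clean batch and all variants jointly. The model, optimizer, and hyperparameters are assumptions.

```python
import tensorflow as tf

def fgsm(model, x, y, eps):
    """Fast Gradient Sign Method: one-step perturbation of batch x with labels y."""
    x = tf.convert_to_tensor(x)
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
    with tf.GradientTape() as tape:
        tape.watch(x)
        loss = loss_fn(y, model(x, training=False))
    grad = tape.gradient(loss, x)
    return tf.clip_by_value(x + eps * tf.sign(grad), 0.0, 1.0)

def diversity_train_step(model, optimizer, x, y, eps_list=(0.05, 0.1, 0.2)):
    """One training step on the clean batch plus adversarial variants
    generated with several perturbation budgets."""
    batches = [tf.convert_to_tensor(x)] + [fgsm(model, x, y, eps) for eps in eps_list]
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
    with tf.GradientTape() as tape:
        losses = [loss_fn(y, model(b, training=True)) for b in batches]
        loss = tf.add_n(losses) / len(losses)
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss
```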


Author(s):  
Wenjie Ruan ◽  
Min Wu ◽  
Youcheng Sun ◽  
Xiaowei Huang ◽  
Daniel Kroening ◽  
...  

Deployment of deep neural networks (DNNs) in safety-critical systems requires provable guarantees for their correct behaviours. We compute the maximal radius of a safe norm ball around a given input, within which there are no adversarial examples for a trained DNN. We define global robustness as an expectation of the maximal safe radius over a test dataset, and develop an algorithm to approximate the global robustness measure by iteratively computing its lower and upper bounds. Our algorithm is the first efficient method for the Hamming (L0) distance, and we hypothesise that this norm is a good proxy for a certain class of physical attacks. The algorithm is anytime, i.e., it returns intermediate bounds and robustness estimates that are gradually, but strictly, improved as the computation proceeds; tensor-based, i.e., the computation is conducted over a set of inputs simultaneously to enable efficient GPU computation; and has provable guarantees, i.e., both the bounds and the robustness estimates can converge to their optimal values. Finally, we demonstrate the utility of our approach by applying the algorithm to a set of challenging problems.
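As an illustration of the quantities involved (not the paper's anytime algorithm), the sketch below computes the Hamming (L0) distance between two images and derives a crude upper bound on the maximal safe radius: a greedy search changes one pixel at a time until the prediction flips, and the number of changed pixels then bounds the safe radius from above. The `predict` function returning class probabilities is an assumption.

```python
import numpy as np

def hamming_distance(x, x_adv):
    """Number of coordinates at which two (flattened) images differ."""
    return int(np.sum(x.reshape(-1) != x_adv.reshape(-1)))

def upper_bound_safe_radius(predict, x, label, max_changes=20):
    """Greedy search: at each step set the single pixel that most lowers the
    true-class probability to an extreme value (0 or 1). If the prediction
    flips after k edits, k upper-bounds the maximal safe L0 radius for x.
    Cost is O(pixels) forward passes per step, so this is only a sketch."""
    x_adv = x.copy().reshape(-1)
    changed = set()
    for k in range(1, max_changes + 1):
        best = None  # (true-class prob, pixel index, new value)
        for idx in range(x_adv.size):
            if idx in changed:
                continue
            for val in (0.0, 1.0):
                trial = x_adv.copy()
                trial[idx] = val
                prob = predict(trial.reshape(x.shape))[label]
                if best is None or prob < best[0]:
                    best = (prob, idx, val)
        _, idx, val = best
        x_adv[idx] = val
        changed.add(idx)
        if int(np.argmax(predict(x_adv.reshape(x.shape)))) != label:
            return k  # adversarial example found at Hamming distance k
    return None  # no adversarial example found within the budget
```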


2020 ◽  
Vol 34 (07) ◽  
pp. 11229-11236
Author(s):  
Zhiwei Ke ◽  
Zhiwei Wen ◽  
Weicheng Xie ◽  
Yi Wang ◽  
Linlin Shen

Dropout regularization has been widely used in various deep neural networks to combat overfitting. It works by training a network to be more robust to information-degraded data points, which improves generalization. Conventional dropout and its variants are often applied to individual hidden units in a layer to break up co-adaptations of feature detectors. In this paper, we propose an adaptive dropout that reduces co-adaptations in a group-wise manner, using coarse semantic information to improve feature discriminability. In particular, we show that adjusting the dropout probability based on local feature densities not only improves classification performance significantly but also, in some cases, enhances the network's robustness against adversarial examples. The proposed approach is evaluated against the baseline and several state-of-the-art adaptive dropouts on four public datasets: Fashion-MNIST, CIFAR-10, CIFAR-100, and SVHN.
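The abstract does not spell out the exact mechanism, so the following is only one possible interpretation: a Keras layer that treats channels as groups, uses the mean spatial activation of each channel as a crude stand-in for the "local feature density" (assuming post-ReLU, non-negative features), and drops denser groups with higher probability. The class name and the rate settings are assumptions.

```python
import tensorflow as tf

class GroupAdaptiveDropout(tf.keras.layers.Layer):
    """Group-wise adaptive dropout sketch: whole channels are kept or dropped
    together, with drop probability increasing with the channel's density."""
    def __init__(self, base_rate=0.2, max_rate=0.5, **kwargs):
        super().__init__(**kwargs)
        self.base_rate = base_rate
        self.max_rate = max_rate

    def call(self, x, training=False):  # x: (batch, H, W, channels), post-ReLU
        if not training:
            return x
        # Per-channel "density": mean activation over the spatial dimensions.
        density = tf.reduce_mean(x, axis=[1, 2], keepdims=True)        # (B,1,1,C)
        density = density / (tf.reduce_max(density, axis=-1, keepdims=True) + 1e-8)
        # Drop probability grows with density, clipped to [base_rate, max_rate].
        rate = tf.clip_by_value(
            self.base_rate + density * (self.max_rate - self.base_rate),
            self.base_rate, self.max_rate)
        keep = tf.cast(tf.random.uniform(tf.shape(rate)) >= rate, x.dtype)
        # Rescale so the expected activation is unchanged.
        return x * keep / (1.0 - rate)
```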


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Hongwei Luo ◽  
Yijie Shen ◽  
Feng Lin ◽  
Guoai Xu

Speaker verification systems have gained great popularity in recent years, especially with the development of deep neural networks and the Internet of Things. However, the security of speaker verification systems based on deep neural networks has not been well investigated. In this paper, we propose an attack that spoofs a state-of-the-art speaker verification system based on the generalized end-to-end (GE2E) loss function, so that illegitimate users are misclassified as the authentic user. Specifically, we design a novel loss function to train a generator that produces effective adversarial examples with only slight perturbations, and we then spoof the system with these adversarial examples. The success rate of our attack reaches 82% when cosine similarity is adopted to deploy the deep-learning-based speaker verification system. Beyond that, our experiments also report a signal-to-noise ratio of 76 dB, which shows that our attack is less perceptible than previous works. In summary, the results show that our attack can not only spoof the state-of-the-art neural-network-based speaker verification system but also, more importantly, remain hidden from human hearing and machine discrimination.
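The generator itself is not specified in this summary, but the underlying objective can be sketched directly: optimize a small perturbation of an impostor utterance so that the speaker-verification embedding of the perturbed audio has high cosine similarity with the enrolled target embedding, while penalizing perturbation energy to keep the signal-to-noise ratio high. The `embed_model` (a GE2E-style embedding network), the loss weighting, and the step counts are assumptions.

```python
import tensorflow as tf

def cosine_similarity(a, b):
    a = tf.math.l2_normalize(a, axis=-1)
    b = tf.math.l2_normalize(b, axis=-1)
    return tf.reduce_sum(a * b, axis=-1)

def spoof_attack(embed_model, impostor_audio, target_embedding,
                 steps=500, lr=1e-3, c=0.1):
    """Maximise cosine similarity to the target embedding while penalising
    the perturbation energy (which keeps the SNR of the result high)."""
    delta = tf.Variable(tf.zeros_like(impostor_audio))
    opt = tf.keras.optimizers.Adam(learning_rate=lr)
    for _ in range(steps):
        with tf.GradientTape() as tape:
            adv = impostor_audio + delta
            emb = embed_model(adv, training=False)
            sim = cosine_similarity(emb, target_embedding)
            loss = tf.reduce_mean(1.0 - sim) + c * tf.reduce_mean(tf.square(delta))
        grads = tape.gradient(loss, [delta])
        opt.apply_gradients(zip(grads, [delta]))
    return (impostor_audio + delta).numpy()
```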


Author(s):  
Anibal Pedraza ◽  
Oscar Deniz ◽  
Gloria Bueno

The phenomenon of adversarial examples has become one of the most intriguing topics associated with deep learning. The so-called adversarial attacks have the ability to fool deep neural networks with imperceptible perturbations. While the effect is striking, it has been suggested that such carefully selected injected noise does not necessarily appear in real-world scenarios. In contrast, some authors have looked for ways to generate adversarial noise in physical scenarios (traffic signs, shirts, etc.), showing that attackers can indeed fool the networks. In this paper we go beyond that and show that adversarial examples also appear in the real world without any attacker or maliciously selected noise involved. We show this using images from tasks related to microscopy as well as general object recognition with the well-known ImageNet dataset. A comparison between these natural adversarial examples and artificially generated ones is performed using distance metrics and image quality metrics. We also show that the natural adversarial examples are in fact at a greater distance from the originals than the artificially generated adversarial examples.
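The abstract mentions comparing natural and artificial adversarial examples with distance metrics and image-quality metrics; the exact metric set is not given here, so the sketch below uses a common choice (L2, L-infinity, PSNR, SSIM) as an assumption, with NumPy and scikit-image.

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def compare(original, adversarial):
    """Both images as float arrays in [0, 1], shape (H, W) or (H, W, C)."""
    diff = original - adversarial
    return {
        "l2": float(np.linalg.norm(diff)),
        "linf": float(np.max(np.abs(diff))),
        "psnr": float(peak_signal_noise_ratio(original, adversarial, data_range=1.0)),
        "ssim": float(structural_similarity(
            original, adversarial, data_range=1.0,
            channel_axis=-1 if original.ndim == 3 else None)),
    }
```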

