A Comprehensive Study of Deep Learning for Side-Channel Analysis

Recently, several studies have been published on the application of deep learning to enhance Side-Channel Attacks (SCA). These seminal works have practically validated the soundness of the approach, especially against implementations protected by masking or by jittering. Concurrently, important open issues have emerged. Among them, the relevance of machine (and thereby deep) learning based SCA has been questioned in several papers based on the lack of relation between the accuracy, a typical performance metric used in machine learning, and common SCA metrics like the Guessing entropy or the key-discrimination success rate. Also, the impact of the classical side-channel counter-measures on the efficiency of deep learning has been questioned, in particular by the semi-conductor industry. Both questions enlighten the importance of studying the theoretical soundness of deep learning in the context of side-channel and of developing means to quantify its efficiency, especially with respect to the optimality bounds published so far in the literature for side-channel leakage exploitation. The first main contribution of this paper directly concerns the latter point. It is indeed proved that minimizing the Negative Log Likelihood (NLL for short) loss function during the training of deep neural networks is actually asymptotically equivalent to maximizing the Perceived Information introduced by Renauld et al. at EUROCRYPT 2011 as a lower bound of the Mutual Information between the leakage and the target secret. Hence, such a training can be considered as an efficient and effective estimation of the PI, and thereby of the MI (known to be complex to accurately estimate in the context of secure implementations). As a second direct consequence of our main contribution, it is argued that, in a side-channel exploitation context, choosing the NLL loss function to drive the training is sound from an information theory point of view. As a third contribution, classical counter-measures like Boolean masking or execution flow shuffling, initially dedicated to classical SCA, are proved to stay sound against deep Learning based attacks.

Download Full-text

Improving Deep Learning Networks for Profiled Side-channel Analysis Using Performance Improvement Techniques

ACM Journal on Emerging Technologies in Computing Systems ◽

10.1145/3453162 ◽

2021 ◽

Vol 17 (3) ◽

pp. 1-30

Author(s):

Damien Robissout ◽

Lilian Bossuet ◽

Amaury Habrard ◽

Vincent Grosso

Keyword(s):

Deep Learning ◽

Side Channel ◽

Learning Networks ◽

Weight Decay ◽

Training Time ◽

Side Channel Analysis ◽

Learning Techniques ◽

The Neural Networks ◽

Channel Analysis ◽

The Impact

The use of deep learning techniques to perform side-channel analysis attracted the attention of many researchers as they obtained good performances with them. Unfortunately, the understanding of the neural networks used to perform side-channel attacks is not very advanced yet. In this article, we propose to contribute to this direction by studying the impact of some particular deep learning techniques for tackling side-channel attack problems. More precisely, we propose to focus on three existing techniques: batch normalization, dropout, and weight decay, not yet used in side-channel context. By combining adequately these techniques for our problem, we show that it is possible to improve the attack performance, i.e., the number of traces needed to recover the secret, by more than 55%. Additionally, they allow us to have a gain of more than 34% in terms of training time. We also show that an architecture trained with such techniques is able to perform attacks efficiently even in the context of desynchronized traces.

Download Full-text

A Novel Evaluation Metric for Deep Learning-Based Side Channel Analysis and Its Extended Application to Imbalanced Data

IACR Transactions on Cryptographic Hardware and Embedded Systems ◽

10.46586/tches.v2020.i3.73-96 ◽

2020 ◽

pp. 73-96

Author(s):

Jiajia Zhang ◽

Mengce Zheng ◽

Jiehui Nan ◽

Honggang Hu ◽

Nenghai Yu

Keyword(s):

Deep Learning ◽

Loss Function ◽

Imbalanced Data ◽

Cross Entropy ◽

Side Channel ◽

Worst Case ◽

Side Channel Analysis ◽

Learning Techniques ◽

Channel Analysis ◽

Learning Metrics

Since Kocher (CRYPTO’96) proposed timing attack, side channel analysis (SCA) has shown great potential to break cryptosystems via physical leakage. Recently, deep learning techniques are widely used in SCA and show equivalent and even better performance compared to traditional methods. However, it remains unknown why and when deep learning techniques are effective and efficient for SCA. Masure et al. (IACR TCHES 2020(1):348–375) illustrated that deep learning paradigm is suitable for evaluating implementations against SCA from a worst-case scenario point of view, yet their work is limited to balanced data and a specific loss function. Besides, deep learning metrics are not consistent with side channel metrics. In most cases, they are deceptive in foreseeing the feasibility and complexity of mounting a successful attack, especially for imbalanced data. To mitigate the gap between deep learning metrics and side channel metrics, we propose a novel Cross Entropy Ratio (CER) metric to evaluate the performance of deep learning models for SCA. CER is closely related to traditional side channel metrics Guessing Entropy (GE) and Success Rate (SR) and fits to deep learning scenario. Besides, we show that it works stably while deep learning metrics such as accuracy becomes rather unreliable when the training data tends to be imbalanced. However, estimating CER can be done as easy as natural metrics in deep learning algorithms with low computational complexity. Furthermore, we adapt CER metric to a new kind of loss function, namely CER loss function, designed specifically for deep learning in side channel scenario. In this way, we link directly the SCA objective to deep learning optimization. Our experiments on several datasets show that, for SCA with imbalanced data, CER loss function outperforms Cross Entropy loss function in various conditions.

Download Full-text

Assessing the Impact of the Loss Function, Architecture and Image Type for Deep Learning-Based Wildfire Segmentation

Applied Sciences ◽

10.3390/app11157046 ◽

2021 ◽

Vol 11 (15) ◽

pp. 7046

Author(s):

Jorge Francisco Ciprián-Sánchez ◽

Gilberto Ochoa-Ruiz ◽

Lucile Rossi ◽

Frédéric Morandini

Keyword(s):

Deep Learning ◽

Loss Function ◽

State Of The Art ◽

Fire Detection ◽

Loss Functions ◽

Wildfire Spread ◽

Combine Information ◽

The Impact ◽

Image Type ◽

Segmentation Models

Wildfires stand as one of the most relevant natural disasters worldwide, particularly more so due to the effect of climate change and its impact on various societal and environmental levels. In this regard, a significant amount of research has been done in order to address this issue, deploying a wide variety of technologies and following a multi-disciplinary approach. Notably, computer vision has played a fundamental role in this regard. It can be used to extract and combine information from several imaging modalities in regard to fire detection, characterization and wildfire spread forecasting. In recent years, there has been work pertaining to Deep Learning (DL)-based fire segmentation, showing very promising results. However, it is currently unclear whether the architecture of a model, its loss function, or the image type employed (visible, infrared, or fused) has the most impact on the fire segmentation results. In the present work, we evaluate different combinations of state-of-the-art (SOTA) DL architectures, loss functions, and types of images to identify the parameters most relevant to improve the segmentation results. We benchmark them to identify the top-performing ones and compare them to traditional fire segmentation techniques. Finally, we evaluate if the addition of attention modules on the best performing architecture can further improve the segmentation results. To the best of our knowledge, this is the first work that evaluates the impact of the architecture, loss function, and image type in the performance of DL-based wildfire segmentation models.

Download Full-text

Side-channel analysis attacks based on deep learning network

Frontiers of Computer Science ◽

10.1007/s11704-020-0209-4 ◽

2021 ◽

Vol 16 (2) ◽

Author(s):

Yu Ou ◽

Lang Li

Keyword(s):

Deep Learning ◽

Side Channel ◽

Side Channel Analysis ◽

Learning Network ◽

Channel Analysis ◽

Deep Learning Network

Download Full-text

A Comparison of Weight Initializers in Deep Learning-Based Side-Channel Analysis

Lecture Notes in Computer Science - Applied Cryptography and Network Security Workshops ◽

10.1007/978-3-030-61638-0_8 ◽

2020 ◽

pp. 126-143

Author(s):

Huimin Li ◽

Marina Krček ◽

Guilherme Perin

Keyword(s):

Deep Learning ◽

Side Channel ◽

Side Channel Analysis ◽

Channel Analysis

Download Full-text

Profiling Attacks against ECC: Side Channel Analysis Based on Deep Learning for Curve-25519

10.1109/icct52962.2021.9657885 ◽

2021 ◽

Author(s):

Jiajun Xu ◽

Meng Li ◽

Lixin Liang ◽

Yiwei Zhang ◽

Shaohua Xiang ◽

...

Keyword(s):

Deep Learning ◽

Side Channel ◽

Side Channel Analysis ◽

Channel Analysis

Download Full-text

Recent advances in deep learning‐based side‐channel analysis

ETRI Journal ◽

10.4218/etrij.2019-0163 ◽

2020 ◽

Vol 42 (2) ◽

pp. 292-304 ◽

Cited By ~ 3

Author(s):

Sunghyun Jin ◽

Suhri Kim ◽

HeeSeok Kim ◽

Seokhie Hong

Keyword(s):

Deep Learning ◽

Side Channel ◽

Side Channel Analysis ◽

Recent Advances ◽

Channel Analysis

Download Full-text

Deep learning-based Electromagnetic Side-Channel Analysis for the Investigation of IoT Devices

2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA) ◽

10.1109/icirca48905.2020.9182814 ◽

2020 ◽

Author(s):

Virinchi Tirumaladass ◽

Stefan Axelsson ◽

Mark Dougherty ◽

Muhammed Ahsan Rasool ◽

Mohamed Hamdy Eldefrawy

Keyword(s):

Deep Learning ◽

Side Channel ◽

Side Channel Analysis ◽

Channel Analysis ◽

Iot Devices

Download Full-text

Plaintext: A Missing Feature for Enhancing the Power of Deep Learning in Side-Channel Analysis?

IACR Transactions on Cryptographic Hardware and Embedded Systems ◽

10.46586/tches.v2020.i4.49-85 ◽

2020 ◽

pp. 49-85

Author(s):

Anh-Tuan Hoang ◽

Neil Hanley ◽

Maire O’Neill

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Large Body ◽

Side Channel ◽

Secret Key ◽

Side Channel Analysis ◽

Filter Kernel ◽

Secret Keys ◽

Channel Analysis ◽

Template Attacks

Deep learning (DL) has proven to be very effective for image recognition tasks, with a large body of research on various model architectures for object classification. Straight-forward application of DL to side-channel analysis (SCA) has already shown promising success, with experimentation on open-source variable key datasets showing that secret keys can be revealed with 100s traces even in the presence of countermeasures. This paper aims to further improve the application of DL for SCA, by enhancing the power of DL when targeting the secret key of cryptographic algorithms when protected with SCA countermeasures. We propose a new model, CNN-based model with Plaintext feature extension (CNNP) together with multiple convolutional filter kernel sizes and structures with deeper and narrower neural networks, which has empirically proven its effectiveness by outperforming reference profiling attack methods such as template attacks (TAs), convolutional neural networks (CNNs) and multilayer perceptron (MLP) models. Our model generates state-of-the art results when attacking the ASCAD variable-key database, which has a restricted number of training traces per key, recovering the key within 40 attack traces in comparison with order of 100s traces required by straightforward machine learning (ML) application. During the profiling stage an attacker needs no additional knowledge on the implementation, such as the masking scheme or random mask values, only the ability to record the power consumption or electromagnetic field traces, plaintext/ciphertext and the key. Additionally, no heuristic pre-processing is required in order to break the high-order masking countermeasures of the target implementation.

Download Full-text

Deep Learning Techniques for Side-Channel Analysis

10.1201/9781003201038-13 ◽

2021 ◽

pp. 255-269

Author(s):

Varsha Satheesh Kumar ◽

S. Dillibabu Shanmugam ◽

N. Sarat Chandra Babu

Keyword(s):

Deep Learning ◽

Side Channel ◽

Side Channel Analysis ◽

Learning Techniques ◽

Channel Analysis

Download Full-text