Intrusion Detection Model Using Temporal Convolutional Network Blend Into Attention Mechanism

2022, Vol 16 (1), pp. 1-20
Author(s): Ping Zhao, Zhijie Fan*, Zhiwei Cao, Xin Li

To improve network attack detection, traditional intrusion detection models often use convolutional neural networks to encode spatial information or recurrent neural networks to capture temporal features of the data, and some models combine the two to extract spatio-temporal features. However, these approaches rely on separate models and learn features insufficiently. This paper presents an improved model based on temporal convolutional networks (TCN) and an attention mechanism. Causal and dilated convolutions capture the spatio-temporal dependencies of the data, while residual blocks let the network pass information across layers, enabling deeper learning. Meanwhile, the attention mechanism strengthens the model's focus on the anomalous features relevant to different attacks. Finally, the paper compares model results on the KDD CUP99 and UNSW-NB15 datasets, and the authors apply the model to a video surveillance network attack detection scenario. The results show that the model has advantages on the evaluation metrics.
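As a rough illustration of the components this abstract describes, the sketch below combines a causal dilated 1-D convolution, a residual connection, and a simple channel-attention gate in PyTorch; the layer sizes, the squeeze-and-excitation form of the attention, and the input shapes are assumptions rather than the authors' published configuration.

```python
# Hedged sketch (not the authors' code): a causal, dilated 1-D convolution
# residual block followed by channel attention, roughly matching the
# TCN-plus-attention design described above. All sizes are assumptions.
import torch
import torch.nn as nn

class CausalConv1d(nn.Module):
    """1-D convolution padded on the left only, so outputs never see the future."""
    def __init__(self, in_ch, out_ch, kernel_size, dilation):
        super().__init__()
        self.pad = (kernel_size - 1) * dilation
        self.conv = nn.Conv1d(in_ch, out_ch, kernel_size, dilation=dilation)

    def forward(self, x):                        # x: (batch, channels, time)
        x = nn.functional.pad(x, (self.pad, 0))  # left-pad only -> causal
        return self.conv(x)

class TCNAttentionBlock(nn.Module):
    def __init__(self, channels, kernel_size=3, dilation=1):
        super().__init__()
        self.conv1 = CausalConv1d(channels, channels, kernel_size, dilation)
        self.conv2 = CausalConv1d(channels, channels, kernel_size, dilation)
        self.relu = nn.ReLU()
        # Simple squeeze-and-excitation style channel attention (assumed form).
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool1d(1),
            nn.Conv1d(channels, channels // 4, 1), nn.ReLU(),
            nn.Conv1d(channels // 4, channels, 1), nn.Sigmoid(),
        )

    def forward(self, x):
        out = self.relu(self.conv1(x))
        out = self.relu(self.conv2(out))
        out = out * self.attn(out)               # re-weight channels by attention
        return self.relu(out + x)                # residual (cross-layer) connection

features = torch.randn(8, 64, 100)               # 8 flows, 64 features, 100 time steps
block = TCNAttentionBlock(64, dilation=2)
print(block(features).shape)                     # torch.Size([8, 64, 100])
```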

Author(s): Xiaobin Zhu, Zhuangzi Li, Xiao-Yu Zhang, Changsheng Li, Yaqi Liu, ...

Video super-resolution is a challenging task that has attracted great attention in the research and industry communities. In this paper, we propose a novel end-to-end architecture, called the Residual Invertible Spatio-Temporal Network (RISTN), for video super-resolution. RISTN sufficiently exploits spatial information from low resolution to high resolution and effectively models the temporal consistency of consecutive video frames. Compared with existing recurrent convolutional network based approaches, RISTN is much deeper yet more efficient. It consists of three major components: in the spatial component, a lightweight residual invertible block is designed to reduce information loss during feature transformation and provide robust feature representations; in the temporal component, a novel recurrent convolutional model with residual dense connections is proposed to construct a deeper network and avoid feature degradation; in the reconstruction component, a new fusion method based on a sparse strategy integrates the spatial and temporal features. Experiments on public benchmark datasets demonstrate that RISTN outperforms state-of-the-art methods.
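The "residual invertible block" is the distinctive spatial component here. The following PyTorch sketch shows one standard way such a block can be built, using additive channel coupling so the transform is exactly invertible and loses no information; the two-branch design, channel split, and convolution sizes are assumptions, not the RISTN authors' exact layer.

```python
# Hedged sketch: an additive-coupling block in the spirit of a "residual invertible
# block" (the authors' exact design may differ). Splitting channels and using
# additive updates makes the mapping invertible, one standard way to limit
# information loss during feature transformation.
import torch
import torch.nn as nn

def conv_branch(ch):
    return nn.Sequential(nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(),
                         nn.Conv2d(ch, ch, 3, padding=1))

class InvertibleBlock(nn.Module):
    def __init__(self, channels):                # channels must be even
        super().__init__()
        half = channels // 2
        self.f = conv_branch(half)
        self.g = conv_branch(half)

    def forward(self, x):
        x1, x2 = torch.chunk(x, 2, dim=1)
        y1 = x1 + self.f(x2)                      # additive coupling: no information lost
        y2 = x2 + self.g(y1)
        return torch.cat([y1, y2], dim=1)

    def inverse(self, y):                         # exact reconstruction of the input
        y1, y2 = torch.chunk(y, 2, dim=1)
        x2 = y2 - self.g(y1)
        x1 = y1 - self.f(x2)
        return torch.cat([x1, x2], dim=1)

block = InvertibleBlock(32)
frame_feat = torch.randn(1, 32, 64, 64)           # one low-resolution frame's features
recovered = block.inverse(block(frame_feat))
print(torch.allclose(frame_feat, recovered, atol=1e-5))  # True
```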


2021, Vol 11 (7), pp. 3111
Author(s): Enjie Ding, Yuhao Cheng, Chengcheng Xiao, Zhongyu Liu, Wanli Yu

Light-weight convolutional neural networks (CNNs) suffer from limited feature representation capability under low computational budgets, which degrades performance. To make CNNs more efficient, dynamic neural networks (DyNet) have been proposed, increasing model capacity by using the Squeeze-and-Excitation (SE) module to adaptively weight each convolution kernel through an attention mechanism. However, the attention mechanism in the SE network (SENet) uses all channel information in its calculations, which brings two essential challenges: (a) interference from internally redundant information, and (b) a growing number of network computations. To address these problems, this work proposes a dynamic convolutional network (termed EAM-DyNet) that reduces the number of channels in the feature maps by extracting only the useful spatial information. EAM-DyNet first applies random channel reduction and channel grouping reduction to remove redundant information. Because downsampling can discard useful information, it then applies an adaptive average pooling method to maintain information integrity. Extensive experimental results on the baselines demonstrate that EAM-DyNet outperforms the existing approaches, achieving higher test accuracy with fewer network parameters.
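To make the channel-reduction idea concrete, here is a hedged PyTorch sketch of an SE-style attention module that computes its weights from only one representative channel per group and pools to a small spatial grid instead of a single value; the grouping rule, pooled size, and single fully connected layer are illustrative assumptions, not the published EAM-DyNet design.

```python
# Hedged sketch of the idea described above: compute channel attention from only a
# subset of channels (here, one representative per group, an assumed form of
# "channel grouping reduction") and use adaptive average pooling so the pooled
# descriptor keeps more spatial context than a 1x1 squeeze.
import torch
import torch.nn as nn

class GroupedChannelAttention(nn.Module):
    def __init__(self, channels, groups=4, pooled=2):
        super().__init__()
        self.groups = groups
        reduced = channels // groups
        self.pool = nn.AdaptiveAvgPool2d(pooled)           # keep a small spatial grid
        self.fc = nn.Sequential(
            nn.Linear(reduced * pooled * pooled, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):                                   # x: (B, C, H, W)
        b, c, _, _ = x.shape
        # take the first channel of each group as its representative (assumption)
        idx = torch.arange(0, c, self.groups, device=x.device)
        reduced = x.index_select(1, idx)                    # (B, C//groups, H, W)
        desc = self.pool(reduced).flatten(1)                # (B, reduced * pooled^2)
        weights = self.fc(desc).view(b, c, 1, 1)
        return x * weights                                  # re-weight every channel

attn = GroupedChannelAttention(channels=64, groups=4)
fmap = torch.randn(2, 64, 32, 32)
print(attn(fmap).shape)                                     # torch.Size([2, 64, 32, 32])
```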


2021, Vol 2021, pp. 1-13
Author(s): Dongxian Shi, Ming Xu, Ting Wu, Liang Kou

In recent years, deep learning methods such as Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs) have been applied as effective intrusion detection methods for the in-vehicle CAN network. However, existing RNN approaches perform detection by building an independent model for each CAN ID, which cannot learn the shared characteristics of different IDs well and leads to relatively complicated model structures and high computation time. CNNs can detect quickly by learning the characteristics of normal and attack CAN ID sequences and show good performance, but current methods do not locate the anomalous points within a sequence. To solve these problems, this paper proposes an in-vehicle CAN network intrusion detection model based on the Temporal Convolutional Network, called the Temporal Convolutional Network-Based Intrusion Detection System (TCNIDS). In TCNIDS, CAN IDs are serialized into a natural-language-like sequence, and a word vector is constructed for each CAN ID through word embedding to reduce the data dimension. At the same time, TCNIDS improves the temporal convolutional network with the parametric ReLU activation, which better learns the latent features of normal sequences. The TCNIDS model has a simple structure and realizes point anomaly detection at the message level by predicting the future sequence of normal CAN data and applying a probability-threshold strategy. The experimental results show that TCNIDS outperforms the traditional temporal convolutional network intrusion detection model in overall detection rate, false alarm rate, and accuracy under fuzzy, spoofing, and DoS attacks.
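A minimal sketch of the pipeline the abstract outlines (embed CAN IDs, apply a causal dilated convolution stack with PReLU, predict the next ID, and flag low-probability messages) is given below; the vocabulary size, embedding dimension, number of levels, and the 1e-3 probability threshold are assumptions for illustration.

```python
# Hedged sketch (assumed hyperparameters, not the paper's exact model): embed CAN IDs
# as tokens, run a causal dilated convolution stack with PReLU activations, and flag a
# message as anomalous when the predicted probability of the observed next ID is low.
import torch
import torch.nn as nn

VOCAB = 2048                                    # number of distinct CAN IDs (assumption)

class TCNIDS(nn.Module):
    def __init__(self, vocab=VOCAB, dim=64, kernel=3, levels=4):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        layers = []
        for i in range(levels):
            d = 2 ** i                           # exponentially growing dilation
            layers += [nn.ConstantPad1d(((kernel - 1) * d, 0), 0.0),  # causal padding
                       nn.Conv1d(dim, dim, kernel, dilation=d),
                       nn.PReLU()]               # parametric ReLU, as in the abstract
        self.tcn = nn.Sequential(*layers)
        self.head = nn.Linear(dim, vocab)

    def forward(self, ids):                      # ids: (batch, seq_len) of CAN IDs
        h = self.embed(ids).transpose(1, 2)      # -> (batch, dim, seq_len)
        h = self.tcn(h).transpose(1, 2)          # -> (batch, seq_len, dim)
        return self.head(h)                      # next-ID logits at every position

model = TCNIDS()
seq = torch.randint(0, VOCAB, (1, 128))
probs = model(seq)[:, :-1].softmax(-1)           # predictions for positions 1..127
observed = seq[:, 1:]
p_obs = probs.gather(-1, observed.unsqueeze(-1)).squeeze(-1)
print((p_obs < 1e-3).nonzero())                  # indices flagged as point anomalies
```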


Water, 2021, Vol 13 (9), pp. 1247
Author(s): Lydia Tsiami, Christos Makropoulos

Prompt detection of cyber-physical attacks (CPAs) on a water distribution system (WDS) is critical to avoid irreversible damage to the network infrastructure and disruption of water services. However, the complex interdependencies of the water network's components make CPA detection challenging. To better capture the spatiotemporal dimensions of these interdependencies, we represented the WDS as a mathematical graph and approached the problem with graph neural networks. We presented an online, one-stage, prediction-based algorithm that implements a temporal graph convolutional network and uses the Mahalanobis distance. The algorithm exhibited strong detection performance and was able to localize the targeted network components for several benchmark attacks. We suggest that an important property of the proposed algorithm is its explainability, which allows useful information to be extracted about how the model works and is thus a step towards trustworthy AI algorithms for water applications. Additional insights into metrics commonly used to rank algorithm performance were also presented and discussed.
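The detection step can be illustrated independently of the graph model: score the residuals between predicted and observed sensor readings with a Mahalanobis distance calibrated on attack-free data. The sketch below uses NumPy with synthetic residuals; the sensor count, quantile threshold, and regularization constant are assumptions.

```python
# Hedged sketch of the detection step only: whatever model predicts the sensor
# readings (the paper uses a temporal graph convolutional network), the residuals
# between predicted and observed values can be scored with the Mahalanobis distance
# estimated from attack-free data. Variable names and sizes are illustrative.
import numpy as np

rng = np.random.default_rng(0)
normal_residuals = rng.normal(0.0, 0.1, size=(5000, 31))    # 31 sensors, clean period

mean = normal_residuals.mean(axis=0)
cov = np.cov(normal_residuals, rowvar=False)
cov_inv = np.linalg.inv(cov + 1e-6 * np.eye(cov.shape[0]))  # regularize for stability

def mahalanobis(residual):
    d = residual - mean
    return float(np.sqrt(d @ cov_inv @ d))

threshold = np.quantile(
    [mahalanobis(r) for r in normal_residuals], 0.995)      # calibrate on clean data

suspect = rng.normal(0.0, 0.1, size=31)
suspect[7] += 1.5                                            # simulated attacked sensor
print(mahalanobis(suspect) > threshold)                      # True -> raise an alert
```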


2014, Vol 651-653, pp. 1772-1775
Author(s): Wei Gong

The abilities of summarization, learning, self-fitting, and inner-parallel computing make artificial neural networks suitable for intrusion detection. On the other hand, data-fusion-based IDSs have been used to reduce the false-alarm rate and the failing-to-report rate and to improve performance. However, multi-sensor input data makes such an IDS lose efficiency. Research on neural network based data fusion IDSs therefore tries to combine the strong processing ability of neural networks with the advantages of a data fusion IDS. A neural network is designed to realize the data fusion and intrusion analysis, and during intrusion analysis a neural network pruning algorithm filters the information from the multiple sensors, increasing performance and saving network bandwidth.
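The abstract does not specify the pruning algorithm, so the snippet below only illustrates the general idea with plain magnitude pruning in PyTorch: low-importance connections from the fused multi-sensor input are zeroed, which is one way to filter sensor information and reduce the load on the network; the layer sizes and pruning ratio are assumptions.

```python
# Hedged illustration only: magnitude pruning on the first layer of a small fusion
# network, i.e. connections from low-importance sensor inputs are zeroed so less
# sensor data needs to be processed and transmitted.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

fusion_net = nn.Sequential(
    nn.Linear(24, 16),    # 24 features fused from multiple sensors (assumption)
    nn.ReLU(),
    nn.Linear(16, 2),     # normal vs. intrusion
)

# Remove the 50% smallest-magnitude input connections of the fusion layer.
prune.l1_unstructured(fusion_net[0], name="weight", amount=0.5)
kept = int(fusion_net[0].weight_mask.sum().item())
print(f"{kept} of {fusion_net[0].weight_mask.numel()} input connections kept")
```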


Author(s): Muder Almiani, Alia AbuGhazleh, Amer Al‐Rahayfeh, Abdul Razaque

IEEE Access, 2018, Vol 6, pp. 1792-1806
Author(s): Wei Wang, Yiqiang Sheng, Jinlin Wang, Xuewen Zeng, Xiaozhou Ye, ...

Author(s): Yuqi Yu, Hanbing Yan, Yuan Ma, Hao Zhou, Hongchao Guan

Hypertext Transfer Protocol (HTTP) accounts for a large portion of Internet application-layer traffic. Since the payload of HTTP traffic can record website status and user request information, many studies use HTTP traffic for web application attack detection. In this work, we propose DeepHTTP, an HTTP traffic detection framework based on deep learning. Unlike previous studies, this framework not only detects malicious traffic but also uses the deep learning model to mine the malicious fields of the traffic payload. The detection model, called AT-Bi-LSTM, is based on a Bidirectional Long Short-Term Memory (Bi-LSTM) network with an attention mechanism. The attention mechanism improves the discriminative ability and makes the results interpretable. To enhance the generalization ability of the model, this paper also proposes a novel feature extraction method. Experiments show that DeepHTTP performs excellently in malicious traffic discrimination and pattern mining.
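A hedged sketch of an attention-weighted Bi-LSTM classifier in the spirit of AT-Bi-LSTM follows; the tokenization, vocabulary size, hidden sizes, and the simple linear attention scorer are assumptions, and the returned per-token attention weights illustrate how the interpretability used for malicious field mining could be exposed.

```python
# Hedged sketch (sizes and attention form are assumptions, not the paper's exact model):
# a Bi-LSTM over tokenized HTTP payloads with a learned attention pooling, returning
# both the class logits and the attention weights over tokens.
import torch
import torch.nn as nn

class AttBiLSTM(nn.Module):
    def __init__(self, vocab=10000, dim=128, hidden=64, classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.lstm = nn.LSTM(dim, hidden, batch_first=True, bidirectional=True)
        self.score = nn.Linear(2 * hidden, 1)        # simple attention scorer
        self.cls = nn.Linear(2 * hidden, classes)

    def forward(self, tokens):                       # tokens: (batch, seq_len)
        h, _ = self.lstm(self.embed(tokens))         # (batch, seq_len, 2*hidden)
        alpha = torch.softmax(self.score(h), dim=1)  # attention weight per token
        context = (alpha * h).sum(dim=1)             # weighted sum over the sequence
        return self.cls(context), alpha.squeeze(-1)  # logits + per-token weights

model = AttBiLSTM()
request = torch.randint(0, 10000, (1, 200))           # one tokenized HTTP request
logits, weights = model(request)
print(logits.shape, weights.argmax().item())          # class scores, most-attended token
```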


2020, Vol 10 (2), pp. 557
Author(s): Mei Chee Leong, Dilip K. Prasad, Yong Tsui Lee, Feng Lin

This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal features in video action recognition. Unlike 2D convolutional neural networks (CNNs), 3D CNNs can be applied directly to consecutive frames to extract spatio-temporal features. The aim of this work is to fuse convolution layers from 2D and 3D CNNs to allow temporal encoding with fewer parameters than 3D CNNs. We adopt transfer learning from pre-trained 2D CNNs for spatial feature extraction, followed by temporal encoding, before connecting to 3D convolution layers at the top of the architecture. We construct our fusion architecture, semi-CNN, based on three popular models: VGG-16, ResNets, and DenseNets, and compare its performance with the corresponding 3D models. Our empirical results on the action recognition dataset UCF-101 demonstrate that our fusion of 1D, 2D, and 3D convolutions outperforms the 3D model of the same depth, with fewer parameters and reduced overfitting. The semi-CNN architecture achieves an average boost of 16–30% in top-1 accuracy when evaluated on input videos of 16 frames.
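The fusion idea (2D convolutions for per-frame spatial features, then 3D convolutions for temporal encoding) can be sketched as follows in PyTorch; the VGG-16 cut-off point, the single 3D layer, and the input resolution are assumptions used only to show the data flow, not the paper's semi-CNN configuration.

```python
# Hedged sketch of the fusion idea (not the paper's exact semi-CNN): a pre-trained
# 2D backbone extracts per-frame spatial features, which are stacked along time and
# passed to a small 3D convolutional head for temporal encoding.
import torch
import torch.nn as nn
from torchvision.models import vgg16

backbone = vgg16(weights="IMAGENET1K_V1").features[:17]    # 2D spatial layers, frozen
for p in backbone.parameters():
    p.requires_grad = False

head3d = nn.Sequential(                                     # temporal encoding in 3D
    nn.Conv3d(256, 256, kernel_size=(3, 3, 3), padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool3d(1), nn.Flatten(),
    nn.Linear(256, 101),                                    # 101 classes (UCF-101)
)

clip = torch.randn(2, 16, 3, 112, 112)                      # batch of 16-frame clips
b, t, c, h, w = clip.shape
feats = backbone(clip.reshape(b * t, c, h, w))              # per-frame 2D features
feats = feats.reshape(b, t, *feats.shape[1:]).permute(0, 2, 1, 3, 4)  # (B, C, T, H, W)
print(head3d(feats).shape)                                  # torch.Size([2, 101])
```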

