Attention-Based Automated Feature Extraction for Malware Analysis

Every day, hundreds of thousands of malicious files are created to exploit zero-day vulnerabilities. Existing pattern-based antivirus solutions face difficulties in coping with such a large number of new malicious files. To solve this problem, artificial intelligence (AI)-based malicious file detection methods have been proposed. However, even if we can detect malicious files with high accuracy using deep learning, it is difficult to identify why files are malicious. In this study, we propose a malicious file feature extraction method based on attention mechanism. First, by adapting the attention mechanism, we can identify application program interface (API) system calls that are more important than others for determining whether a file is malicious. Second, we confirm that this approach yields an accuracy that is approximately 12% and 5% higher than a conventional AI-based detection model using convolutional neural networks and skip-connected long short-term memory-based detection model, respectively.

Download Full-text

Deep Recurrent Neural Networks for Automatic Detection of Sleep Apnea from Single Channel Respiration Signals

Sensors ◽

10.3390/s20185037 ◽

2020 ◽

Vol 20 (18) ◽

pp. 5037

Author(s):

Hisham ElMoaqet ◽

Mohammad Eid ◽

Martin Glos ◽

Mutaz Ryalat ◽

Thomas Penzel

Keyword(s):

Feature Extraction ◽

Sleep Apnea ◽

Short Term Memory ◽

Single Channel ◽

Automated Detection ◽

Detection Methods ◽

Short Term ◽

Term Memory ◽

Deep Recurrent Neural Network ◽

Long Short Term Memory

Sleep apnea is a common sleep disorder that causes repeated breathing interruption during sleep. The performance of automated apnea detection methods based on respiratory signals depend on the signals considered and feature extraction methods. Moreover, feature engineering techniques are highly dependent on the experts’ experience and their prior knowledge about different physiological signals and conditions of the subjects. To overcome these problems, a novel deep recurrent neural network (RNN) framework is developed for automated feature extraction and detection of apnea events from single respiratory channel inputs. Long short-term memory (LSTM) and bidirectional long short-term memory (BiLSTM) are investigated to develop the proposed deep RNN model. The proposed framework is evaluated over three respiration signals: Oronasal thermal airflow (FlowTh), nasal pressure (NPRE), and abdominal respiratory inductance plethysmography (ABD). To demonstrate our results, we use polysomnography (PSG) data of 17 patients with obstructive, central, and mixed apnea events. Our results indicate the effectiveness of the proposed framework in automatic extraction for temporal features and automated detection of apneic events over the different respiratory signals considered in this study. Using a deep BiLSTM-based detection model, the NPRE signal achieved the highest overall detection results with true positive rate (sensitivity) = 90.3%, true negative rate (specificity) = 83.7%, and area under receiver operator characteristic curve = 92.4%. The present results contribute a new deep learning approach for automated detection of sleep apnea events from single channel respiration signals that can potentially serve as a helpful and alternative tool for the traditional PSG method.

Download Full-text

DeepHTTP: Anomalous HTTP Traffic Detection and Malicious Pattern Mining Based on Deep Learning

Communications in Computer and Information Science - Cyber Security ◽

10.1007/978-981-33-4922-3_11 ◽

2020 ◽

pp. 141-161

Author(s):

Yuqi Yu ◽

Hanbing Yan ◽

Yuan Ma ◽

Hao Zhou ◽

Hongchao Guan

Keyword(s):

Deep Learning ◽

Web Application ◽

Pattern Mining ◽

Short Term Memory ◽

Attack Detection ◽

Attention Mechanism ◽

Discriminative Ability ◽

Feature Extraction Method ◽

Detection Model ◽

Traffic Detection

AbstractHypertext Transfer Protocol (HTTP) accounts for a large portion of Internet application-layer traffic. Since the payload of HTTP traffic can record website status and user request information, many studies use HTTP protocol traffic for web application attack detection. In this work, we propose DeepHTTP, an HTTP traffic detection framework based on deep learning. Unlike previous studies, this framework not only performs malicious traffic detection but also uses the deep learning model to mine malicious fields of the traffic payload. The detection model is called AT-Bi-LSTM, which is based on Bidirectional Long Short-Term Memory (Bi-LSTM) with attention mechanism. The attention mechanism can improve the discriminative ability and make the result interpretable. To enhance the generalization ability of the model, this paper proposes a novel feature extraction method. Experiments show that DeepHTTP has an excellent performance in malicious traffic discrimination and pattern mining.

Download Full-text

A Fault Feature Extraction Method for DC-DC Converters Based on Automatic Hyperparameter-Optimized One-Dimensional Convolution and Long Short-Term Memory Neural Networks

IEEE Journal of Emerging and Selected Topics in Power Electronics ◽

10.1109/jestpe.2021.3131706 ◽

2021 ◽

pp. 1-1

Author(s):

Yuanyuan Jiang ◽

Ling Xia ◽

Jie Zhang

Keyword(s):

Neural Networks ◽

Feature Extraction ◽

Extraction Method ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

One Dimensional ◽

Feature Extraction Method ◽

Long Short Term Memory ◽

Fault Feature Extraction

Download Full-text

Attention-Based BiLSTM for Negation Handling in Sentimen Analysis

IJCCS (Indonesian Journal of Computing and Cybernetics Systems) ◽

10.22146/ijccs.60733 ◽

2020 ◽

Vol 14 (4) ◽

pp. 397

Author(s):

Riszki Wijayatun Pratiwi ◽

Yunita Sari ◽

Yohanes Suyanto

Keyword(s):

Feature Extraction ◽

Sentiment Analysis ◽

Extraction Method ◽

Short Term Memory ◽

Model Performance ◽

Short Term ◽

Term Memory ◽

Feature Extraction Method ◽

Long Short Term Memory

Research on sentiment analysis in recent years has increased. However, in sentiment analysis research there are still few ideas about the handling of negation, one of which is in the Indonesian sentence. This results in sentences that contain elements of the word negation have not found the exact polarity.The purpose of this research is to analyze the effect of the negation word in Indonesian. Based on positive, neutral and negative classes, using attention-based Long Short Term Memory and word2vec feature extraction method with continuous bag-of-word (CBOW) architecture. The dataset used is data from Twitter. Model performance is seen in the accuracy value.The use of word2vec with CBOW architecture and the addition of layer attention to the Long Short Term Memory (LSTM) and Bidirectional Long Short Term Memory (BiLSTM) methods obtained an accuracy of 78.16% and for BiLSTM resulted in an accuracy of 79.68%. whereas in the FSW algorithm is 73.50% and FWL 73.79%. It can be concluded that attention based BiLSTM has the highest accuracy, but the addition of layer attention in the Long Short Term Memory method is not too significant for negation handling. because the addition of the attention layer cannot determine the words that you want to pay attention to.

Download Full-text

Dam Deformation Interpretation and Prediction Based on a Long Short-Term Memory Model Coupled with an Attention Mechanism

Applied Sciences ◽

10.3390/app11146625 ◽

2021 ◽

Vol 11 (14) ◽

pp. 6625

Author(s):

Yan Su ◽

Kailiang Weng ◽

Chuan Lin ◽

Zeqin Chen

Keyword(s):

Short Term Memory ◽

Attention Mechanism ◽

Impact Factors ◽

Nonlinear Prediction ◽

Time Dimension ◽

Short Term ◽

Deformation Prediction ◽

Term Memory ◽

Long Short Term Memory ◽

Dam Deformation

An accurate dam deformation prediction model is vital to a dam safety monitoring system, as it helps assess and manage dam risks. Most traditional dam deformation prediction algorithms ignore the interpretation and evaluation of variables and lack qualitative measures. This paper proposes a data processing framework that uses a long short-term memory (LSTM) model coupled with an attention mechanism to predict the deformation response of a dam structure. First, the random forest (RF) model is introduced to assess the relative importance of impact factors and screen input variables. Secondly, the density-based spatial clustering of applications with noise (DBSCAN) method is used to identify and filter the equipment based abnormal values to reduce the random error in the measurements. Finally, the coupled model is used to focus on important factors in the time dimension in order to obtain more accurate nonlinear prediction results. The results of the case study show that, of all tested methods, the proposed coupled method performed best. In addition, it was found that temperature and water level both have significant impacts on dam deformation and can serve as reliable metrics for dam management.

Download Full-text

Abnormal Detection of Electricity Consumption of User Based on Particle Swarm Optimization and Long Short Term Memory With the Attention Mechanism

IEEE Access ◽

10.1109/access.2021.3062675 ◽

2021 ◽

Vol 9 ◽

pp. 47252-47265

Author(s):

Jiahao Bian ◽

Lei Wang ◽

Rafal Scherer ◽

Marcin Wozniak ◽

Pengchao Zhang ◽

...

Keyword(s):

Particle Swarm Optimization ◽

Short Term Memory ◽

Particle Swarm ◽

Electricity Consumption ◽

Attention Mechanism ◽

Short Term ◽

Swarm Optimization ◽

Term Memory ◽

Abnormal Detection ◽

Long Short Term Memory

Download Full-text

A Method Based on Attention Mechanism using Bidirectional Long-Short Term Memory(BLSTM) for Question Answering

10.1109/icee52715.2021.9544258 ◽

2021 ◽

Author(s):

Seyed Vahid Moravvej ◽

Mohammad Javad Maleki Kahaki ◽

Moein Salimi Sartakhti ◽

Abdolreza Mirzaei

Keyword(s):

Question Answering ◽

Short Term Memory ◽

Attention Mechanism ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

ConAnomaly: Content-Based Anomaly Detection for System Logs

Sensors ◽

10.3390/s21186125 ◽

2021 ◽

Vol 21 (18) ◽

pp. 6125

Author(s):

Dan Lv ◽

Nurbol Luktarhan ◽

Yiyong Chen

Keyword(s):

Anomaly Detection ◽

Semantic Information ◽

Short Term Memory ◽

Weighted Average ◽

Detection Methods ◽

Detection Model ◽

Part Of Speech Tagging ◽

Part Of Speech ◽

System Logs ◽

System Maintenance

Enterprise systems typically produce a large number of logs to record runtime states and important events. Log anomaly detection is efficient for business management and system maintenance. Most existing log-based anomaly detection methods use log parser to get log event indexes or event templates and then utilize machine learning methods to detect anomalies. However, these methods cannot handle unknown log types and do not take advantage of the log semantic information. In this article, we propose ConAnomaly, a log-based anomaly detection model composed of a log sequence encoder (log2vec) and multi-layer Long Short Term Memory Network (LSTM). We designed log2vec based on the Word2vec model, which first vectorized the words in the log content, then deleted the invalid words through part of speech tagging, and finally obtained the sequence vector by the weighted average method. In this way, ConAnomaly not only captures semantic information in the log but also leverages log sequential relationships. We evaluate our proposed approach on two log datasets. Our experimental results show that ConAnomaly has good stability and can deal with unseen log types to a certain extent, and it provides better performance than most log-based anomaly detection methods.

Download Full-text

Predicting Outcomes of Business Process Executions Based on LSTM Neural Networks and Attention Mechanism

10.21203/rs.3.rs-260970/v1 ◽

2021 ◽

Author(s):

Jiaojiao Wang ◽

Dongjin Yu ◽

Chengfei Liu ◽

Xiaoxiao Sun

Keyword(s):

Short Term Memory ◽

Attention Mechanism ◽

Sequential Data ◽

Time Prediction ◽

Prediction Time ◽

Highly Sensitive ◽

Long Short Term Memory ◽

Early Decision ◽

Lstm Network ◽

Hidden Layer

Abstract To effectively predict the outcome of an on-going process instance helps make an early decision, which plays an important role in so-called predictive process monitoring. Existing methods in this field are tailor-made for some empirical operations such as the prefix extraction, clustering, and encoding, leading that their relative accuracy is highly sensitive to the dataset. Moreover, they have limitations in real-time prediction applications due to the lengthy prediction time. Since Long Short-term Memory (LSTM) neural network provides a high precision in the prediction of sequential data in several areas, this paper investigates LSTM and its enhancements and proposes three different approaches to build more effective and efficient models for outcome prediction. The first move on enhancement is that we combine the original LSTM network from two directions, forward and backward, to capture more features from the completed cases. The second move on enhancement is that we add attention mechanism after extracting features in the hidden layer of LSTM network to distinct them from their attention weight. A series of extensive experiments are evaluated on twelve real datasets when comparing with other approaches. The results show that our approaches outperform the state-of-the-art ones in terms of prediction effectiveness and time performance.

Download Full-text