Malware Classification and Analysis Using Convolutional and Recurrent Neural Network

Author(s):  
Yassine Maleh

Over the past decade, malware has grown exponentially. Traditional signature-based approaches to detecting malware have shown their limitations against new malware, and categorizing malware samples has become essential to understanding the basics of malware behavior. Recently, antivirus solutions have increasingly started to adopt machine learning approaches. Unfortunately, there are few open source data sets available to the academic community. One of the largest data sets available was published last year in a Kaggle competition, with data provided by Microsoft for the Big Data Innovators Gathering. This chapter explores the problem of malware classification. In particular, it proposes an innovative and scalable approach using convolutional neural networks (CNN) and long short-term memory (LSTM) to assign malware samples to their corresponding families. The proposed method achieved a classification accuracy of 98.73% and an average log loss of 0.0698 on the validation data.
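To make the architecture concrete, the following is a minimal sketch, assuming a Keras/TensorFlow stack, of a CNN + LSTM classifier of the kind described above. The sequence length, vocabulary size, layer widths, and the use of raw byte sequences are illustrative assumptions rather than the authors' exact configuration; the nine-family output reflects the labels in the Microsoft dataset, and the cross-entropy objective corresponds to the reported log loss.

```python
# Hedged sketch: CNN + LSTM over byte/opcode sequences for malware family classification.
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_FAMILIES = 9      # malware families labeled in the Microsoft (Kaggle) dataset
VOCAB_SIZE = 257      # byte values 0-255 plus a padding token (assumption)
SEQ_LEN = 10_000      # truncated/padded sequence length (assumption)

model = models.Sequential([
    layers.Input(shape=(SEQ_LEN,)),
    layers.Embedding(VOCAB_SIZE, 64),
    layers.Conv1D(128, kernel_size=7, activation="relu"),  # local byte n-gram patterns
    layers.MaxPooling1D(pool_size=4),
    layers.LSTM(128),                                      # longer-range dependencies
    layers.Dense(NUM_FAMILIES, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",             # the reported log loss
              metrics=["accuracy"])
```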

Algorithms ◽  
2021 ◽  
Vol 14 (8) ◽  
pp. 243
Author(s):  
Shun-Chieh Hsieh

The need for accurate tourism demand forecasting is widely recognized, and the unreliability of traditional methods keeps it a challenging problem. Using deep learning approaches, this study adapts Long Short-Term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), and Gated Recurrent Unit (GRU) networks, which are straightforward and efficient, to improve Taiwan’s tourism demand forecasting. These networks are able to capture the temporal dependence in visitor arrival time series data. The Adam optimization algorithm, with its adaptive learning rate, is used to optimize the basic setup of the models. The results show that the proposed models outperform those of previous studies undertaken during the Severe Acute Respiratory Syndrome (SARS) events of 2002–2003. The article also examines the effects of the current COVID-19 outbreak on tourist arrivals to Taiwan. The results show that the LSTM network and its variants perform satisfactorily for tourism demand forecasting.
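As a rough illustration of the three recurrent forecasters being compared, here is a sketch assuming a Keras/TensorFlow stack and monthly arrival counts framed as sliding windows. The window length, hidden sizes, and learning rate are assumptions rather than the study's settings; only the LSTM/Bi-LSTM/GRU choice and the Adam optimizer follow the abstract.

```python
# Hedged sketch: LSTM / Bi-LSTM / GRU one-step-ahead forecasters for arrival counts.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, models

WINDOW = 12  # previous 12 months predict the next month (assumption)

def make_model(kind="lstm"):
    rnn = {"lstm": layers.LSTM(64),
           "bilstm": layers.Bidirectional(layers.LSTM(64)),
           "gru": layers.GRU(64)}[kind]
    model = models.Sequential([
        layers.Input(shape=(WINDOW, 1)),
        rnn,
        layers.Dense(1),                                   # next-month arrivals
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),  # adaptive LR
                  loss="mse")
    return model

def make_windows(series, window=WINDOW):
    """Turn a 1-D arrival series into (samples, window, 1) inputs and next-step targets."""
    X = np.array([series[i:i + window] for i in range(len(series) - window)])
    y = np.array(series[window:])
    return X[..., None], y
```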


Water ◽  
2019 ◽  
Vol 11 (7) ◽  
pp. 1387 ◽  
Author(s):  
Le ◽  
Ho ◽  
Lee ◽  
Jung

Flood forecasting is an essential requirement in integrated water resource management. This paper proposes a Long Short-Term Memory (LSTM) neural network model for flood forecasting, with daily discharge and rainfall used as input data. Characteristics of the data sets that may influence model performance were also of interest. The Da River basin in Vietnam was chosen as the study area, and two different combinations of input data sets from before 1985 (when the Hoa Binh dam was built) were used for one-, two-, and three-day-ahead flowrate forecasting at Hoa Binh Station. The predictive ability of the model is quite impressive: the Nash–Sutcliffe efficiency (NSE) reached 99%, 95%, and 87% for the three forecasting horizons, respectively. The findings of this study suggest a viable option for flood forecasting on the Da River in Vietnam, where the river basin stretches across several countries and downstream flows (in Vietnam) may fluctuate suddenly due to flood discharge from upstream hydroelectric reservoirs.
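A minimal sketch of such an LSTM flowrate forecaster, assuming a Keras/TensorFlow stack, is given below. The lookback length and layer width are assumptions; the two input variables (daily discharge and rainfall) and the one-, two-, and three-day-ahead targets follow the abstract.

```python
# Hedged sketch: LSTM forecaster with daily discharge and rainfall as inputs.
import tensorflow as tf
from tensorflow.keras import layers, models

LOOKBACK = 30     # days of history fed to the network (assumption)
N_FEATURES = 2    # daily discharge and rainfall
LEAD_DAYS = 3     # 1-, 2-, and 3-day-ahead discharge, predicted jointly

model = models.Sequential([
    layers.Input(shape=(LOOKBACK, N_FEATURES)),
    layers.LSTM(64),
    layers.Dense(LEAD_DAYS),          # one output per forecast horizon
])
model.compile(optimizer="adam", loss="mse")
```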


2020 ◽  
Vol 224 (1) ◽  
pp. 669-681
Author(s):  
Sihong Wu ◽  
Qinghua Huang ◽  
Li Zhao

SUMMARY Late-time transient electromagnetic (TEM) data contain deep subsurface information and are important for resolving deeper electrical structures. However, due to their relatively small signal amplitudes, TEM responses later in time are often dominated by ambient noise. Noise removal is therefore critical to the application of TEM data in imaging electrical structures at depth. De-noising techniques for TEM data have developed rapidly in recent years, but although strong efforts have been made to improve the quality of TEM responses, it remains a challenge to effectively extract the signals in the presence of unpredictable and irregular noise. In this study, we develop a new neural network architecture that combines the long short-term memory (LSTM) network with the autoencoder structure to suppress noise in TEM signals. The resulting LSTM-autoencoder yields excellent performance on synthetic data sets including horizontal components of the electric field and the vertical component of the magnetic field generated by different sources, such as dipole, loop and grounded line sources. The relative errors between the de-noised data sets and the corresponding noise-free transients are below 1% for most of the sampling points. A notable improvement in the resistivity structure inversion result is achieved using TEM data de-noised by the LSTM-autoencoder in comparison with several widely used neural networks, especially for later-arriving signals that are important for constraining deeper structures. We demonstrate the effectiveness and general applicability of the LSTM-autoencoder through de-noising experiments on synthetic 1-D and 3-D TEM signals as well as field data sets. The field data from a fixed-loop survey using multiple receivers are greatly improved after de-noising by the LSTM-autoencoder, resulting in more consistent inversion models with significantly increased exploration depth. The LSTM-autoencoder is capable of enhancing the quality of TEM signals at later times, which enables us to better resolve deeper electrical structures.
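The following is a minimal sketch, assuming a Keras/TensorFlow stack, of an LSTM-autoencoder of the general kind described: an LSTM encoder compresses a noisy transient, and an LSTM decoder reconstructs the noise-free signal. The number of time gates and the layer widths are assumptions, and the actual network may differ in depth and training details.

```python
# Hedged sketch: sequence-to-sequence LSTM-autoencoder for de-noising TEM transients.
import tensorflow as tf
from tensorflow.keras import layers, models

N_GATES = 60      # samples (time gates) per transient (assumption)

model = models.Sequential([
    layers.Input(shape=(N_GATES, 1)),
    layers.LSTM(64),                               # encoder: compress the noisy transient
    layers.RepeatVector(N_GATES),
    layers.LSTM(64, return_sequences=True),        # decoder: reconstruct the clean signal
    layers.TimeDistributed(layers.Dense(1)),
])
# Trained on pairs of (noisy synthetic transient, noise-free synthetic transient).
model.compile(optimizer="adam", loss="mse")
```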


Computers ◽  
2019 ◽  
Vol 8 (1) ◽  
pp. 4 ◽  
Author(s):  
Jurgita Kapočiūtė-Dzikienė ◽  
Robertas Damaševičius ◽  
Marcin Woźniak

We describe sentiment analysis experiments performed on a Lithuanian Internet comment dataset using traditional machine learning (Naïve Bayes Multinomial—NBM and Support Vector Machine—SVM) and deep learning (Long Short-Term Memory—LSTM and Convolutional Neural Network—CNN) approaches. The traditional machine learning techniques were used with features based on lexical, morphological, and character information. The deep learning approaches were applied on top of two types of word embeddings (Word2Vec continuous bag-of-words with negative sampling and FastText). Both traditional and deep learning approaches had to solve the positive/negative/neutral sentiment classification task on balanced and full dataset versions. The best deep learning result (an accuracy of 0.706) was achieved on the full dataset with a CNN applied on top of the FastText embeddings, with emoticons replaced and diacritics eliminated. The traditional machine learning approaches demonstrated the best performance (an accuracy of 0.735) on the full dataset with the NBM method, replaced emoticons, restored diacritics, and lemma unigrams as features. Although the traditional machine learning approaches were superior to the deep learning methods, deep learning demonstrated good results when applied to small datasets.
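For the best-performing deep configuration (a CNN on top of FastText embeddings), a minimal sketch assuming a Keras/TensorFlow stack is shown below. The vocabulary size, sequence length, and filter setup are assumptions; the frozen pretrained embeddings and the three-class output follow the abstract.

```python
# Hedged sketch: 1-D CNN over pretrained embeddings for 3-class sentiment.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, models

MAX_LEN = 100        # tokens per comment (assumption)
EMB_DIM = 300        # FastText vector size
VOCAB = 50_000       # assumption

# In practice this matrix is filled with the pretrained FastText vectors.
embedding_matrix = np.zeros((VOCAB, EMB_DIM), dtype="float32")

model = models.Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB, EMB_DIM,
                     embeddings_initializer=tf.keras.initializers.Constant(embedding_matrix),
                     trainable=False),
    layers.Conv1D(128, 5, activation="relu"),
    layers.GlobalMaxPooling1D(),
    layers.Dense(3, activation="softmax"),         # positive / negative / neutral
])
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
```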


2020 ◽  
Vol 34 (04) ◽  
pp. 4206-4214
Author(s):  
Zhongzhan Huang ◽  
Senwei Liang ◽  
Mingfu Liang ◽  
Haizhao Yang

Attention networks have successfully boosted performance in various vision problems. Previous works emphasize designing a new attention module and plugging it into the network at individual layers. This paper proposes a novel and simple framework that shares an attention module throughout different network layers to encourage the integration of layer-wise information; this parameter-sharing module is referred to as the Dense-and-Implicit-Attention (DIA) unit. Many choices of module can be used in the DIA unit. Since Long Short-Term Memory (LSTM) has the capacity to capture long-distance dependencies, we focus on the case where the DIA unit is a modified LSTM (called DIA-LSTM). Experiments on benchmark datasets show that the DIA-LSTM unit is capable of emphasizing layer-wise feature interrelation and leads to significant improvement in image classification accuracy. We further show empirically that DIA-LSTM has a strong regularization effect, stabilizing the training of deep networks in experiments where skip connections (He et al. 2016a) or Batch Normalization (Ioffe and Szegedy 2015) are removed from the whole residual network.
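To show the parameter-sharing idea in code, here is a rough PyTorch sketch of a DIA-style unit: a single LSTM cell shared by all layers that turns each layer's pooled feature map into channel-wise attention, carrying its hidden state from layer to layer. The pooling choice, hidden size, and sigmoid gating are assumptions; the published DIA-LSTM modifies the LSTM internals further.

```python
# Hedged sketch: one attention module (an LSTM cell) shared across network layers.
import torch
import torch.nn as nn

class DIAUnit(nn.Module):
    def __init__(self, channels, hidden=None):
        super().__init__()
        hidden = hidden or channels
        self.cell = nn.LSTMCell(channels, hidden)   # shared by all layers
        self.proj = nn.Linear(hidden, channels)
        self.state = None                           # (h, c), carried layer to layer

    def reset(self, batch_size, device):
        h = torch.zeros(batch_size, self.cell.hidden_size, device=device)
        self.state = (h, h.clone())

    def forward(self, feature_map):                 # feature_map: (B, C, H, W)
        squeezed = feature_map.mean(dim=(2, 3))     # global average pooling -> (B, C)
        h, c = self.cell(squeezed, self.state)      # recurrent state links the layers
        self.state = (h, c)
        attn = torch.sigmoid(self.proj(h)).unsqueeze(-1).unsqueeze(-1)
        return feature_map * attn                   # channel re-weighting

# Usage: call reset() once per forward pass of the backbone, then apply the same
# unit after every block so layer-wise information accumulates in the LSTM state.
dia = DIAUnit(channels=64)
dia.reset(batch_size=8, device=torch.device("cpu"))
x = torch.randn(8, 64, 32, 32)
y = dia(x)                                          # same shape, re-weighted channels
```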


2020 ◽  
Vol 26 (11) ◽  
pp. 1422-1434
Author(s):  
Vibekananda Dutta ◽  
Michał Choraś ◽  
Marek Pawlicki ◽  
Rafał Kozik

Artificial Intelligence plays a significant role in building effective cybersecurity tools. Security has a crucial role in the modern digital world and has become an essential area of research. Network Intrusion Detection Systems (NIDS) are among the first security systems that encounter network attacks and facilitate attack detection to protect a network. Contemporary machine learning approaches, such as novel neural network architectures, are succeeding in network intrusion detection. This paper tests modern machine learning approaches on a novel cybersecurity benchmark IoT dataset. Among other algorithms, a Deep AutoEncoder (DAE) and a modified Long Short-Term Memory (mLSTM) network are employed to detect network anomalies in the IoT-23 dataset. The DAE is employed for dimensionality reduction, and a host of ML methods, including Deep Neural Networks and Long Short-Term Memory networks, classify the resulting representations as normal or malicious. The applied method is validated on the IoT-23 dataset, and the results of the analysis in terms of evaluation metrics are discussed.
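A hedged sketch of the two-stage idea (a deep autoencoder for dimensionality reduction feeding a classifier that labels traffic as normal or malicious) is given below, assuming a Keras/TensorFlow stack. The feature count, layer sizes, and binary output are assumptions; the paper's mLSTM variant and its exact wiring are not reproduced here.

```python
# Hedged sketch: deep autoencoder for compression + classifier on the encoded features.
import tensorflow as tf
from tensorflow.keras import layers, models

N_FEATURES = 40   # flow-level features extracted from IoT-23 (assumption)
ENCODED = 8       # compressed representation size (assumption)

encoder = models.Sequential([
    layers.Input(shape=(N_FEATURES,)),
    layers.Dense(32, activation="relu"),
    layers.Dense(ENCODED, activation="relu"),
])
decoder = models.Sequential([
    layers.Input(shape=(ENCODED,)),
    layers.Dense(32, activation="relu"),
    layers.Dense(N_FEATURES),
])
autoencoder = models.Sequential([encoder, decoder])
autoencoder.compile(optimizer="adam", loss="mse")           # reconstruction objective

classifier = models.Sequential([
    encoder,                                                # reuse the learned compression
    layers.Dense(16, activation="relu"),
    layers.Dense(1, activation="sigmoid"),                  # normal vs. malicious
])
classifier.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```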


2021 ◽  
Author(s):  
Kevin B. Dsouza ◽  
Adam Y. Li ◽  
Vijay K. Bhargava ◽  
Maxwell W. Libbrecht

Abstract The availability of thousands of assays of epigenetic activity necessitates compressed representations of these data sets that summarize the epigenetic landscape of the genome. Until recently, most such representations were cell-type specific, applying to a single tissue or cell state. Recently, neural networks have made it possible to summarize data across tissues and produce a pan-cell-type representation. In this work, we propose Epi-LSTM, a deep long short-term memory (LSTM) recurrent neural network autoencoder that captures the long-term dependencies in epigenomic data. The latent representations from Epi-LSTM capture a variety of genomic phenomena, including gene expression, promoter–enhancer interactions, replication timing, frequently interacting regions and evolutionary conservation. These representations outperform existing methods in a majority of cell types, while yielding smoother representations along the genomic axis due to their sequential nature.
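As a rough sketch of the idea, assuming a Keras/TensorFlow stack, an LSTM autoencoder can compress windows of multi-assay epigenomic signal into latent vectors and reconstruct them, with the encoder's outputs serving as the compressed representation. The window length, number of assay tracks, and latent size below are assumptions, and the actual Epi-LSTM may operate at a different resolution and depth.

```python
# Hedged sketch: LSTM autoencoder over windows of multi-assay epigenomic signal.
import tensorflow as tf
from tensorflow.keras import layers, Model

WINDOW = 100      # genomic bins per window (assumption)
N_ASSAYS = 24     # epigenetic assay tracks (assumption)
LATENT = 32       # latent representation size (assumption)

inputs = layers.Input(shape=(WINDOW, N_ASSAYS))
latent = layers.LSTM(LATENT)(inputs)                          # per-window embedding
decoded = layers.RepeatVector(WINDOW)(latent)
decoded = layers.LSTM(LATENT, return_sequences=True)(decoded)
decoded = layers.TimeDistributed(layers.Dense(N_ASSAYS))(decoded)

autoencoder = Model(inputs, decoded)
encoder = Model(inputs, latent)          # produces the compressed genome representation
autoencoder.compile(optimizer="adam", loss="mse")
```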


2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Hyejin Cho ◽  
Hyunju Lee

Abstract Background: In biomedical text mining, named entity recognition (NER) is an important task used to extract information from biomedical articles. Previously proposed methods for NER are dictionary- or rule-based methods and machine learning approaches; however, these traditional approaches rely heavily on large-scale dictionaries, target-specific rules, or well-constructed corpora. They have been superseded by deep learning-based approaches that are independent of hand-crafted features. However, although such NER methods employ additional conditional random fields (CRF) to capture important correlations between neighboring labels, they often do not incorporate all the contextual information from the text into the deep learning layers. Results: We propose an NER system for biomedical entities that incorporates n-grams with a bi-directional long short-term memory (BiLSTM) network and a CRF; this system is referred to as contextual long short-term memory networks with CRF (CLSTM). We assess the CLSTM model on three corpora: the disease corpus of the National Center for Biotechnology Information (NCBI), the BioCreative II Gene Mention corpus (GM), and the BioCreative V Chemical Disease Relation corpus (CDR). Our framework was compared with several deep learning approaches, such as BiLSTM, BiLSTM with CRF, GRAM-CNN, and BERT. On the NCBI corpus, our model recorded an F-score of 85.68% for disease NER, an improvement of 1.50% over previous methods. Moreover, although BERT used transfer learning over more than 2.5 billion words, our system showed comparable performance, with an F-score of 81.44% for gene NER on the GM corpus, and outperformed BERT with an F-score of 86.44% for the NER of chemicals and diseases on the CDR corpus. We conclude that our method significantly improves performance on biomedical NER tasks. Conclusion: The proposed approach is robust in recognizing biological entities in text.
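As a minimal illustration, assuming a Keras/TensorFlow stack, a BiLSTM tagger producing per-token emission scores might look like the sketch below; the CRF transition layer that the paper places on top, the n-gram/contextual features, and all dimensions are omitted or assumed here.

```python
# Hedged sketch: BiLSTM emitting per-token tag scores for biomedical NER (CRF omitted).
import tensorflow as tf
from tensorflow.keras import layers, models

VOCAB = 30_000    # word vocabulary size (assumption)
MAX_LEN = 128     # tokens per sentence (assumption)
N_TAGS = 3        # e.g. B / I / O for a single entity type (assumption)

model = models.Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB, 100, mask_zero=True),
    layers.Bidirectional(layers.LSTM(100, return_sequences=True)),
    layers.TimeDistributed(layers.Dense(N_TAGS, activation="softmax")),  # emission scores
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```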


2018 ◽  
Vol 7 (2.27) ◽  
pp. 88 ◽  
Author(s):  
Merin Thomas ◽  
Latha C.A

Sentiment analysis has been an important topic of discussion for two decades, since Lee published the first paper on sentiment analysis in 2002. Beyond English, it has spread to other natural languages, which is especially significant in a multilingual country like India. Traditional machine learning approaches have delivered good accuracy for this analysis, and deep learning approaches have gained momentum in recent years. Because deep learning mimics human learning, it is expected to reach higher levels of accuracy. In this paper we implement sentiment analysis of tweets in the South Indian language Malayalam. The model used is a recurrent neural network with Long Short-Term Memory (LSTM), a deep learning technique, to predict sentiment. The achieved accuracy was found to increase with the quality and depth of the datasets.
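A minimal sketch of such an LSTM sentiment classifier, assuming a Keras/TensorFlow stack, is shown below; the vocabulary size, tweet length, layer widths, and binary (positive/negative) output are assumptions rather than the paper's configuration.

```python
# Hedged sketch: LSTM classifier for tweet-level sentiment.
import tensorflow as tf
from tensorflow.keras import layers, models

VOCAB = 20_000    # tokenizer vocabulary size (assumption)
MAX_LEN = 40      # tokens per tweet (assumption)

model = models.Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB, 128),
    layers.LSTM(64),
    layers.Dropout(0.3),
    layers.Dense(1, activation="sigmoid"),   # positive vs. negative sentiment
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```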

