Information Extraction of Cybersecurity Concepts: An LSTM Approach

Extracting cybersecurity entities and the relationships between them from online textual resources such as articles, bulletins, and blogs and converting these resources into more structured and formal representations has important applications in cybersecurity research and is valuable for professional practitioners. Previous works to accomplish this task were mainly based on utilizing feature-based models. Feature-based models are time-consuming and need labor-intensive feature engineering to describe the properties of entities, domain knowledge, entity context, and linguistic characteristics. Therefore, to alleviate the need for feature engineering, we propose the usage of neural network models, specifically the long short-term memory (LSTM) models to accomplish the tasks of Named Entity Recognition (NER) and Relation Extraction (RE). We evaluated the proposed models on two tasks. The first task is performing NER and evaluating the results against the state-of-the-art Conditional Random Fields (CRFs) method. The second task is performing RE using three LSTM models and comparing their results to assess which model is more suitable for the domain of cybersecurity. The proposed models achieved competitive performance with less feature-engineering work. We demonstrate that exploiting neural network models in cybersecurity text mining is effective and practical.

Download Full-text

An Experimental Study of Hybrid Machine Learning Models for Extracting Named Entities

10.29007/dp5m ◽

2019 ◽

Author(s):

Lei Jiang ◽

Elena Bolshakova

Keyword(s):

Neural Network ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Network Models ◽

Entity Recognition ◽

Neural Network Models ◽

Named Entities ◽

Hybrid Neural Network ◽

Named Entity ◽

Two Hybrid

The paper describes two hybrid neural network models for named entity recognition (NER) in texts, as well as results of experiments with them. The first model, namely Bi-LSTM-CRF, is known and used for NER, while the other model named Gated-CNN- CRF is proposed in this work. It combines convolutional neural network (CNN), gated linear units, and conditional random fields (CRF). Both models were tested for NER on three different language datasets, for English, Russian, and Chinese. All resulted scores of precision, recall and F1-measure for both models are close to the state-of-the-art for NER, and for the English dataset CoNLL-2003, Gated-CNN-CRF model achieves 92.66 of F1-measure, outperforming the known result.

Download Full-text

Delayed Combination of Feature Embedding in Bidirectional LSTM CRF for NER

Applied Sciences ◽

10.3390/app10217557 ◽

2020 ◽

Vol 10 (21) ◽

pp. 7557

Author(s):

Chirawan Ronran ◽

Seungwoo Lee ◽

Hong Jun Jang

Keyword(s):

Neural Network ◽

Language Processing ◽

Short Term Memory ◽

Conditional Random Field ◽

Named Entity Recognition ◽

Network Models ◽

Word Embedding ◽

Entity Recognition ◽

Neural Network Models ◽

Bidirectional Lstm

Named Entity Recognition (NER) plays a vital role in natural language processing (NLP). Currently, deep neural network models have achieved significant success in NER. Recent advances in NER systems have introduced various feature selections to identify appropriate representations and handle Out-Of-the-Vocabulary (OOV) words. After selecting the features, they are all concatenated at the embedding layer before being fed into a model to label the input sequences. However, when concatenating the features, information collisions may occur and this would cause the limitation or degradation of the performance. To overcome the information collisions, some works tried to directly connect some features to latter layers, which we call the delayed combination and show its effectiveness by comparing it to the early combination. As feature encodings for input, we selected the character-level Convolutional Neural Network (CNN) or Long Short-Term Memory (LSTM) word encoding, the pre-trained word embedding, and the contextual word embedding and additionally designed CNN-based sentence encoding using a dictionary. These feature encodings are combined at early or delayed position of the bidirectional LSTM Conditional Random Field (CRF) model according to each feature’s characteristics. We evaluated the performance of this model on the CoNLL 2003 and OntoNotes 5.0 datasets using the F1 score and compared the delayed combination model with our own implementation of the early combination as well as the previous works. This comparison convinces us that our delayed combination is more effective than the early one and also highly competitive.

Download Full-text

End-to-End Recurrent Neural Network Models for Vietnamese Named Entity Recognition: Word-Level Vs. Character-Level

Communications in Computer and Information Science - Computational Linguistics ◽

10.1007/978-981-10-8438-6_18 ◽

2018 ◽

pp. 219-232 ◽

Cited By ~ 5

Author(s):

Thai-Hoang Pham ◽

Phuong Le-Hong

Keyword(s):

Neural Network ◽

Recurrent Neural Network ◽

Named Entity Recognition ◽

Network Models ◽

Entity Recognition ◽

Neural Network Models ◽

Named Entity ◽

Word Level ◽

End To End

Download Full-text

Application of Deep Learning for Characterization of Drivers’ Engagement in Secondary Tasks in In-Vehicle Systems

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/0361198120926507 ◽

2020 ◽

Vol 2674 (8) ◽

pp. 429-440

Author(s):

Osama A. Osman ◽

Hesham Rakha

Keyword(s):

Neural Network ◽

Deep Learning ◽

Time Series Data ◽

Short Term Memory ◽

Network Models ◽

Sensor Technology ◽

Series Data ◽

Neural Network Models ◽

Secondary Tasks ◽

Driving Assistance

Distracted driving (i.e., engaging in secondary tasks) is an epidemic that threatens the lives of thousands every year. Data collected from vehicular sensor technologies and through connectivity provide comprehensive information that, if used to detect driver engagement in secondary tasks, could save thousands of lives and millions of dollars. This study investigates the possibility of achieving this goal using promising deep learning tools. Specifically, two deep neural network models (a multilayer perceptron neural network model and a long short-term memory networks [LSTMN] model) were developed to identify three secondary tasks: cellphone calling, cellphone texting, and conversation with adjacent passengers. The Second Strategic Highway Research Program Naturalistic Driving Study (SHRP 2 NDS) time series data, collected using vehicle sensor technology, were used to train and test the model. The results show excellent performance for the developed models, with a slight improvement for the LSTMN model, with overall classification accuracies ranging between 95 and 96%. Specifically, the models are able to identify the different types of secondary tasks with high accuracies of 100% for calling, 96%–97% for texting, 90%–91% for conversation, and 95%–96% for the normal driving. Based on this performance, the developed models improve on the results of a previous model developed by the author to classify the same three secondary tasks, which had an accuracy of 82%. The model is promising for use in in-vehicle driving assistance technology to report engagement in unlawful tasks or alert drivers to take over control in level 1 and 2 automated vehicles.

Download Full-text

Forecasting variance of NiftyIT index with RNN and DNN

Journal of Physics Conference Series ◽

10.1088/1742-6596/2161/1/012005 ◽

2022 ◽

Vol 2161 (1) ◽

pp. 012005

Author(s):

C R Karthik ◽

Raghunandan ◽

B Ashwath Rao ◽

N V Subba Reddy

Keyword(s):

Neural Network ◽

Time Series ◽

Time Series Analysis ◽

Short Term Memory ◽

Stock Exchange ◽

Network Models ◽

Training Data ◽

Complex Data ◽

Neural Network Models ◽

Series Analysis

Abstract A time series is an order of observations engaged serially in time. The prime objective of time series analysis is to build mathematical models that provide reasonable descriptions from training data. The goal of time series analysis is to forecast the forthcoming values of a series based on the history of the same series. Forecasting of stock markets is a thought-provoking problem because of the number of possible variables as well as volatile noise that may contribute to the prices of the stock. However, the capability to analyze stock market leanings could be vital to investors, traders and researchers, hence has been of continued interest. Plentiful arithmetical and machine learning practices have been discovered for stock analysis and forecasting/prediction. In this paper, we perform a comparative study on two very capable artificial neural network models i) Deep Neural Network (DNN) and ii) Long Short-Term Memory (LSTM) a type of recurrent neural network (RNN) in predicting the daily variance of NIFTYIT in BSE (Bombay Stock Exchange) and NSE (National Stock Exchange) markets. DNN was chosen due to its capability to handle complex data with substantial performance and better generalization without being saturated. LSTM model was decided, as it contains intermediary memory which can hold the historic patterns and occurrence of the next prediction depends on the values that preceded it. With both networks, measures were taken to reduce overfitting. Daily predictions of the NIFTYIT index were made to test the generalizability of the models. Both networks performed well at making daily predictions, and both generalized admirably to make daily predictions of the NiftyIT data. The LSTM-RNN outpaced the DNN in terms of forecasting and thus, grips more potential for making longer-term estimates.

Download Full-text

Investigating the Differences in Predictive Oculomotor Strategies using Long Short-Term Memory Recurrent Neural Network Models

Journal of Vision ◽

10.1167/18.10.184 ◽

2018 ◽

Vol 18 (10) ◽

pp. 184

Author(s):

Kamran Binaee ◽

Rakshit Kothari ◽

Jeff Pelz ◽

Gabriel Diaz

Keyword(s):

Neural Network ◽

Recurrent Neural Network ◽

Short Term Memory ◽

Network Models ◽

Short Term ◽

Neural Network Models ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

A GARCH Model with Artificial Neural Networks

Information ◽

10.3390/info11100489 ◽

2020 ◽

Vol 11 (10) ◽

pp. 489

Author(s):

Wing Ki Liu ◽

Mike K. P. So

Keyword(s):

Neural Network ◽

Short Term Memory ◽

Garch Model ◽

Network Models ◽

Variable Importance ◽

Neural Network Models ◽

Out Of Sample ◽

Importance Analysis ◽

Artificial Neural ◽

T Distribution

In this paper, we incorporate a GARCH model into an artificial neural network (ANN) for financial volatility modeling and estimate the parameters in Tensorflow. Our goal was to better predict stock volatility. We evaluate the performance of the models using the mean absolute errors of powers of the out-of-sample returns between 2 March 2018 and 28 February 2020. Our results show that our modeling procedure with an ANN can outperform the standard GARCH(1,1) model with standardized Student’s t distribution. Our variable importance analysis shows that Net Debt/EBITA is among the six most important predictor variables in all of the neural network models we have examined. The main contribution of this paper is that we propose a Long Short-Term Memory (LSTM) model with a GARCH framework because LSTM can systematically take into consideration potential nonlinearity in volatility structure at different time points. One of the advantages of our research is that the proposed models are easy to implement because our proposed models can be run in Tensorflow, a Python package that enables fast and automatic optimization. Another advantage is that the proposed models enable variable importance analysis.

Download Full-text

Methods for Integrating Extraterrestrial Radiation into Neural Network Models for Day-Ahead PV Generation Forecasting

Energies ◽

10.3390/en14092601 ◽

2021 ◽

Vol 14 (9) ◽

pp. 2601

Author(s):

Seung Chan Jo ◽

Young Gyu Jin ◽

Yong Tae Yoon ◽

Ho Chan Kim

Keyword(s):

Neural Network ◽

Short Term Memory ◽

Mean Squared Error ◽

Network Models ◽

Scheduling Problems ◽

Neural Network Models ◽

Intermediate Target ◽

The Neural Network ◽

Forecasting Performance ◽

Pv Generation

Variability, intermittency, and limited controllability are inherent characteristics of photovoltaic (PV) generation that result in inaccurate solutions to scheduling problems and the instability of the power grid. As the penetration level of PV generation increases, it becomes more important to mitigate these problems by improving forecasting accuracy. One of the alternatives to improving forecasting performance is to include a seasonal component. Thus, this study proposes using information on extraterrestrial radiation (ETR), which is the solar radiation outside of the atmosphere, in neural network models for day-ahead PV generation forecasting. Specifically, five methods for integrating the ETR into the neural network models are presented: (1) division preprocessing, (2) multiplication preprocessing, (3) replacement of existing input, (4) inclusion as additional input, and (5) inclusion as an intermediate target. The methods were tested using two datasets in Australia using four neural network models: Multilayer perceptron and three recurrent neural network(RNN)-based models including vanilla RNN, long short-term memory, and gated recurrent unit. It was found that, among the integration methods, including the ETR as the intermediate target improved the mean squared error by 4.1% on average, and by 12.28% at most in RNN-based models. These results verify that the integration of ETR into the PV forecasting models based on neural networks can improve the forecasting performance.

Download Full-text

Anomaly Detection Using XGBoost Ensemble of Deep Neural Network Models

Cybernetics and Information Technologies ◽

10.2478/cait-2021-0037 ◽

2021 ◽

Vol 21 (3) ◽

pp. 175-188

Author(s):

Sumaiya Thaseen Ikram ◽

Aswani Kumar Cherukuri ◽

Babu Poorva ◽

Pamidi Sai Ushasree ◽

Yishuo Zhang ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Anomaly Detection ◽

Deep Neural Network ◽

Short Term Memory ◽

Network Models ◽

Neural Network Models ◽

Detection Model ◽

Detection Systems ◽

Learning Techniques

Abstract Intrusion Detection Systems (IDSs) utilise deep learning techniques to identify intrusions with maximum accuracy and reduce false alarm rates. The feature extraction is also automated in these techniques. In this paper, an ensemble of different Deep Neural Network (DNN) models like MultiLayer Perceptron (MLP), BackPropagation Network (BPN) and Long Short Term Memory (LSTM) are stacked to build a robust anomaly detection model. The performance of the ensemble model is analysed on different datasets, namely UNSW-NB15 and a campus generated dataset named VIT_SPARC20. Other types of traffic, namely unencrypted normal traffic, normal encrypted traffic, encrypted and unencrypted malicious traffic, are captured in the VIT_SPARC20 dataset. Encrypted normal and malicious traffic of VIT_SPARC20 is categorised by the deep learning models without decrypting its contents, thus preserving the confidentiality and integrity of the data transmitted. XGBoost integrates the results of each deep learning model to achieve higher accuracy. From experimental analysis, it is inferred that UNSW_ NB results in a maximal accuracy of 99.5%. The performance of VIT_SPARC20 in terms of accuracy, precision and recall are 99.4%. 98% and 97%, respectively.

Download Full-text

AimeLaw at ALQAC 2021: Enriching Neural Network Models with Legal-Domain Knowledge

10.1109/kse53942.2021.9648636 ◽

2021 ◽

Author(s):

Ngo Quang Huy ◽

Nguyen Manh Duc Tuan ◽

Nguyen Anh Duong ◽

Pham Quang Nhat Minh

Keyword(s):

Neural Network ◽

Domain Knowledge ◽

Network Models ◽

Neural Network Models ◽

Legal Domain

Download Full-text