A Hierarchical Long Short-Term Memory Encoder-Decoder Model for Abstractive Summarization

Text summarization (TS) is considered one of the most difficult tasks in natural language processing (NLP). It is one of the most important challenges that stand against the modern computer system’s capabilities with all its new improvement. Many papers and research studies address this task in literature but are being carried out in extractive summarization, and few of them are being carried out in abstractive summarization, especially in the Arabic language due to its complexity. In this paper, an abstractive Arabic text summarization system is proposed, based on a sequence-to-sequence model. This model works through two components, encoder and decoder. Our aim is to develop the sequence-to-sequence model using several deep artificial neural networks to investigate which of them achieves the best performance. Different layers of Gated Recurrent Units (GRU), Long Short-Term Memory (LSTM), and Bidirectional Long Short-Term Memory (BiLSTM) have been used to develop the encoder and the decoder. In addition, the global attention mechanism has been used because it provides better results than the local attention mechanism. Furthermore, AraBERT preprocess has been applied in the data preprocessing stage that helps the model to understand the Arabic words and achieves state-of-the-art results. Moreover, a comparison between the skip-gram and the continuous bag of words (CBOW) word2Vec word embedding models has been made. We have built these models using the Keras library and run-on Google Colab Jupiter notebook to run seamlessly. Finally, the proposed system is evaluated through ROUGE-1, ROUGE-2, ROUGE-L, and BLEU evaluation metrics. The experimental results show that three layers of BiLSTM hidden states at the encoder achieve the best performance. In addition, our proposed system outperforms the other latest research studies. Also, the results show that abstractive summarization models that use the skip-gram word2Vec model outperform the models that use the CBOW word2Vec model.

Download Full-text

Sleep Breathing Disorders Detection with Bioradar Using a Long Short-Term Memory Network

2020 XXXIIIrd General Assembly and Scientific Symposium of the International Union of Radio Science ◽

10.23919/ursigass49373.2020.9232203 ◽

2020 ◽

Author(s):

Lesya Anishchenko ◽

Ludmila Korostovtseva ◽

Mikhail Bochkarev ◽

Yurii Sviryaev

Keyword(s):

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Sleep Breathing Disorders ◽

Breathing Disorders ◽

Memory Network ◽

Long Short Term Memory

Download Full-text

Incorporating Financial News for Forecasting Bitcoin Prices Based on Long Short-Term Memory Networks

SSRN Electronic Journal ◽

10.2139/ssrn.3733398 ◽

2020 ◽

Author(s):

Abdolreza Nazemi ◽

Johannes Jakubik ◽

Andreas Geyer-Schulz ◽

Frank J. Fabozzi

Keyword(s):

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Financial News ◽

Long Short Term Memory

Download Full-text

Unit Selection with Hierarchical Cascaded Long Short Term Memory Bidirectional Recurrent Neural Nets

10.21437/interspeech.2017-428 ◽

2017 ◽

Cited By ~ 2

Author(s):

Vincent Pollet ◽

Enrico Zovato ◽

Sufian Irhimeh ◽

Pier Batzu

Keyword(s):

Short Term Memory ◽

Neural Nets ◽

Short Term ◽

Unit Selection ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Improving Mandarin Tone Recognition Using Convolutional Bidirectional Long Short-Term Memory with Attention

10.21437/interspeech.2018-2561 ◽

2018 ◽

Author(s):

Longfei Yang ◽

Yanlu Xie ◽

Jinsong Zhang

Keyword(s):

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Tone Recognition ◽

Long Short Term Memory ◽

Mandarin Tone

Download Full-text

Articulatory-to-speech Conversion Using Bi-directional Long Short-term Memory

10.21437/interspeech.2018-999 ◽

2018 ◽

Cited By ~ 1

Author(s):

Fumiaki Taguchi ◽

Tokihiko Kaburagi

Keyword(s):

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Long Short-term Memory for Tibetan Speech Recognition

2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) ◽

10.1109/itnec48623.2020.9084681 ◽

2020 ◽

Author(s):

Weizhe Wang ◽

Ziyan Chen ◽

Hongwu Yang

Keyword(s):

Speech Recognition ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Application of Improved Long-short-term Memory Network in Human Morphology Detection

2019 IEEE 7th International Conference on Computer Science and Network Technology (ICCSNT) ◽

10.1109/iccsnt47585.2019.8962454 ◽

2019 ◽

Author(s):

Ming Huang ◽

Tao Wen ◽

Xu Liang

Keyword(s):

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Human Morphology ◽

Memory Network ◽

Long Short Term Memory

Download Full-text

Stock price trend prediction with long short-term memory neural networks

International Journal of Computational Intelligence Studies ◽

10.1504/ijcistudies.2019.103619 ◽

2019 ◽

Vol 8 (4) ◽

pp. 289

Author(s):

Varun Gupta ◽

Mujahid Ahmad

Keyword(s):

Neural Networks ◽

Stock Price ◽

Short Term Memory ◽

Short Term ◽

Trend Prediction ◽

Price Trend ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Dam Deformation Interpretation and Prediction Based on a Long Short-Term Memory Model Coupled with an Attention Mechanism

Applied Sciences ◽

10.3390/app11146625 ◽

2021 ◽

Vol 11 (14) ◽

pp. 6625

Author(s):

Yan Su ◽

Kailiang Weng ◽

Chuan Lin ◽

Zeqin Chen

Keyword(s):

Short Term Memory ◽

Attention Mechanism ◽

Impact Factors ◽

Nonlinear Prediction ◽

Time Dimension ◽

Short Term ◽

Deformation Prediction ◽

Term Memory ◽

Long Short Term Memory ◽

Dam Deformation

An accurate dam deformation prediction model is vital to a dam safety monitoring system, as it helps assess and manage dam risks. Most traditional dam deformation prediction algorithms ignore the interpretation and evaluation of variables and lack qualitative measures. This paper proposes a data processing framework that uses a long short-term memory (LSTM) model coupled with an attention mechanism to predict the deformation response of a dam structure. First, the random forest (RF) model is introduced to assess the relative importance of impact factors and screen input variables. Secondly, the density-based spatial clustering of applications with noise (DBSCAN) method is used to identify and filter the equipment based abnormal values to reduce the random error in the measurements. Finally, the coupled model is used to focus on important factors in the time dimension in order to obtain more accurate nonlinear prediction results. The results of the case study show that, of all tested methods, the proposed coupled method performed best. In addition, it was found that temperature and water level both have significant impacts on dam deformation and can serve as reliable metrics for dam management.

Download Full-text