Multilabel Text Classification in News Articles Using Long-Term Memory with Word2Vec

Winda Kurnia Sari; Dian Palupi Rini;  Reza Firsandaya Malik;  Iman Saladin B. Azhar

doi:10.29207/resti.v4i2.1655

Multilabel Text Classification in News Articles Using Long-Term Memory with Word2Vec

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v4i2.1655 ◽

2020 ◽

Vol 4 (2) ◽

pp. 276-285

Author(s):

Winda Kurnia Sari ◽

Dian Palupi Rini ◽

Reza Firsandaya Malik ◽

Iman Saladin B. Azhar

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Text Classification ◽

Large Scale ◽

Short Term Memory ◽

Short Term ◽

Learning Methods ◽

Processing Variable ◽

Term Memory ◽

Long Short Term Memory

Multilabel text classification is a task of categorizing text into one or more categories. Like other machine learning, multilabel classification performance is limited to the small labeled data and leads to the difficulty of capturing semantic relationships. It requires a multilabel text classification technique that can group four labels from news articles. Deep Learning is a proposed method for solving problems in multilabel text classification techniques. Some of the deep learning methods used for text classification include Convolutional Neural Networks, Autoencoders, Deep Belief Networks, and Recurrent Neural Networks (RNN). RNN is one of the most popular architectures used in natural language processing (NLP) because the recurrent structure is appropriate for processing variable-length text. One of the deep learning methods proposed in this study is RNN with the application of the Long Short-Term Memory (LSTM) architecture. The models are trained based on trial and error experiments using LSTM and 300-dimensional words embedding features with Word2Vec. By tuning the parameters and comparing the eight proposed Long Short-Term Memory (LSTM) models with a large-scale dataset, to show that LSTM with features Word2Vec can achieve good performance in text classification. The results show that text classification using LSTM with Word2Vec obtain the highest accuracy is in the fifth model with 95.38, the average of precision, recall, and F1-score is 95. Also, LSTM with the Word2Vec feature gets graphic results that are close to good-fit on seventh and eighth models.

Download Full-text

Chinese Text Classification Model Based on Deep Learning

Future Internet ◽

10.3390/fi10110113 ◽

2018 ◽

Vol 10 (11) ◽

pp. 113 ◽

Cited By ~ 17

Author(s):

Yue Li ◽

Xutao Wang ◽

Pengjian Xu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Language Processing ◽

Chinese Text ◽

Text Classification ◽

Short Term Memory ◽

Classification Model ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Text classification is of importance in natural language processing, as the massive text information containing huge amounts of value needs to be classified into different categories for further use. In order to better classify text, our paper tries to build a deep learning model which achieves better classification results in Chinese text than those of other researchers’ models. After comparing different methods, long short-term memory (LSTM) and convolutional neural network (CNN) methods were selected as deep learning methods to classify Chinese text. LSTM is a special kind of recurrent neural network (RNN), which is capable of processing serialized information through its recurrent structure. By contrast, CNN has shown its ability to extract features from visual imagery. Therefore, two layers of LSTM and one layer of CNN were integrated to our new model: the BLSTM-C model (BLSTM stands for bi-directional long short-term memory while C stands for CNN.) LSTM was responsible for obtaining a sequence output based on past and future contexts, which was then input to the convolutional layer for extracting features. In our experiments, the proposed BLSTM-C model was evaluated in several ways. In the results, the model exhibited remarkable performance in text classification, especially in Chinese texts.

Download Full-text

Application of deep learning methods to predict ionosphere parameters in real time

E3S Web of Conferences ◽

10.1051/e3sconf/202019602007 ◽

2020 ◽

Vol 196 ◽

pp. 02007

Author(s):

Vladimir Mochalov ◽

Anastasia Mochalova

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Architecture ◽

Short Term Memory ◽

Neural Network Architecture ◽

Short Term ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory

In this paper, the previously obtained results on recognition of ionograms using deep learning are expanded to predict the parameters of the ionosphere. After the ionospheric parameters have been identified on the ionogram using deep learning in real time, we can predict the parameters for some time ahead on the basis of the new data obtained Examples of predicting the ionosphere parameters using an artificial recurrent neural network architecture long short-term memory are given. The place of the block for predicting the parameters of the ionosphere in the system for analyzing ionospheric data using deep learning methods is shown.

Download Full-text

A Blind Spectrum Sensing Method Based on Deep Learning

Sensors ◽

10.3390/s19102270 ◽

2019 ◽

Vol 19 (10) ◽

pp. 2270 ◽

Cited By ~ 6

Author(s):

Kai Yang ◽

Zhitao Huang ◽

Xiang Wang ◽

Xueqiong Li

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Spectrum Sensing ◽

Short Term Memory ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory ◽

Noise Ratio

Spectrum sensing is one of the technologies that is used to solve the current problem of low utilization of spectrum resources. However, when the signal-to-noise ratio is low, current spectrum sensing methods cannot well-handle a situation in which the prior information of the licensed user signal is lacking. In this paper, a blind spectrum sensing method based on deep learning is proposed that uses three kinds of neural networks together, namely convolutional neural networks, long short-term memory, and fully connected neural networks. Experiments show that the proposed method has better performance than an energy detector, especially when the signal-to-noise ratio is low. At the same time, this paper also analyzes the effect of different long short-term memory layers on detection performance, and explores why the deep-learning-based detector can achieve better performance.

Download Full-text

Descriptor Free QSAR Modeling Using Deep Learning With Long Short-Term Memory Neural Networks

Frontiers in Artificial Intelligence ◽

10.3389/frai.2019.00017 ◽

2019 ◽

Vol 2 ◽

Cited By ~ 8

Author(s):

Suman K. Chakravarti ◽

Sai Radha Mani Alla

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Short Term Memory ◽

Short Term ◽

Qsar Modeling ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

A Hybrid Deep Learning Model to Forecast Particulate Matter Concentration Levels in Seoul, South Korea

Atmosphere ◽

10.3390/atmos11040348 ◽

2020 ◽

Vol 11 (4) ◽

pp. 348 ◽

Cited By ~ 3

Author(s):

Guang Yang ◽

HwaMin Lee ◽

Giyeol Lee

Keyword(s):

Air Pollution ◽

Deep Learning ◽

Short Term Memory ◽

Hybrid Models ◽

Short Term ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory ◽

Gated Recurrent Unit ◽

Concentration Levels

Both long- and short-term exposure to high concentrations of airborne particulate matter (PM) severely affect human health. Many countries now regulate PM concentrations. Early-warning systems based on PM concentration levels are urgently required to allow countermeasures to reduce harm and loss. Previous studies sought to establish accurate, efficient predictive models. Many machine-learning methods are used for air pollution forecasting. The long short-term memory and gated recurrent unit methods, typical deep-learning methods, reliably predict PM levels with some limitations. In this paper, the authors proposed novel hybrid models to combine the strength of two types of deep learning methods. Moreover, the authors compare hybrid deep-learning methods (convolutional neural network (CNN)—long short-term memory (LSTM) and CNN—gated recurrent unit (GRU)) with several stand-alone methods (LSTM, GRU) in terms of predicting PM concentrations in 39 stations in Seoul. Hourly air pollution data and meteorological data from January 2015 to December 2018 was used for these training models. The results of the experiment confirmed that the proposed prediction model could predict the PM concentrations for the next 7 days. Hybrid models outperformed single models in five areas selected randomly with the lowest root mean square error (RMSE) and mean absolute error (MAE) values for both PM10 and PM2.5. The error rate for PM10 prediction in Gangnam with RMSE is 1.688, and MAE is 1.161. For hybrid models, the CNN–GRU better-predicted PM10 for all stations selected, while the CNN–LSTM model performed better on predicting PM2.5.

Download Full-text

A Spectral-Spatial Cascaded 3D Convolutional Neural Network with a Convolutional Long Short-Term Memory Network for Hyperspectral Image Classification

Remote Sensing ◽

10.3390/rs11202363 ◽

2019 ◽

Vol 11 (20) ◽

pp. 2363 ◽

Cited By ~ 2

Author(s):

Wenchao Qi ◽

Xia Zhang ◽

Nan Wang ◽

Mao Zhang ◽

Yi Cen

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Hyperspectral Image ◽

Short Term ◽

Dynamic Learning ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory

Deep learning methods used for hyperspectral image (HSI) classification often achieve greater accuracy than traditional algorithms but require large numbers of training epochs. To simplify model structures and reduce their training epochs, an end-to-end deep learning framework incorporating a spectral-spatial cascaded 3D convolutional neural network (CNN) with a convolutional long short-term memory (CLSTM) network, called SSCC, is proposed herein for HSI classification. The SSCC framework employs cascaded 3D CNN to learn the spectral-spatial features of HSIs and uses the CLSTM network to extract sequence features. Residual connections are used in SSCC to accelerate model convergence, with the outputs of previous convolutional layers concatenated as inputs for subsequent layers. Moreover, the data augmentation, parametric rectified linear unit, dynamic learning rate, batch normalization, and regularization (including dropout and L2) methods are used to increase classification accuracy and prevent overfitting. These attributes allow the SSCC framework to achieve good performance for HSI classification within 20 epochs. Three well-known datasets including Indiana Pines, University of Pavia, and Pavia Center were employed to evaluate the classification performance of the proposed algorithm. The GF-5 dataset of Anxin County, obtained from China’s recently launched spaceborne Advanced Hyperspectral Imager, was also used for classification experiments. The experimental results demonstrate that the proposed SSCC framework achieves state-of-the-art performance with better training efficiency than other deep learning methods.

Download Full-text

Structured Sparsification of Gated Recurrent Neural Networks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5938 ◽

2020 ◽

Vol 34 (04) ◽

pp. 4989-4996

Author(s):

Ekaterina Lobacheva ◽

Nadezhda Chirkova ◽

Alexander Markovich ◽

Dmitry Vetrov

Keyword(s):

Neural Network ◽

Neural Networks ◽

Text Classification ◽

Recurrent Neural Networks ◽

Short Term Memory ◽

Language Modeling ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory ◽

Network Compression

One of the most popular approaches for neural network compression is sparsification — learning sparse weight matrices. In structured sparsification, weights are set to zero by groups corresponding to structure units, e. g. neurons. We further develop the structured sparsification approach for the gated recurrent neural networks, e. g. Long Short-Term Memory (LSTM). Specifically, in addition to the sparsification of individual weights and neurons, we propose sparsifying the preactivations of gates. This makes some gates constant and simplifies an LSTM structure. We test our approach on the text classification and language modeling tasks. Our method improves the neuron-wise compression of the model in most of the tasks. We also observe that the resulting structure of gate sparsity depends on the task and connect the learned structures to the specifics of the particular tasks.

Download Full-text

Deep Graph-Long Short-Term Memory: A Deep Learning Based Approach for Text Classification

Wireless Personal Communications ◽

10.1007/s11277-021-08331-4 ◽

2021 ◽

Author(s):

Varsha Mittal ◽

Duraprasad Gangodkar ◽

Bhaskar Pant

Keyword(s):

Deep Learning ◽

Text Classification ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Vietnamese Text Classification Algorithm using Long Short Term Memory and Word2Vec

Informatics and Automation - Информатика и автоматизация ◽

10.15622/ia.2020.19.6.5 ◽

2020 ◽

Vol 19 (6) ◽

pp. 1255-1279

Author(s):

Huu Nguyen Phat ◽

Nguyen Thi Minh Anh

Keyword(s):

Deep Learning ◽

Language Processing ◽

Text Classification ◽

Industrial Revolution ◽

Short Term Memory ◽

Text Processing ◽

Research Effort ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

In the context of the ongoing forth industrial revolution and fast computer science development the amount of textual information becomes huge. So, prior to applying the seemingly appropriate methodologies and techniques to the above data processing their nature and characteristics should be thoroughly analyzed and understood. At that, automatic text processing incorporated in the existing systems may facilitate many procedures. So far, text classiﬁcation is one of the basic applications to natural language processing accounting for such factors as emotions’ analysis, subject labeling etc. In particular, the existing advancements in deep learning networks demonstrate that the proposed methods may fit the documents’ classifying, since they possess certain extra efficiency; for instance, they appeared to be eﬀective for classifying texts in English. The thorough study revealed that practically no research effort was put into an expertise of the documents in Vietnamese language. In the scope of our study, there is not much research for documents in Vietnamese. The development of deep learning models for document classiﬁcation has demonstrated certain improvements for texts in Vietnamese. Therefore, the use of long short term memory network with Word2vec is proposed to classify text that improves both performance and accuracy. The here developed approach when compared with other traditional methods demonstrated somewhat better results at classifying texts in Vietnamese language. The evaluation made over datasets in Vietnamese shows an accuracy of over 90%; also the proposed approach looks quite promising for real applications.

Download Full-text

Investigating the Feasibility of Deep Learning Methods for Urdu Word Sense Disambiguation

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3477578 ◽

2022 ◽

Vol 21 (2) ◽

pp. 1-16

Author(s):

Ali Saeed ◽

Rao Muhammad Adeel Nawab ◽

Mark Stevenson

Keyword(s):

Deep Learning ◽

Language Processing ◽

Short Term Memory ◽

Word Sense Disambiguation ◽

Word Sense ◽

Short Term ◽

Learning Methods ◽

Term Memory ◽

Sense Disambiguation ◽

Long Short Term Memory

Word Sense Disambiguation (WSD), the process of automatically identifying the correct meaning of a word used in a given context, is a significant challenge in Natural Language Processing. A range of approaches to the problem has been explored by the research community. The majority of these efforts has focused on a relatively small set of languages, particularly English. Research on WSD for South Asian languages, particularly Urdu, is still in its infancy. In recent years, deep learning methods have proved to be extremely successful for a range of Natural Language Processing tasks. The main aim of this study is to apply, evaluate, and compare a range of deep learning methods approaches to Urdu WSD (both Lexical Sample and All-Words) including Simple Recurrent Neural Networks, Long-Short Term Memory, Gated Recurrent Units, Bidirectional Long-Short Term Memory, and Ensemble Learning. The evaluation was carried out on two benchmark corpora: (1) the ULS-WSD-18 corpus and (2) the UAW-WSD-18 corpus. Results (Accuracy = 63.25% and F1-Measure = 0.49) show that a deep learning approach outperforms previously reported results for the Urdu All-Words WSD task, whereas performance using deep learning approaches (Accuracy = 72.63% and F1-Measure = 0.60) are low in comparison to previously reported for the Urdu Lexical Sample task.

Download Full-text