A Comparison of Pre-trained Word Embeddings for Sentiment Analysis Using Deep Learning

Sentiment analysis is one of the new absorbing parts appeared in natural language processing with the emergence of community sites on the web. Taking advantage of the amount of information now available, research and industry have been seeking ways to automatically analyze the sentiments expressed in texts. The challenge for this task is the human language ambiguity, and also the lack of labeled data. In order to solve this issue, sentiment analysis and deep learning have been merged as deep learning models are effective due to their automatic learning capability. In this paper, we provide a comparative study on IMDB movie review dataset, we compare word embeddings and further deep learning models on sentiment analysis and give broad empirical outcomes for those keen on taking advantage of deep learning for sentiment analysis in real-world settings.

Download Full-text

A Multi-Layer Dual Attention Deep Learning Model With Refined Word Embeddings for Aspect-Based Sentiment Analysis

IEEE Access ◽

10.1109/access.2019.2927281 ◽

2019 ◽

Vol 7 ◽

pp. 114795-114807 ◽

Cited By ~ 8

Author(s):

Syeda Rida-E-Fatima ◽

Ali Javed ◽

Ameen Banjar ◽

Aun Irtaza ◽

Hassan Dawood ◽

...

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Learning Model ◽

Word Embeddings ◽

Deep Learning Model

Download Full-text

Word Embeddings and Deep Learning for Spanish Twitter Sentiment Analysis

Information Management and Big Data - Communications in Computer and Information Science ◽

10.1007/978-3-030-11680-4_4 ◽

2019 ◽

pp. 19-31

Author(s):

José Ochoa-Luna ◽

Disraeli Ari

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Word Embeddings

Download Full-text

Sentiment classification and aspect-based sentiment analysis on yelp reviews using deep learning and word embeddings

Journal of Decision System ◽

10.1080/12460125.2020.1864106 ◽

2021 ◽

pp. 1-23

Author(s):

Eman Saeed Alamoudi ◽

Norah Saleh Alghamdi

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Sentiment Classification ◽

Word Embeddings

Download Full-text

An Assessment of Deep Learning Models and Word Embeddings for Toxicity Detection within Online Textual Comments

Electronics ◽

10.3390/electronics10070779 ◽

2021 ◽

Vol 10 (7) ◽

pp. 779

Author(s):

Danilo Dessì ◽

Diego Reforgiato Recupero ◽

Harald Sack

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Short Term Memory ◽

Online Communication ◽

Good Choice ◽

Learning Technologies ◽

Word Embeddings ◽

Emotion Detection ◽

Digital Platforms

Today, increasing numbers of people are interacting online and a lot of textual comments are being produced due to the explosion of online communication. However, a paramount inconvenience within online environments is that comments that are shared within digital platforms can hide hazards, such as fake news, insults, harassment, and, more in general, comments that may hurt someone’s feelings. In this scenario, the detection of this kind of toxicity has an important role to moderate online communication. Deep learning technologies have recently delivered impressive performance within Natural Language Processing applications encompassing Sentiment Analysis and emotion detection across numerous datasets. Such models do not need any pre-defined hand-picked features, but they learn sophisticated features from the input datasets by themselves. In such a domain, word embeddings have been widely used as a way of representing words in Sentiment Analysis tasks, proving to be very effective. Therefore, in this paper, we investigated the use of deep learning and word embeddings to detect six different types of toxicity within online comments. In doing so, the most suitable deep learning layers and state-of-the-art word embeddings for identifying toxicity are evaluated. The results suggest that Long-Short Term Memory layers in combination with mimicked word embeddings are a good choice for this task.

Download Full-text

Deep Learning Adaptation with Word Embeddings for Sentiment Analysis on Online Course Reviews

Algorithms for Intelligent Systems - Deep Learning-Based Approaches for Sentiment Analysis ◽

10.1007/978-981-15-1216-2_3 ◽

2020 ◽

pp. 57-83 ◽

Cited By ~ 1

Author(s):

Danilo Dessí ◽

Mauro Dragoni ◽

Gianni Fenu ◽

Mirko Marras ◽

Diego Reforgiato Recupero

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Online Course ◽

Word Embeddings

Download Full-text

Deep Learning for text in limted data settings

10.36227/techrxiv.12100692 ◽

2020 ◽

Author(s):

Pathikkumar Patel ◽

Bhargav Lad ◽

Jinan Fiaidhi

Keyword(s):

Machine Learning ◽

Time Series ◽

Deep Learning ◽

Sentiment Analysis ◽

Transfer Learning ◽

Text Classification ◽

State Of The Art ◽

Time Series Forecasting ◽

Text Data ◽

Performance Levels

During the last few years, RNN models have been extensively used and they have proven to be better for sequence and text data. RNNs have achieved state-of-the-art performance levels in several applications such as text classification, sequence to sequence modelling and time series forecasting. In this article we will review different Machine Learning and Deep Learning based approaches for text data and look at the results obtained from these methods. This work also explores the use of transfer learning in NLP and how it affects the performance of models on a specific application of sentiment analysis.

Download Full-text

Improving Sentiment Analysis using Hybrid Deep Learning Model

Recent Advances in Computer Science and Communications ◽

10.2174/2213275912666190328200012 ◽

2020 ◽

Vol 13 (4) ◽

pp. 627-640 ◽

Cited By ~ 1

Author(s):

Avinash Chandra Pandey ◽

Dharmveer Singh Rajpoot

Keyword(s):

Neural Network ◽

Deep Learning ◽

Sentiment Analysis ◽

Classification Accuracy ◽

Short Term Memory ◽

Computational Cost ◽

Extraction Process ◽

Learning Model ◽

Sentiment Classification ◽

Deep Learning Model

Background: Sentiment analysis is a contextual mining of text which determines viewpoint of users with respect to some sentimental topics commonly present at social networking websites. Twitter is one of the social sites where people express their opinion about any topic in the form of tweets. These tweets can be examined using various sentiment classification methods to find the opinion of users. Traditional sentiment analysis methods use manually extracted features for opinion classification. The manual feature extraction process is a complicated task since it requires predefined sentiment lexicons. On the other hand, deep learning methods automatically extract relevant features from data hence; they provide better performance and richer representation competency than the traditional methods. Objective: The main aim of this paper is to enhance the sentiment classification accuracy and to reduce the computational cost. Method: To achieve the objective, a hybrid deep learning model, based on convolution neural network and bi-directional long-short term memory neural network has been introduced. Results: The proposed sentiment classification method achieves the highest accuracy for the most of the datasets. Further, from the statistical analysis efficacy of the proposed method has been validated. Conclusion: Sentiment classification accuracy can be improved by creating veracious hybrid models. Moreover, performance can also be enhanced by tuning the hyper parameters of deep leaning models.

Download Full-text

Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media

Journal Of Big Data ◽

10.1186/s40537-021-00488-w ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Yahya Albalawi ◽

Jim Buckley ◽

Nikola S. Nikolov

Keyword(s):

Social Media ◽

Deep Learning ◽

Comprehensive Evaluation ◽

Classification Problem ◽

Data Sets ◽

Word Embeddings ◽

Data Set ◽

Lower Accuracy ◽

Health Related ◽

The Impact

AbstractThis paper presents a comprehensive evaluation of data pre-processing and word embedding techniques in the context of Arabic document classification in the domain of health-related communication on social media. We evaluate 26 text pre-processings applied to Arabic tweets within the process of training a classifier to identify health-related tweets. For this task we use the (traditional) machine learning classifiers KNN, SVM, Multinomial NB and Logistic Regression. Furthermore, we report experimental results with the deep learning architectures BLSTM and CNN for the same text classification problem. Since word embeddings are more typically used as the input layer in deep networks, in the deep learning experiments we evaluate several state-of-the-art pre-trained word embeddings with the same text pre-processing applied. To achieve these goals, we use two data sets: one for both training and testing, and another for testing the generality of our models only. Our results point to the conclusion that only four out of the 26 pre-processings improve the classification accuracy significantly. For the first data set of Arabic tweets, we found that Mazajak CBOW pre-trained word embeddings as the input to a BLSTM deep network led to the most accurate classifier with F1 score of 89.7%. For the second data set, Mazajak Skip-Gram pre-trained word embeddings as the input to BLSTM led to the most accurate model with F1 score of 75.2% and accuracy of 90.7% compared to F1 score of 90.8% achieved by Mazajak CBOW for the same architecture but with lower accuracy of 70.89%. Our results also show that the performance of the best of the traditional classifier we trained is comparable to the deep learning methods on the first dataset, but significantly worse on the second dataset.

Download Full-text

An Evaluation of Neural Machine Translation and Pre-trained Word Embeddings in Multilingual Neural Sentiment Analysis

2020 IEEE International Conference on Progress in Informatics and Computing (PIC) ◽

10.1109/pic50277.2020.9350849 ◽

2020 ◽

Author(s):

George Manias ◽

Argyro Mavrogiorgou ◽

Athanasios Kiourtis ◽

Dimosthenis Kyriazis

Keyword(s):

Sentiment Analysis ◽

Machine Translation ◽

Word Embeddings ◽

Neural Machine Translation

Download Full-text