Bi-LSTM and Ensemble based Bilingual Sentiment Analysis for a Code-mixed Hindi-English Social Media Text

In this digitized world, the Internet has become a prominent source of many kinds of information. Today, many people prefer virtual interaction to one-to-one communication. The majority of the population uses social networking sites to express themselves through posts, blogs, comments, likes, and dislikes. Their sentiments can be traced using opinion mining, also called sentiment analysis. Sentiment analysis of social media text is a useful technique for identifying people's positive, negative, or neutral emotions, sentiments, and opinions, and it has received special attention from researchers over the last few years. Traditionally, machine learning (ML) algorithms such as Naive Bayes and Support Vector Machines were used to implement it. To overcome the drawbacks of ML for complex classification, deep learning-based algorithms such as CNN, RNN, and HNN have been introduced. In this paper, we study different deep learning algorithms and propose a deep learning-based model to analyze the behavior of an individual using social media text. The results given by the proposed model can be utilized in a range of fields such as business, education, industry, politics, psychology, and security.
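The abstract does not say how the ensemble named in the title combines its members. As a minimal sketch of one common option, soft voting, the following averages per-class probabilities from several classifiers over the three sentiment classes mentioned above. The function name `soft_vote`, the label names, and the example probabilities are all hypothetical and are not taken from the paper.

```python
# Hypothetical label set matching the positive/negative/neutral classes in the abstract.
LABELS = ("positive", "negative", "neutral")


def soft_vote(prob_dists, weights=None):
    """Average per-model class probabilities (optionally weighted).

    prob_dists: list of dicts mapping label -> probability, one per model.
    Returns (winning_label, averaged_distribution).
    """
    if weights is None:
        weights = [1.0] * len(prob_dists)
    total = sum(weights)
    avg = {label: 0.0 for label in LABELS}
    for dist, weight in zip(prob_dists, weights):
        for label in LABELS:
            avg[label] += weight * dist.get(label, 0.0) / total
    winner = max(LABELS, key=lambda label: avg[label])
    return winner, avg


# Made-up outputs of three hypothetical models for one post.
m1 = {"positive": 0.6, "negative": 0.3, "neutral": 0.1}
m2 = {"positive": 0.2, "negative": 0.5, "neutral": 0.3}
m3 = {"positive": 0.5, "negative": 0.2, "neutral": 0.3}

print(soft_vote([m1, m2, m3])[0])                     # equal weights
print(soft_vote([m1, m2, m3], weights=[1, 3, 1])[0])  # trust the second model more
```

With equal weights the averaged distribution favours one class; giving the second model more weight can change the decision, which is why ensemble weights are usually tuned on held-out data.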

NITS-Hinglish-SentiMix at SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text Using an Ensemble Model

10.18653/v1/2020.semeval-1.175 ◽

2020 ◽

Author(s):

Subhra Jyoti Baroi ◽

Nivedita Singh ◽

Ringki Das ◽

Thoudam Doren Singh

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Ensemble Model ◽

Social Media Text

An Enhancement of Malay Social Media Text Normalization for Lexicon-Based Sentiment Analysis

2019 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp48816.2019.9037700 ◽

2019 ◽

Author(s):

Muhammad Fakhrur Razi Abu Bakar ◽

Norisma Idris ◽

Liyana Shuib

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Social Media Text ◽

Text Normalization

Sentiment Analysis on Hindi–English Code-Mixed Social Media Text

Innovations in Computer Science and Engineering - Lecture Notes in Networks and Systems ◽

10.1007/978-981-33-4543-0_65 ◽

2021 ◽

pp. 615-622

Author(s):

T. Tulasi Sasidhar ◽

B. Premjith ◽

K. Sreelakshmi ◽

K. P. Soman

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Social Media Text

Sentiment analysis of Social Media Text-Emoticon Post with Machine learning Models Contribution Title

Journal of Physics Conference Series ◽

10.1088/1742-6596/2070/1/012079 ◽

2021 ◽

Vol 2070 (1) ◽

pp. 012079

Author(s):

V Jagadishwari ◽

A Indulekha ◽

Kiran Raghu ◽

P Harshini

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Online Social Networks ◽

Data Sets ◽

Learning Models ◽

Twitter Data ◽

The Social ◽

Social Media Text ◽

Machine Learning Models

Abstract: Social media has recently become an arena for people to share their perspectives on a variety of topics, and most social interactions now take place through it. Although online social networks allow users to express their views and opinions in many forms, such as audio, video, and text, the most popular forms of expression are text, emoticons, and emojis. The work presented in this paper aims to detect the sentiments expressed in social media posts. Four machine learning models, namely Bernoulli Bayes, Multinomial Bayes, Regression, and SVM, were implemented. All of these models were trained and tested with Twitter data sets. Users on Twitter express their opinions in the form of tweets with a limited number of characters. Tweets also contain emoticons and emojis; therefore, Twitter data sets are best suited for sentiment analysis. The effect of the emoticons present in a tweet is also analyzed: the models are first trained with the text only and then with both the text and the emoticons in the tweet. The performance of all four models in both cases is tested, and the results are presented in the paper.
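The two training conditions described above (text only versus text plus emoticons) can be illustrated with a small multinomial Naive Bayes classifier written from scratch. This is a sketch under stated assumptions, not the authors' implementation: the emoticon pattern, the four toy tweets, and the function names `tokenize`, `train`, and `predict` are hypothetical, and add-one (Laplace) smoothing is an assumed choice.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical pattern covering only a few ASCII emoticons such as :) :( :D ;-)
EMOTICON = re.compile(r"[:;=][-']?[()DPp]")
WORD = re.compile(r"[a-z']+")


def tokenize(text, keep_emoticons):
    """Return lowercased word tokens, plus emoticon tokens if requested."""
    emoticons = EMOTICON.findall(text)
    words = WORD.findall(EMOTICON.sub(" ", text).lower())
    return words + emoticons if keep_emoticons else words


def train(docs, keep_emoticons):
    """Collect class counts, per-class token counts, and the vocabulary."""
    class_counts = Counter()
    token_counts = defaultdict(Counter)
    vocab = set()
    for text, label in docs:
        class_counts[label] += 1
        for tok in tokenize(text, keep_emoticons):
            token_counts[label][tok] += 1
            vocab.add(tok)
    return class_counts, token_counts, vocab


def predict(model, text, keep_emoticons):
    """Pick the label with the highest log-posterior (add-one smoothing)."""
    class_counts, token_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total_docs)
        denom = sum(token_counts[label].values()) + len(vocab)
        for tok in tokenize(text, keep_emoticons):
            if tok in vocab:  # tokens never seen in training are ignored
                score += math.log((token_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Made-up toy tweets, for illustration only.
TRAIN = [
    ("great match today :)", "pos"),
    ("love this phone :D", "pos"),
    ("worst service ever :(", "neg"),
    ("hate the delay :(", "neg"),
]

tweet = "phone :("
print(predict(train(TRAIN, False), tweet, False))  # text-only model
print(predict(train(TRAIN, True), tweet, True))    # text + emoticon model
```

On this toy data the text-only model sees just the word "phone", while the second model can also use the emoticon as evidence, so the two conditions can disagree on the same tweet.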

IUST at SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text Using Deep Neural Networks and Linear Baselines

10.18653/v1/2020.semeval-1.170 ◽

2020 ◽

Author(s):

Soroush Javdan ◽

Taha Shangipour ataei ◽

Behrouz Minaei-Bidgoli

Keyword(s):

Neural Networks ◽

Social Media ◽

Sentiment Analysis ◽

Deep Neural Networks ◽

Social Media Text

Classification of Code-Mixed Bilingual Phonetic Text Using Sentiment Analysis

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2021040104 ◽

2021 ◽

Vol 17 (2) ◽

pp. 59-78

Author(s):

Shailendra Kumar Singh ◽

Manoj Kumar Sachan

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Social Networking Sites ◽

Large Scale ◽

English Language ◽

Product Reviews ◽

Internet Users ◽

Social Media Text ◽

Analysis System

The rapid growth of internet facilities has greatly increased the volume of comments, posts, blogs, feedback, etc., on social networking sites. These social media data are available in unstructured form, which includes images, text, and videos. Processing these data is difficult, but sentiment analysis, information retrieval, and recommender systems are used to process such unstructured data. To extract the opinions and sentiments of internet users from their written social media text, a sentiment analysis system needs to be developed that can work on both monolingual and bilingual phonetic text. Therefore, a sentiment analysis (SA) system is developed that performs well on datasets from different domains. The system's performance is tested on four different datasets, and it improves accuracy over state-of-the-art techniques by 3% on social media datasets, 1.5% on movie reviews, 1.35% on Amazon product reviews, and 4.56% on large Amazon product reviews. In addition, a stemmer (StemVerb) for English verbs is proposed, which improves the SA system's performance.

Sentiment Analysis of Social Media Text Data using Back Propagation in Artificial Neural Networks

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2018.1162 ◽

2018 ◽

Vol 6 ◽

pp. 1071-1077 ◽

Cited By ~ 1

Author(s):

Neha Sharma

Keyword(s):

Neural Networks ◽

Social Media ◽

Artificial Neural Networks ◽

Sentiment Analysis ◽

Back Propagation ◽

Text Data ◽

Social Media Text ◽

Artificial Neural

Deep Learning Based Sentiment Analysis in a Code-Mixed English-Hindi and English-Bengali Social Media Corpus

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213020500141 ◽

2020 ◽

Vol 29 (05) ◽

pp. 2050014

Author(s):

Anupam Jamatia ◽

Steve Durairaj Swamy ◽

Björn Gambäck ◽

Amitava Das ◽

Swapan Debbarma

Keyword(s):

Social Media ◽

Deep Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Mixed Data ◽

Code Mixing ◽

Language Representation ◽

The Social ◽

Social Media Text ◽

Traditional Approaches

Sentiment analysis is a circumstantial analysis of text, identifying the social sentiment to better understand the source material. The article addresses sentiment analysis of an English-Hindi and English-Bengali code-mixed textual corpus collected from social media. Code-mixing is an amalgamation of multiple languages, which was previously associated mainly with spoken language. However, social media users also use it to communicate in ways that tend to be somewhat casual. The coarse nature of social media text poses challenges for many language processing applications. Here, the focus is on the low predictive performance of traditional machine learners compared with their deep learning counterparts, including the contextual language representation model BERT (Bidirectional Encoder Representations from Transformers), on the task of extracting user sentiment from code-mixed texts. Three deep learners (a BiLSTM CNN, a Double BiLSTM, and an Attention-based model) attained accuracy 20–60% greater than traditional approaches on code-mixed data and, for comparison, were also tested on monolingual English data.
