Sentiment Analysis in Social Media using Machine Learning Techniques

Over the last period, social media achieved a widespread use worldwide where the statistics indicate that more than three billion people are on social media, leading to large quantities of data online. To analyze these large quantities of data, a special classification method known as sentiment analysis, is used. This paper presents a new sentiment analysis system based on machine learning techniques, which aims to create a process to extract the polarity from social media texts. By using machine learning techniques, sentiment analysis achieved a great success around the world. This paper investigates this topic and proposes a sentiment analysis system built on Bayesian Rough Decision Tree (BRDT) algorithm. The experimental results show the success of this system where the accuracy of the system is more than 95% on social media data.

Download Full-text

Social media data analysis to predict mental state of users using machine learning techniques

Journal of Education and Health Promotion ◽

10.4103/jehp.jehp_446_20 ◽

2021 ◽

Vol 10 (1) ◽

pp. 301

Author(s):

R Lokeshkumar ◽

OmAshish Mishra ◽

Shivam Kalra

Keyword(s):

Machine Learning ◽

Social Media ◽

Data Analysis ◽

Mental State ◽

Machine Learning Techniques ◽

Social Media Data ◽

Learning Techniques ◽

Media Data

Download Full-text

A sentiment analysis system for social media using machine learning techniques: Social enablement

Digital Scholarship in the Humanities ◽

10.1093/llc/fqy037 ◽

2018 ◽

Vol 34 (3) ◽

pp. 569-581 ◽

Cited By ~ 1

Author(s):

Sujata Rani ◽

Parteek Kumar

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Media Analysis ◽

Training Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Analysis Tool ◽

Data Set ◽

Learning Techniques

Abstract In this article, an innovative approach to perform the sentiment analysis (SA) has been presented. The proposed system handles the issues of Romanized or abbreviated text and spelling variations in the text to perform the sentiment analysis. The training data set of 3,000 movie reviews and tweets has been manually labeled by native speakers of Hindi in three classes, i.e. positive, negative, and neutral. The system uses WEKA (Waikato Environment for Knowledge Analysis) tool to convert these string data into numerical matrices and applies three machine learning techniques, i.e. Naive Bayes (NB), J48, and support vector machine (SVM). The proposed system has been tested on 100 movie reviews and tweets, and it has been observed that SVM has performed best in comparison to other classifiers, and it has an accuracy of 68% for movie reviews and 82% in case of tweets. The results of the proposed system are very promising and can be used in emerging applications like SA of product reviews and social media analysis. Additionally, the proposed system can be used in other cultural/social benefits like predicting/fighting human riots.

Download Full-text

Communication Sentiment Analyzer using Machine Learning with Naive Bayes Bernoullinb

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a1610.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 5976-5979

Keyword(s):

Machine Learning ◽

Social Media ◽

Major Part ◽

Naive Bayes ◽

Naïve Bayes ◽

User Preferences ◽

Social Media Data ◽

Machine Learning Model ◽

The World ◽

Media Data

In this never-ending social media era it is estimated that over 5 billion people use smartphones. Out of these, there are over 1.5 billion active users in the world. In which we all are a major part and before opening our messages we all are curious about what message we have received. No doubt, we all always hope for a good message to be received. So Sentiment analysis on social media data has been seen by many as an effective tool to monitor user preferences and inclination. Finally, we propose a scalable machine learning model to analyze the polarity of a communicative text using Naive Bayes’ Bernoulli classifier. This paper works on only two polarities that is whether the sentence is positive or negative. Bernoulli classifier is used in this paper because it is best suited for binary inputs which in turn enhances the accuracy of up to 97%.

Download Full-text

A REVIEW ON SENTIMENT ANALYSIS OF SOCIAL MEDIA DATA USING TEXT MINING AND MACHINE LEARNING.

International Journal of Advanced Research ◽

10.21474/ijar01/526 ◽

2016 ◽

Vol 4 (5) ◽

pp. 772-775

Author(s):

GURPREET KAUR ◽

◽

MANOJ KUMAR ◽

Keyword(s):

Machine Learning ◽

Social Media ◽

Text Mining ◽

Sentiment Analysis ◽

Social Media Data ◽

Media Data

Download Full-text

A Comprehensive Analysis of Approaches for Sentiment Analysis Using Twitter Data on COVID-19 Vaccines

Journal of Informatics Electrical and Electronics Engineering (JIEEE) ◽

10.54060/jieee/002.02.009 ◽

2021 ◽

Vol 2 (2) ◽

pp. 1-10

Author(s):

Amrita Mishra ◽

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Text Classification ◽

Comprehensive Analysis ◽

Social Media Data ◽

The Public ◽

Opinion Analysis ◽

Twitter Data ◽

Media Data

Sentiment Analysis has paved routes for opinion analysis of masses over unrestricted territorial limits. With the advent and growth of social media like Twitter, Facebook, WhatsApp, Snapchat in today’s world, stakeholders and the public often takes to expressing their opinion on them and drawing conclusions. While these social media data are extremely informative and well connected, the major challenge lies in incorporating efficient Text Classification strategies which not only overcomes the unstructured and humongous nature of data but also generates correct polarity of opinions (i.e. positive, negative, and neutral). This paper is a thorough effort to provide a brief study about various approaches to SA including Machine Learning, Lexicon Based, and Automatic Approaches. The paper also highlights the comparison of positive, negative, and neutral tweets of the Sputnik V, Moderna, and Covaxin vaccines used for preventive and emergency use of COVID-19 disease.

Download Full-text

A Comprehensive Review on Online News Popularity Prediction using Machine Learning Approach

SMART MOVES JOURNAL IJOSCIENCE ◽

10.24113/ijoscience.v5i1.181 ◽

2019 ◽

Vol 5 (1) ◽

pp. 7

Author(s):

Priyanka Rathord ◽

Dr. Anurag Jain ◽

Chetan Agrawal

Keyword(s):

Machine Learning ◽

Social Media ◽

Online News ◽

Machine Learning Techniques ◽

News Article ◽

Learning Approach ◽

Learning Techniques ◽

Machine Learning Approach ◽

The World ◽

Popularity Prediction

With the help of Internet, the online news can be instantly spread around the world. Most of peoples now have the habit of reading and sharing news online, for instance, using social media like Twitter and Facebook. Typically, the news popularity can be indicated by the number of reads, likes or shares. For the online news stake holders such as content providers or advertisers, it’s very valuable if the popularity of the news articles can be accurately predicted prior to the publication. Thus, it is interesting and meaningful to use the machine learning techniques to predict the popularity of online news articles. Various works have been done in prediction of online news popularity. Popularity of news depends upon various features like sharing of online news on social media, comments of visitors for news, likes for news articles etc. It is necessary to know what makes one online news article more popular than another article. Unpopular articles need to get optimize for further popularity. In this paper, different methodologies are analyzed which predict the popularity of online news articles. These methodologies are compared, their parameters are considered and improvements are suggested. The proposed methodology describes online news popularity predicting system.

Download Full-text

Sentiment Analysis of Tweets for Estimating Criticality and Security of Events

Improving the Safety and Efficiency of Emergency Services ◽

10.4018/978-1-7998-2535-7.ch013 ◽

2020 ◽

pp. 293-319 ◽

Cited By ~ 1

Author(s):

V. Subramaniyaswamy ◽

R. Logesh ◽

M. Abejith ◽

Sunil Umasankar ◽

A. Umamakeswari

Keyword(s):

Social Media ◽

Data Collection ◽

Sentiment Analysis ◽

Quantitative Result ◽

Law Enforcement Agencies ◽

Social Media Data ◽

Software Application ◽

The World ◽

Media Data ◽

Different Levels

Social Media has become one of the major industries in the world. It has been noted that almost three fourth of the world's population use social media. This has instigated many researches towards social media. One such useful application is the sentimental analysis of real time social media data for security purposes. The insights that are generated can be used by law enforcement agencies and for intelligence purposes. There are many types of analyses that have been done for security purposes. Here, the authors propose a comprehensive software application which will meticulously scrape data from Twitter and analyse them using the lexicon based analysis to look for possible threats. They propose a methodology to obtain a quantitative result called criticality to assess the level of threat for a public event. The results can be used to understand people's opinions and comments with regard to specific events. The proposed system combines this lexicon based sentimental analysis along with deep data collection and segregates the emotions into different levels to analyse the threat for an event.

Download Full-text

Social Media Data-Based Sentiment Analysis of Tourists’ Air Quality Perceptions

Sustainability ◽

10.3390/su11185070 ◽

2019 ◽

Vol 11 (18) ◽

pp. 5070 ◽

Cited By ~ 3

Author(s):

Yuguo Tao ◽

Feng Zhang ◽

Chunyun Shi ◽

Yun Chen

Keyword(s):

Machine Learning ◽

Social Media ◽

Content Analysis ◽

Air Quality ◽

Sentiment Analysis ◽

Social Media Data ◽

Sina Weibo ◽

Emotion Words ◽

Tourist Destinations ◽

Media Data

Analyzing tourists’ perceptions of air quality is of great significance to the study of tourist experience satisfaction and the image construction of tourism destinations. In this study, using the web crawler technique, we collected 27,500 comments regarding the air quality of 195 of China’s Class 5A tourist destinations posted by tourists on Sina Weibo from January 2011 to December 2017; these comments were then subjected to a content analysis using the Gooseeker, ROST CM (Content Mining System) and BosonNLP (Natural Language Processing) tools. Based on an analysis of the proportions of sentences with different emotional polarities with ROST EA (Emotion Analysis), we measured the sentiment value of texts using the artificial neural network (ANN) machine learning method implemented through a Chinese social media data-oriented Boson platform based on the Python programming language. The content analysis results indicated that in the adaption stage in Sina Weibo, tourists’ perceptions of air quality were mainly positive and had poor air pollution crisis awareness. Objective emotion words exhibited a similarly high proportion as subjective emotion words, indicating that taking both objective and subjective emotion words into account simultaneously helps to comprehensively understand the emotional content of the comments. The sentiment analysis results showed that for the entire text, sentences with positive emotions accounted for 85.53% of the total comments, with a sentiment value of 0.786, which belonged to the positive medium level; the direction of the temporal “up-down-up” changes and the spatial pattern of high in the south and low in the north (while having little difference between the east and the west) were basically consistent with reality. A further exploration of the theoretical basis of the semi-supervised ANN approach or the introduction of other machine learning methods using different data sources will help to analyze this phenomenon in greater depth. The paper provides evidence for new data and methods for air quality research in tourist destinations and provides a new tool for air quality monitoring.

Download Full-text

SOCIAL MEDIA DATA PROCESSING AND ANALYSIS BY MEANS OF MACHINE LEARNING FOR RAPID DETECTION, ASSESSMENT AND MAPPING THE IMPACT OF DISASTERS

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b3-2020-1237-2020 ◽

2020 ◽

Vol XLIII-B3-2020 ◽

pp. 1237-1241

Author(s):

P. M. Kikin ◽

A. A. Kolesnikov ◽

E. A. Panidi

Keyword(s):

Machine Learning ◽

Social Media ◽

Relevant Information ◽

Important Task ◽

Machine Learning Techniques ◽

Social Media Data ◽

Accurate Localization ◽

The Impact ◽

Official Sources ◽

Media Data

Abstract. The main factor determining the possibility of using data obtained from social media as a source of information about the threat of emergencies is their relevance and accuracy. Thus, the important task is the determination of metrics for evaluating these parameters for a specific publication in a social media. It is worth noting the importance of this information channel as a source of eyewitness accounts from the scene. A comparison of social media data and official sources shows that social media contain a significant amount of unique information at different stages of emergency development. Also, when monitoring the situation for a specific event, social media allows to get more relevant information in comparison to official sources. Another important task is to search for emergency messages and their most accurate localization in space. A promising solution for the analysis and processing of social media data during emergency response is the application of artificial intelligence methods, and, particularly, machine learning techniques.

Download Full-text

Identification of HATE speech tweets in Pashto language using Machine Learning techniques

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/021032021 ◽

2021 ◽

Vol 10 (3) ◽

pp. 1501-1508

Keyword(s):

Machine Learning ◽

Social Media ◽

Decision Tree ◽

Sentiment Analysis ◽

Hate Speech ◽

Research Work ◽

Machine Learning Techniques ◽

Related Data ◽

Learning Techniques ◽

The Impact

From the last few years, researchers are very much attracted to sentiment analysis, especially towards hate speech detectionsystems. As in different languages procreation of hate speech has compelling and symbolic consideration on social media. Hate speech has a great impact on society, using hate words harms others dignity. Hate speech detectionsystems areimportant to stop the transformation of hate words into crimes. In this research,a frameworkis developedfor hate speech detectionsystemin the Pashto language. A datasetis created for which data is collected from Twitter. Because there is no related data available. Most of the research work has been done in this domain for other languages, and it’s very maturein the context of detecting hate speech. But when it arrives at the morphological languages not much work has been done especially in the Pashto language. This researchaimed and collected data from Twitter, Tweets related to ethnicity and religion. The data collected from twitter has been annotated manually and categorized the data as hate or not by comparing it with the offensive content. For hate speechdetection systemsto view the impact of different features/attribute this study performed experiments on the existing classifiers i.e.,SVM, Naïve Bayes, Decision tree and KNN. SVM produced the highest result at dataset of 500 i.e.,74% among all the classifiers. KNN and Decision Tree produced same result at dataset of 1500 i.e.,65.0%. Dataset of 2800 Decision Tree produced the highest result i.e.,72% and SVM produced 71.9%.

Download Full-text