scholarly journals Predicting Election Results using NLTK

In today’s world, people are usually using social media networks for trying to communicate with other users and for sharing information across the world. The online social networking sites have become considerable tools and are providing a common medium for a number of users to communicate with each other. Twitter is the most prominent microblogging website and one among the social networking sites that grow on a daily basis. Social media incorporates an extensive amount of data in the form of tweets, forums, status updates, comments, etc. in an attempt to automatically process and analyze these data, applications can rely on analysis approaches such as sentiment analysis. Twitter sentiment analysis is an application of sentiment analysis on data from Twitter (tweets), to obtain user's opinions and sentiments. Natural Language Toolkit (NLTK) is a library based on machine learning methods in python & sentiment analysis tool. Which provides the base for text processing and classification? The research work proposed a machine learning-based classifier to extract the tweets on elections and analyze the opinion of the tweeples (people who use twitter). The tweets can be categorized as positive, negative and neutral towards a particular politician. We classify these processed tweets using a supervised machine learning classification approach. The classifier used to classify the tweets as positive, negative or neutral is Naive Bayes Classifier. The classifier is trained with tweets bearing a distinctive polarity. The percentage of positive and negative tweets is then measured and graphically represented.

Author(s):  
Kusumanchi Naga Sireesha and Padala Srinivasa Reddy

Along with the Coronavirus pandemic, another crisis has manifested itself in the form of mass fear and panic phenomena, fuelled by incomplete and often inaccurate information. There is therefore a tremendous need to address and better understand COVID-19’s informational crisis. The diverse use of social networking sites, like Twitter, speeds up the process of sharing information and having views on community events and health crises COVID-19 has been one of Twitter's trending areas. The Twitter messages created via Twitter are named Tweets. In this paper, we identify public sentiment associated with the pandemic using Coronavirus-specific Tweets and Python, along with its sentiment analysis packages. We provide an overview of two essential machine learning classification methods, in the context of textual analytics, and compare their effectiveness in classifying Coronavirus Tweets of varying lengths. This research provides insights into Coronavirus fear sentiment progression, associated methods, limitations, and different opportunities. In this project, we have designed a Sentiment analysis System that would identify the sentiment of a tweet and classify it into one of the five classes they include:”ExtremelyPositive”,“Positive”,”Neutral”, ”Negative” and “Extremely Negative”.


2018 ◽  
Vol 34 (3) ◽  
pp. 569-581 ◽  
Author(s):  
Sujata Rani ◽  
Parteek Kumar

Abstract In this article, an innovative approach to perform the sentiment analysis (SA) has been presented. The proposed system handles the issues of Romanized or abbreviated text and spelling variations in the text to perform the sentiment analysis. The training data set of 3,000 movie reviews and tweets has been manually labeled by native speakers of Hindi in three classes, i.e. positive, negative, and neutral. The system uses WEKA (Waikato Environment for Knowledge Analysis) tool to convert these string data into numerical matrices and applies three machine learning techniques, i.e. Naive Bayes (NB), J48, and support vector machine (SVM). The proposed system has been tested on 100 movie reviews and tweets, and it has been observed that SVM has performed best in comparison to other classifiers, and it has an accuracy of 68% for movie reviews and 82% in case of tweets. The results of the proposed system are very promising and can be used in emerging applications like SA of product reviews and social media analysis. Additionally, the proposed system can be used in other cultural/social benefits like predicting/fighting human riots.


Author(s):  
Veronica Ravaglia ◽  
Luca Zanazzi ◽  
Elvis Mazzoni

Through Social Media, like social networking sites, wikis, web forums or blogs, people can debate and influence each other. Due to this reason, the analysis of online conversations has been recognized to be relevant to organizations. In the chapter we introduce two strategic tools to monitor and analyze online conversations, Sentiment Text Analysis (STA) and Network Text Analysis (NTA). Finally, we propose one empirical example in which these tools are integrated to analyze Word-of-Mouth regarding products and services in the Digital Marketplace.


2021 ◽  
Vol 8 (12) ◽  
pp. 234-237
Author(s):  
Hrishabh Patidar ◽  
Jayesh Umre

Depression is a major concern snowballing day by day. There can be various causes of depression but mental illness is the main problem. A lot of people suffer from depression and a very few of them go through treatment. One out of six people between ages 10 to 19 years are suffering from depression. At its worst, depression can lead to suicide. Depression reduces user’s ability to do work study or socialize. One solution to this problem is study of individual’s behaviour through social media. We could know a person’s opinion, thinking, mood etc. through his social media. These attributes of user can be collected from different social networking sites like Instagram, Facebook, and Twitter etc. Social networking sites can be used as an analysis tool to predict depression level. Our projects aim is to gather information of user from their social media posts and predict his depression level.


Every year tens of millions of people suffer from depression and few of them get proper treatment on time. So, it is crucial to detect human stress and relaxation automatically via social media on a timely basis. It is very important to detect and manage stress before it goes into a severe problem. A huge number of informal messages are posted every day in social networking sites, blogs and discussion forums. This paper describes an approach to detect the stress using the information from social media networking sites, like tweeter.This paper presents a method to detect expressions of stress and relaxation on tweeter dataset i.e. working on sentiment analysis to find emotions or feelings about daily life. Sentiment analysis works the automatic extraction of sentiment related information from text. Here using TensiStrengthframework for sentiment strength detection on social networking sites to extract sentiment strength from the informal English text. TensiStrength is a system to detect the strength of stress and relaxation expressed in social media text messages. TensiStrength uses a lexical approach and a set of rules to detect direct and indirect expressions of stress or relaxation. This classifies both positive and negative emotions based on the strength scale from -5 to +5 indications of sentiments. Stressed sentences from the conversation are considered &categorised into stress and relax. TensiStrength is robust, it can be applied to a widevarietyofdifferent social web contexts. Theeffectiveness of TensiStrength depends on the nature of the tweets.In human being there is inborn capability to differentiate the multiple senses of an ambiguous word in a particular context, but machine executes only according to the instructions. The major drawback of machine translation is Word Sense Disambiguation. There is a fact that a single word can have multiple meanings or "senses." In the pre-processing partof-speech disambiguation is analysed and the drawback of WSD overcomes in the proposed method by unigram, bigram and trigram to give better result on ambiguous words. Here, SVM with Ngram gives better resultPrecision is65% and Recall is 67% .But, the main objective of this technique is to find the explicit and implicit amounts of stress and relaxation expressed in tweets. Keywords: Stress Detection, Data Mining, TensiStrength, word sense disambiguation.


Social media is one of the most influential tool for sharing information across different regions among different users .The people sharing their interests in various aspects in online social networking platforms like Facebook, twitter etc. Therefore the usage of hate text steadily increasing. Nowadays it has been reviled unfair behavior of the users in social networking sites. The existence of abusive text on different online social networking platforms and identification of such text is a big challenging task. To understand the complexity of language constructs in different languages is very difficult .Already lot of research work has completed in English language. This paper gives detail analysis of detecting hate text in various languages Hindi, urdu, Arabic, Bengali, Telugu. We incorporated various kinds of ML and DL based algorithms to identify hate text in OSN’s. A review is done related to different classifiers where a comparison made between different models of ML, DL algorithms. Finally finds the accurate method to classify the text is offensive or not by finding the parameters i.e. accuracy and F1score


2019 ◽  
Vol 8 (2) ◽  
pp. 2421-2428

Social Media is a popular medium of communication amongst youngsters to remain connected with their friends. Facebook is one of the most preferred Social Media Sites which store the gigantic amount of data which can be explored for Sentiment Analysis. In this study, we have applied hybrid analysis approach which combines the best features of a lexical analysis and SVM machine learning classification algorithm on Facebook Posts. The analysis is further improved by incorporating language discourse features to detect intensity of sentiment and the prominent emotions expressed through these posts.


The main objective of this paper is Analyze the reviews of Social Media Big Data of E-Commerce product’s. And provides helpful result to online shopping customers about the product quality and also provides helpful decision making idea to the business about the customer’s mostly liking and buying products. This covers all features or opinion words, like capitalized words, sequence of repeated letters, emoji, slang words, exclamatory words, intensifiers, modifiers, conjunction words and negation words etc available in tweets. The existing work has considered only two or three features to perform Sentiment Analysis with the machine learning technique Natural Language Processing (NLP). In this proposed work familiar Machine Learning classification models namely Multinomial Naïve Bayes, Support Vector Machine, Decision Tree Classifier, and, Random Forest Classifier are used for sentiment classification. The sentiment classification is used as a decision support system for the customers and also for the business.


Author(s):  
Miss. Pooja Dilip Dhotre

Social media websites are among the internet's most far-reaching digital sites. Billions of social network users exist Users' frequent interactions with social networking sites, like Twitter, have a widespread and sometimes unfortunate effect on day-to-day life. Social networking sites make it easy for large amounts of unwanted and unrelated information to spread around the world. Twitter is a popular micro blogging service where users connect with others with similar interests. Because of the current popularity of Twitter, it is vulnerable to public shaming. Recently, Twitter has emerged as a rich source of human-generated information, with the added benefit of connecting you with customers and enabling two-way communication. It is generally accepted that when someone posts a comment in an occurrence, it is likely to humiliate the victim. The fact that shaming users' follower counts increase faster than that of the people who don't use shame is interesting. Using machine learning algorithms, users will be able to identify disrespectful words, as well as the overall negativity of those words, which is displayed in a percentage.


2019 ◽  
Vol 23 (1) ◽  
pp. 52-71 ◽  
Author(s):  
Siyoung Chung ◽  
Mark Chong ◽  
Jie Sheng Chua ◽  
Jin Cheon Na

PurposeThe purpose of this paper is to investigate the evolution of online sentiments toward a company (i.e. Chipotle) during a crisis, and the effects of corporate apology on those sentiments.Design/methodology/approachUsing a very large data set of tweets (i.e. over 2.6m) about Company A’s food poisoning case (2015–2016). This case was selected because it is widely known, drew attention from various stakeholders and had many dynamics (e.g. multiple outbreaks, and across different locations). This study employed a supervised machine learning approach. Its sentiment polarity classification and relevance classification consisted of five steps: sampling, labeling, tokenization, augmentation of semantic representation, and the training of supervised classifiers for relevance and sentiment prediction.FindingsThe findings show that: the overall sentiment of tweets specific to the crisis was neutral; promotions and marketing communication may not be effective in converting negative sentiments to positive sentiments; a corporate crisis drew public attention and sparked public discussion on social media; while corporate apologies had a positive effect on sentiments, the effect did not last long, as the apologies did not remove public concerns about food safety; and some Twitter users exerted a significant influence on online sentiments through their popular tweets, which were heavily retweeted among Twitter users.Research limitations/implicationsEven with multiple training sessions and the use of a voting procedure (i.e. when there was a discrepancy in the coding of a tweet), there were some tweets that could not be accurately coded for sentiment. Aspect-based sentiment analysis and deep learning algorithms can be used to address this limitation in future research. This analysis of the impact of Chipotle’s apologies on sentiment did not test for a direct relationship. Future research could use manual coding to include only specific responses to the corporate apology. There was a delay between the time social media users received the news and the time they responded to it. Time delay poses a challenge to the sentiment analysis of Twitter data, as it is difficult to interpret which peak corresponds with which incident/s. This study focused solely on Twitter, which is just one of several social media sites that had content about the crisis.Practical implicationsFirst, companies should use social media as official corporate news channels and frequently update them with any developments about the crisis, and use them proactively. Second, companies in crisis should refrain from marketing efforts. Instead, they should focus on resolving the issue at hand and not attempt to regain a favorable relationship with stakeholders right away. Third, companies can leverage video, images and humor, as well as individuals with large online social networks to increase the reach and diffusion of their messages.Originality/valueThis study is among the first to empirically investigate the dynamics of corporate reputation as it evolves during a crisis as well as the effects of corporate apology on online sentiments. It is also one of the few studies that employs sentiment analysis using a supervised machine learning method in the area of corporate reputation and communication management. In addition, it offers valuable insights to both researchers and practitioners who wish to utilize big data to understand the online perceptions and behaviors of stakeholders during a corporate crisis.


Sign in / Sign up

Export Citation Format

Share Document