scholarly journals Text Preprocessing Method on Twitter Sentiment Analysis using Machine Learning

In real world, twitter sentimental analysis (TSA) acting a major role in observing the public opinion about customer side. TSA is complex compared to general sentiment analysis due to pre-processing of text on Twitter. The maximum limit on the number of characters allowed on Twitter is 280. In this article we discuss the influence of the text pre-processing technique on the classification efficiency of emotions in two kinds of classification problems and summarize the classification efficiency of the four pre-processing methods. This paper contributes to the consumer satisfaction classification sentiment analysis and is useful in evaluating the details in the context of the amount of tweets where views are somewhat unstructured and are either positive or negative, or somewhere in between. We first pre-processed the dataset, then extracted the adjective from the dataset with some meaning called the feature vector, then selected the feature vector list and subsequently applied machine learning based classification algorithms namely: Naive Bayes, Random Forest and SVM along with WordNet based Semantic Orientation which extracts synonyms and similarity for the features of content. Experiments display that the accuracy (Acc) and average F1-measure (F1-M) of the classification classifier on Twitter are enhanced by using methods of pre-processing the extension of acronyms and swapping negation, but barely deleting numbers or stop words

2021 ◽  
Vol 11 (10) ◽  
pp. 4443
Author(s):  
Rokas Štrimaitis ◽  
Pavel Stefanovič ◽  
Simona Ramanauskaitė ◽  
Asta Slotkienė

Financial area analysis is not limited to enterprise performance analysis. It is worth analyzing as wide an area as possible to obtain the full impression of a specific enterprise. News website content is a datum source that expresses the public’s opinion on enterprise operations, status, etc. Therefore, it is worth analyzing the news portal article text. Sentiment analysis in English texts and financial area texts exist, and are accurate, the complexity of Lithuanian language is mostly concentrated on sentiment analysis of comment texts, and does not provide high accuracy. Therefore in this paper, the supervised machine learning model was implemented to assign sentiment analysis on financial context news, gathered from Lithuanian language websites. The analysis was made using three commonly used classification algorithms in the field of sentiment analysis. The hyperparameters optimization using the grid search was performed to discover the best parameters of each classifier. All experimental investigations were made using the newly collected datasets from four Lithuanian news websites. The results of the applied machine learning algorithms show that the highest accuracy is obtained using a non-balanced dataset, via the multinomial Naive Bayes algorithm (71.1%). The other algorithm accuracies were slightly lower: a long short-term memory (71%), and a support vector machine (70.4%).


Author(s):  
Basant Agarwal ◽  
Namita Mittal

Opinion Mining or Sentiment Analysis is the study that analyzes people's opinions or sentiments from the text towards entities such as products and services. It has always been important to know what other people think. With the rapid growth of availability and popularity of online review sites, blogs', forums', and social networking sites' necessity of analysing and understanding these reviews has arisen. The main approaches for sentiment analysis can be categorized into semantic orientation-based approaches, knowledge-based, and machine-learning algorithms. This chapter surveys the machine learning approaches applied to sentiment analysis-based applications. The main emphasis of this chapter is to discuss the research involved in applying machine learning methods mostly for sentiment classification at document level. Machine learning-based approaches work in the following phases, which are discussed in detail in this chapter for sentiment classification: (1) feature extraction, (2) feature weighting schemes, (3) feature selection, and (4) machine-learning methods. This chapter also discusses the standard free benchmark datasets and evaluation methods for sentiment analysis. The authors conclude the chapter with a comparative study of some state-of-the-art methods for sentiment analysis and some possible future research directions in opinion mining and sentiment analysis.


Big Data ◽  
2016 ◽  
pp. 1917-1933
Author(s):  
Basant Agarwal ◽  
Namita Mittal

Opinion Mining or Sentiment Analysis is the study that analyzes people's opinions or sentiments from the text towards entities such as products and services. It has always been important to know what other people think. With the rapid growth of availability and popularity of online review sites, blogs', forums', and social networking sites' necessity of analysing and understanding these reviews has arisen. The main approaches for sentiment analysis can be categorized into semantic orientation-based approaches, knowledge-based, and machine-learning algorithms. This chapter surveys the machine learning approaches applied to sentiment analysis-based applications. The main emphasis of this chapter is to discuss the research involved in applying machine learning methods mostly for sentiment classification at document level. Machine learning-based approaches work in the following phases, which are discussed in detail in this chapter for sentiment classification: (1) feature extraction, (2) feature weighting schemes, (3) feature selection, and (4) machine-learning methods. This chapter also discusses the standard free benchmark datasets and evaluation methods for sentiment analysis. The authors conclude the chapter with a comparative study of some state-of-the-art methods for sentiment analysis and some possible future research directions in opinion mining and sentiment analysis.


Author(s):  
Neha Gupta ◽  
Rashmi Agrawal

Online social media (forums, blogs, and social networks) are increasing explosively, and utilization of these new sources of information has become important. Semantics plays a significant role in accurate analysis of an emotion speech context. Adding to this area, the already advanced semantic technologies have proven to increase the precision of the tests. Deep learning has emerged as a prominent machine learning technique that learns multiple layers or data characteristics and delivers state-of-the-art output. Throughout recent years, deep learning has been widely used in the study of sentiments, along with the growth of deep learning in many other fields of use. This chapter will offer a description of deep learning and its application in the analysis of sentiments. This chapter will focus on the semantic orientation-based approaches for sentiment analysis. In this work, a semantically enhanced methodology for the annotation of sentiment polarity in Twitter/ Facebook data will be presented.


2021 ◽  
Author(s):  
Kevin Qu ◽  
Yu Sun

A number of social issues have been grown due to the increasing amount of “fake news”. With the inevitable exposure to this misinformation, it has become a real challenge for the public to process the correct truth and knowledge with accuracy. In this paper, we have applied machine learning to investigate the correlations between the information and the way people treat it. With enough data, we are able to safely and accurately predict which groups are most vulnerable to misinformation. In addition, we realized that the structure of the survey itself could help with future studies, and the method by which the news articles are presented, and the news articles itself also contributes to the result.


Author(s):  
Amrita Mishra ◽  

Sentiment Analysis has paved routes for opinion analysis of masses over unrestricted territorial limits. With the advent and growth of social media like Twitter, Facebook, WhatsApp, Snapchat in today’s world, stakeholders and the public often takes to expressing their opinion on them and drawing conclusions. While these social media data are extremely informative and well connected, the major challenge lies in incorporating efficient Text Classification strategies which not only overcomes the unstructured and humongous nature of data but also generates correct polarity of opinions (i.e. positive, negative, and neutral). This paper is a thorough effort to provide a brief study about various approaches to SA including Machine Learning, Lexicon Based, and Automatic Approaches. The paper also highlights the comparison of positive, negative, and neutral tweets of the Sputnik V, Moderna, and Covaxin vaccines used for preventive and emergency use of COVID-19 disease.


Author(s):  
Ganesh K. Shinde

Abstract: Sentiment Analysis has improvement in online shopping platforms, scientific surveys from political polls, business intelligence, etc. In this we trying to analyse the twitter posts about Hashtag like #MakeinIndia using Machine Learning approach. By doing opinion mining in a specific area, it is possible to identify the effect of area information in sentiment analysis. We put forth a feature vector for classifying the tweets as positive, negative and neutral. After that applied machine learning algorithms namely: MaxEnt and SVM. We utilised Unigram, Bigram and Trigram Features to generate a set of features to train a linear MaxEnt and SVM classifiers. In the end we have measured the performance of classifier in terms of overall accuracy. Keywords: Sentiment analysis, support vector machine, maximum entropy, N-gram, Machine Learning


2021 ◽  
Vol 09 (02) ◽  
pp. 536-556
Author(s):  
Panagiota Pampouktsi ◽  
Spyridon Avdimiotis ◽  
Manolis Μaragoudakis ◽  
Markos Avlonitis

2021 ◽  
Vol 8 (1) ◽  
pp. 147
Author(s):  
Primandani Arsi ◽  
Retno Waluyo

<p class="Abstrak">Dewasa ini, media sosial berkembang pesat di internet, salah satu yang banyak digemari adalah Twitter. Berbagai topik ramai diperbincangkan di Twitter mulai dari ekonomi, politik, sosial, budaya, hukum dan lain-lain. Salah satu topik yang ramai diperbincangkan di Twitter adalah terkait isu pemindahan ibu kota Indonesia. Namun dibalik hal tersebut terdapat kontroversi dari  pihak yang merasa  pro dan kontra, masing-masing memiiki sudut pandang yang berbeda.  Hal ini menyebabkan munculnya fenomena perdebatan khususnya di Twitter yang sebenarnya menunjukkan perhatian kolektif mengenai wacana publik tersebut. Analisis sentimen adalah proses mengekstraksi, memahami dan mengolah data berupa teks yang tidak terstruktur secara otomatis guna mendapatkan informasi sentimen yang terdapat pada sebuah kalimat pendapat atau opini. Dalam penerapan analisis sentimen menggunakan metode <em>machine learning</em> terdapat beberapa metode yang sering digunakan. Dalam penelitian ini diusulkan metode <em>Support Vector Machine</em> (SVM) untuk diterapkan pada <em>tweets</em> topik pemindahan ibu kota Indonesia untuk tujuan klasifikasi kelas sentimen pada media sosial <em>twitter</em>. Teknis klasifikasi  dilakukan dengan cara mengklasifikasikan menjadi 2 kelas yakni positif dan negatif. Berdasarkan hasil pengujian yang dilakukan terhadap <em>tweets</em> sentimen pemindahan ibu kota dari media sosial twitter sebanyak 1.236 <em>tweets</em> (404 positif dan 832 negatif) menggunakan SVM diperoleh akurasi =96,68%, <em>precision=</em>95.82%, <em>recall</em>=94.04% dan AUC = 0,979.</p><p class="Abstrak"> </p><p class="Abstrak"><em><strong>Abstract</strong></em></p><p class="Abstrak"><em><em>Today, social media is growing fast on the internet<span lang="EN-GB">.</span><span lang="EN-GB">On</span>e of the most popular<span lang="EN-GB"> social media</span> is Twitter. Many topics are discussed on Twitter such as economic, politic, socia<span lang="EN-GB">l</span>, cultur<span lang="EN-GB">e</span>, <span lang="EN-GB">and l</span>aw<span lang="EN-GB">.</span> One of the hot topics discussed on Twitter is the issue of relocating Indonesia's capital city. However<span lang="EN-GB">, </span>there is controversy from supporters and opponents<span lang="EN-GB">. They</span> have different views. <span lang="EN-GB">This issue leads to</span> a phenomenon of debate on Twitter <span lang="EN-GB">that </span>actually show<span lang="EN-GB">s a </span>collective concern about the public discourse. Sentiment analysis is a process of extracting, understand<span lang="EN-GB">ing </span>and process<span lang="EN-GB">ing</span> unstructured data to get sentiment information which is<span lang="EN-GB"> found</span> in an opinion sentence. Application of sentiment analysis using machine learning methods<span lang="EN-GB"> shows that</span> there are several methods that are often used. In this study, the Support Vector Machine (SVM) method is proposed to be applied to tweets on the topic of relocating Indonesia's capital city for sentiment classification on social media twitter. The classification technique is carried out into 2 classes, namely positive and negative. Based on testing on the sentiment of relocating Indonesia's capital city from social media twitter from 1,116 tweets (404 positive and 832 negative) using SVM obtained accuracy = 96.68%, precision = 95.82%, recall = 94.04% and AUC = 0.979.</em></em></p>


Sign in / Sign up

Export Citation Format

Share Document