SENTIMENT ANALYSIS ON TWITTER BY USING MAXIMUM ENTROPY AND SUPPORT VECTOR MACHINE METHOD

With the advancement of social media and its growth, there is a lot of data that can be presented for research in social mining. Twitter is a microblogging that can be used. In this event, a lot of companies used the data on Twitter to analyze the satisfaction of their customer about product quality. On the other hand, a lot of users use social media to express their daily emotions. The case can be developed into a research study that can be used both to improve product quality, as well as to analyze the opinion on certain events. The research is often called sentiment analysis or opinion mining. While The previous research does a particularly useful feature for sentiment analysis, but it is still a lack of performance. Furthermore, they used Support Vector Machine as a classification method. On the other hand, most researchers found another classification method, which is considered more efficient such as Maximum Entropy. So, this research used two types of a dataset, the general opinion data, and the airline's opinion data. For feature extraction, we employ four feature extraction, such as pragmatic, lexical-grams, pos-grams, and sentiment lexical. For the classification, we use both of Support Vector Machine and Maximum Entropy to find the best result. In the end, the best result is performed by Maximum Entropy with 85,8% accuracy on general opinion data, and 92,6% accuracy on airlines opinion data.

Download Full-text

Studi Komparatif Metode Ekstraksi Fitur pada Analisis Sentimen Maskapai Penerbangan Menggunakan Support Vector Machine dan Maximum Entropy

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v3i3.1159 ◽

2019 ◽

Vol 3 (3) ◽

pp. 402-407 ◽

Cited By ~ 1

Author(s):

Mona Cindo ◽

Dian Palupi Rini ◽

Ermatita

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Maximum Entropy ◽

Entropy Method ◽

Support Vector ◽

Machine Method ◽

N Gram ◽

Almost All

Almost all companies use social media to improve their product services and provide after-sales services that allow their customers to review the quality of their products. By using Twitter social media to be an important source for tracking sentiment analysis. Sentiment analysis is one of the most popular studies today, using sentiment analysis companies can analyze customer satisfaction to improve their services. This study aims to analyze airline sentiments with five different features such as pragmatic, lexical n-gram, POS, sentiment, and LDA using the Support Vector Machine and Maximum Entropy methods. The best results can be obtained using the Maximum Entropy method using all feature extraction with an accuracy of 92.7% and in the Support Vector Machine method, the accuracy obtained is 89.2%.

Download Full-text

Analisis Sentimen Data Twitter Tentang Pasangan Capres-Cawapres Pemilu 2019 Dengan Metode Lexicon Based Dan Support Vector Machine

Jurnal Ilmiah FIFO ◽

10.22441/fifo.2019.v11i2.004 ◽

2019 ◽

Vol 11 (2) ◽

pp. 144

Author(s):

Danar Wido Seno ◽

Arief Wibowo

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Vice President ◽

Training Data ◽

Support Vector ◽

New Words ◽

Textual Data ◽

Data Content ◽

Combination Of Methods

Social media writing content growing make a lot of new words that appear on Twitter in the form of words and abbreviations that appear so that sentiment analysis is increasingly difficult to get high accuracy of textual data on Twitter social media. In this study, the authors conducted research on sentiment analysis of the pairs of candidates for President and Vice President of Indonesia in the 2019 Elections. To obtain higher accuracy results and accommodate the problem of textual data development on Twitter, the authors conducted a combination of methods to conduct the sentiment analysis with unsupervised and supervised methods. namely Lexicon Based. This study used Twitter data in October 2018 using the search keywords with the names of each pair of candidates for President and Vice President of the 2019 Elections totaling 800 datasets. From the study with 800 datasets the best accuracy was obtained with a value of 92.5% with 80% training data composition and 20% testing data with a Precision value in each class between 85.7% - 97.2% and Recall value for each class among 78, 2% - 93.5%. With the Lexicon Based method as a labeling dataset, the process of labeling the Support Vector Machine dataset is no longer done manually but is processed by the Lexicon Based method and the dictionary on the lexicon can be added along with the development of data content on Twitter social media.

Download Full-text

Sentiment Classification

Advances in Linguistics and Communication Studies - Modern Computational Models of Semantic Discovery in Natural Language ◽

10.4018/978-1-4666-8690-8.ch001 ◽

2015 ◽

pp. 1-26

Author(s):

Jalel Akaichi

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Sentiment Analysis ◽

Training Model ◽

Sentiment Classification ◽

The Other ◽

Support Vector ◽

Analysis Techniques ◽

Sentiment Lexicon ◽

And Behavior

In this work, we focus on the application of text mining and sentiment analysis techniques for analyzing Tunisian users' statuses updates on Facebook. We aim to extract useful information, about their sentiment and behavior, especially during the “Arabic spring” era. To achieve this task, we describe a method for sentiment analysis using Support Vector Machine and Naïve Bayes algorithms, and applying a combination of more than two features. The output of this work consists, on one hand, on the construction of a sentiment lexicon based on the Emoticons and Acronyms' lexicons that we developed based on the extracted statuses updates; and on the other hand, it consists on the realization of detailed comparative experiments between the above algorithms by creating a training model for sentiment classification.

Download Full-text

Sentiment Analysis with Social Media Analytics, Methods, Process, and Applications

Advances in Business Information Systems and Analytics - Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence ◽

10.4018/978-1-5225-2031-3.ch011 ◽

2017 ◽

pp. 192-208

Author(s):

Karteek Ramalinga Ponnuru ◽

Rashik Gupta ◽

Shrawan Kumar Trivedi

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Support Vector ◽

Social Media Analytics ◽

Nearest Neighbour ◽

Huge Amount ◽

Word Clouds

Firms are turning their eye towards social media analytics to get to know what people are really talking about their firm or their product. With the huge amount of buzz being created online about anything and everything social media has become ‘the' platform of the day to understand what public on a whole are talking about a particular product and the process of converting all the talking into valuable information is called Sentiment Analysis. Sentiment Analysis is a process of identifying and categorizing a piece of text into positive or negative so as to understand the sentiment of the users. This chapter would take the reader through basic sentiment classifiers like building word clouds, commonality clouds, dendrograms and comparison clouds to advanced algorithms like K Nearest Neighbour, Naïve Biased Algorithm and Support Vector Machine.

Download Full-text

Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/ijkdb.2017010103 ◽

2017 ◽

Vol 7 (1) ◽

pp. 30-41 ◽

Cited By ~ 12

Author(s):

Prayag Tiwari ◽

Brojo Kishore Mishra ◽

Sachin Kumar ◽

Vivek Kumar

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Maximum Entropy ◽

Learning Strategies ◽

Supervised Machine Learning ◽

Support Vector ◽

N Gram ◽

F Measure ◽

Blog Posts

Sentiment Analysis intends to get the basic perspective of the content, which may be anything that holds a subjective supposition, for example, an online audit, Comments on Blog posts, film rating and so forth. These surveys and websites might be characterized into various extremity gatherings, for example, negative, positive, and unbiased keeping in mind the end goal to concentrate data from the info dataset. Supervised machine learning strategies group these reviews. In this paper, three distinctive machine learning calculations, for example, Support Vector Machine (SVM), Maximum Entropy (ME) and Naive Bayes (NB), have been considered for the arrangement of human conclusions. The exactness of various strategies is basically inspected keeping in mind the end goal to get to their execution on the premise of parameters, e.g. accuracy, review, f-measure, and precision.

Download Full-text

Ooredoo Rayek

International Journal of Technology Diffusion ◽

10.4018/ijtd.2020040105 ◽

2020 ◽

Vol 11 (2) ◽

pp. 66-81

Author(s):

Badia Klouche ◽

Sidi Mohamed Benslimane ◽

Sakina Rim Bennabi

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Text Mining ◽

Sentiment Analysis ◽

Experimental Results ◽

Support Vector ◽

Textual Data ◽

New Strategy ◽

Set Up

Sentiment analysis is one of the recent areas of emerging research in the classification of sentiment polarity and text mining, particularly with the considerable number of opinions available on social media. The Algerian Operator Telephone Ooredoo, as other operators, deploys in its new strategy to conquer new customers, by exploiting their opinions through a sentiments analysis. The purpose of this work is to set up a system called “Ooredoo Rayek”, whose objective is to collect, transliterate, translate and classify the textual data expressed by the Ooredoo operator's customers. This article developed a set of rules allowing the transliteration from Algerian Arabizi to Algerian dialect. Furthermore, the authors used Naïve Bayes (NB) and (Support Vector Machine) SVM classifiers to assign polarity tags to Facebook comments from the official pages of Ooredoo written in multilingual and multi-dialect context. Experimental results show that the system obtains good performance with 83% of accuracy.

Download Full-text

Gender Classification Based on Geometry Features of Palm Image

The Scientific World JOURNAL ◽

10.1155/2014/734564 ◽

2014 ◽

Vol 2014 ◽

pp. 1-7 ◽

Cited By ~ 7

Author(s):

Ming Wu ◽

Yubo Yuan

Keyword(s):

Image Processing ◽

Support Vector Machine ◽

The Other ◽

Classification Method ◽

Support Vector ◽

Gender Classification ◽

Gender Recognition ◽

Classification Rate ◽

Classification Approach ◽

Smooth Support Vector Machine

This paper presents a novel gender classification method based on geometry features of palm image which is simple, fast, and easy to handle. This gender classification method based on geometry features comprises two main attributes. The first one is feature extraction by image processing. The other one is classification system with polynomial smooth support vector machine (PSSVM). A total of 180 palm images were collected from 30 persons to verify the validity of the proposed gender classification approach and the results are satisfactory with classification rate over 85%. Experimental results demonstrate that our proposed approach is feasible and effective in gender recognition.

Download Full-text

PUBLIC’S SENTIMENT ANALYSIS ON SHOPEE-FOOD SERVICE USING LEXICON-BASED AND SUPPORT VECTOR MACHINE

Jurnal Riset Informatika ◽

10.34288/jri.v4i1.287 ◽

2021 ◽

Vol 4 (1) ◽

pp. 1-8

Author(s):

Shafira Shalehanny ◽

Agung Triayudi ◽

Endah Tri Esti Handayani

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Food Service ◽

Support Vector ◽

Accuracy Score ◽

Online Business ◽

Data Source ◽

Sentiment Score ◽

Processing Language

Technology field following how era keep evolving. Social media already on everyone’s daily life and being a place for writing their opinion, either review or response for product and service that already being used. Twitter are one of popular social media on Indonesia, according to Statista data it reach 17.55 million users. For online business sector, knowing sentiment score are really important to stepping up their business. The use of machine learning, NLP (Natural Processing Language), and text mining for knowing the real meaning of opinion words given by customer called sentiment analysis. Two methods are using for data testing, the first is Lexicon Based and the second is Support Vector Machine (SVM). Data source that used for sentiment analyst are from keyword ‘ShopeeFood’ and ‘syopifud’. The result of analysis giving accuracy score 87%, precision score 81%, recall score 75%, and f1-score 78%.

Download Full-text

Multi-aspect sentiment analysis on netflix application using latent dirichlet allocation and support vector machine methods

JURNAL INFOTEL ◽

10.20895/infotel.v13i3.670 ◽

2021 ◽

Vol 13 (3) ◽

pp. 128-133

Author(s):

Attala Rafid Abelard ◽

Yuliant Sibaroni

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Latent Dirichlet Allocation ◽

The Other ◽

Support Vector ◽

Performance Score ◽

Negative Class ◽

Google Play ◽

Dirichlet Allocation

Among many film streaming platforms that have sprung up, Netflix is the platform that has the most subscribers compared to the other platforms. However, not all reviews provided by the Netflix users are good reviews. These reviews will later be analyzed to determine what aspects are reviewed by the users based on reviews written on the Google Play Store, using the Latent Dirichlet Allocation (LDA) method. Then, the classification process using the Support Vector Machine (SVM) method will be carried out to determine whether each of these reviews is included in the positive or negative class (Sentiment Analysis). There are 2 scenarios that were carried out in this study. The first scenario resulted that the best number of LDA topics to be used is 40, and the second scenario resulted that the use of filtering process in the preprocessing stage reduces the score of the f1-score. Thus, this study resulted in the best performance score on LDA and SVM testing with 40 topics, and without running the filtering process with the score of 78.15%.

Download Full-text

Effects of kernels and the proportion of training data on the accuracy of SVM sentiment analysis in lecturer evaluation

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v9.i4.pp734-743 ◽

2020 ◽

Vol 9 (4) ◽

pp. 734

Author(s):

Daniel Febrian Sengkey ◽

Agustinus Jacobus ◽

Fabian Johanes Manoppo

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Statistical Methods ◽

Statistical Test ◽

The Other ◽

Training Data ◽

Support Vector ◽

Linear Kernel ◽

Linear Polynomial ◽

Accuracy Data

Support vector machine (SVM) is a known method for supervised learning in sentiment analysis and there are many studies about the use of SVM in classifying the sentiments in lecturer evaluation. SVM has various parameters that can be tuned and kernels that can be chosen to improve the classifier accuracy. However, not all options have been explored. Therefore, in this study we compared the four SVM kernels: radial, linear, polynomial, and sigmoid, to discover how each kernel influences the accuracy of the classifier. To make a proper assessment, we used our labeled dataset of students’ evaluations toward the lecturer. The dataset was split, one for training the classifier, and another one for testing the model. As an addition, we also used several different ratios of the training:testing dataset. The split ratios are 0.5 to 0.95, with the increment factor of 0.05. The dataset was split randomly, hence the splitting-training-testing processes were repeated 1,000 times for each kernel and splitting ratio. Therefore, at the end of the experiment, we got 40,000 accuracy data. Later, we applied statistical methods to see whether the differences are significant. Based on the statistical test, we found that in this particular case, the linear kernel significantly has higher accuracy compared to the other kernels. However, there is a tradeoff, where the results are getting more varied with a higher proportion of data used for training.

Download Full-text