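Analisis Sentimen Data Twitter Tentang Pasangan Capres-Cawapres Pemilu 2019 Dengan Metode Lexicon Based Dan Support Vector Machine [Sentiment Analysis of Twitter Data on the Presidential and Vice-Presidential Candidate Pairs of the 2019 Election Using the Lexicon Based and Support Vector Machine Methods]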

The growth of written content on social media keeps introducing new words and abbreviations on Twitter, which makes it increasingly difficult for sentiment analysis to reach high accuracy on Twitter text. This study analyzes sentiment toward the pairs of candidates for President and Vice President of Indonesia in the 2019 elections. To obtain higher accuracy and to accommodate the evolving vocabulary of Twitter text, the authors combine an unsupervised and a supervised method, namely Lexicon Based and Support Vector Machine. The dataset consists of 800 Twitter records collected in October 2018 using the names of each candidate pair as search keywords. The best accuracy, 92.5%, was obtained with 80% of the data used for training and 20% for testing, with per-class precision between 85.7% and 97.2% and per-class recall between 78.2% and 93.5%. Because the Lexicon Based method labels the dataset, the training data for the Support Vector Machine no longer has to be labeled manually, and the lexicon dictionary can be extended as content on Twitter evolves.
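
The labeling step described above can be illustrated with a minimal sketch. The tiny lexicon, the example tweets, and the three-class scheme (positive, negative, neutral) are assumptions made for illustration; the abstract does not list the dictionary or the classes the authors used. The labels produced this way would then serve as training targets for a supervised classifier such as an SVM.

```python
# Hypothetical mini-lexicon (word -> polarity); a real study would use a
# full Indonesian sentiment dictionary that can be extended over time.
LEXICON = {"bagus": 1, "hebat": 1, "mantap": 1,
           "buruk": -1, "gagal": -1, "bohong": -1}


def lexicon_label(text, lexicon=LEXICON):
    """Sum the polarity of known words and map the score to a class label."""
    score = sum(lexicon.get(token, 0) for token in text.lower().split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


# Made-up example tweets, not taken from the study's dataset.
tweets = ["Program kerja bagus dan hebat", "Janji gagal lagi", "Debat malam ini"]
labels = [lexicon_label(t) for t in tweets]
print(labels)  # ['positive', 'negative', 'neutral']
```

Adding a new slang word or abbreviation to `LEXICON` changes the labels without any manual relabeling, which is the property the abstract highlights.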

Ooredoo Rayek

International Journal of Technology Diffusion ◽ 10.4018/ijtd.2020040105 ◽ 2020 ◽ Vol 11 (2) ◽ pp. 66-81

Author(s): Badia Klouche ◽ Sidi Mohamed Benslimane ◽ Sakina Rim Bennabi

Keyword(s): Social Media ◽ Support Vector Machine ◽ Text Mining ◽ Sentiment Analysis ◽ Experimental Results ◽ Support Vector ◽ Textual Data ◽ New Strategy ◽ Set Up

Sentiment analysis is an emerging research area in sentiment polarity classification and text mining, driven in particular by the considerable number of opinions available on social media. Like other operators, the Algerian telephone operator Ooredoo is pursuing a new strategy to win new customers by exploiting their opinions through sentiment analysis. The purpose of this work is to set up a system called “Ooredoo Rayek”, whose objective is to collect, transliterate, translate, and classify the textual data expressed by Ooredoo's customers. The article develops a set of rules for transliterating Algerian Arabizi into Algerian dialect. Furthermore, the authors use Naïve Bayes (NB) and Support Vector Machine (SVM) classifiers to assign polarity tags to Facebook comments from Ooredoo's official pages, which are written in a multilingual and multi-dialect context. Experimental results show that the system performs well, with an accuracy of 83%.

A Survey on Sentiment Analysis Algorithms and Techniques For Arabic Textual Data

10.54216/fpa.020205 ◽ 2020 ◽ pp. 74-87

Author(s): admin admin ◽ Gawaher Soliman Hussein ◽ ...

Keyword(s): Data Mining ◽ Social Media ◽ Support Vector Machine ◽ Sentiment Analysis ◽ Arab World ◽ Arabic Language ◽ Emotional Reaction ◽ Support Vector ◽ K Nearest Neighbor ◽ Textual Data

Sentiment refers to a feeling, behavior, belief, or attitude towards something, which is mostly embedded (implicit). Sentiment analysis is the process of analyzing, extracting, studying, and classifying the reviews and opinions given by people, and human emotions, into positive, negative, and neutral. It is considered one of the most significant scientific branches that aim to determine the behavior of the speaker, the attitude of the writer towards some topic, or the overall emotional reaction to a website, document, event, interaction, product, or service. Every day, many users share opinions on different topics through micro-blogging and similar platforms such as Facebook, Twitter, forums, and blogs, which are considered a rich resource for sentiment analysis and belief mining. Recently, a huge number of comments, tweets, and reviews posted on social media websites have carried rich information, and most online shopping sites let customers write product reviews in order to increase sales and to improve both product quality and customer satisfaction. Manual analysis of such large volumes of reviews is practically impossible, so an automated approach is needed. In the Middle East, and particularly in the Arab world, social media websites continue to be the most visited websites, especially with the current social and political changes in this part of the world. The main objective of this research is to compare the algorithms and techniques of sentiment analysis and classification for the Arabic language, since few researchers have addressed this point for Arabic. Different data mining algorithms and techniques, such as Support Vector Machine (SVM), Naïve Bayes (NB), Bayesian Network (BN), Decision Tree (DT), k-nearest neighbor (KNN), Maximum Entropy (ME), and Neural Network (NN), in addition to many alternative techniques, are used for analyzing and classifying textual data. Those techniques are evaluated on the Arabic language, whose richness and diversity make analyzing and mining its large number of linguistic words difficult. The comparison between data mining techniques showed that the most accurate technique is the support vector machine (SVM) algorithm. Every successful sentiment analysis depends on two essential tools: language and culture.

A sentiment analysis system for social media using machine learning techniques: Social enablement

Digital Scholarship in the Humanities ◽ 10.1093/llc/fqy037 ◽ 2018 ◽ Vol 34 (3) ◽ pp. 569-581 ◽ Cited By ~ 1

Author(s): Sujata Rani ◽ Parteek Kumar

Keyword(s): Machine Learning ◽ Social Media ◽ Sentiment Analysis ◽ Media Analysis ◽ Training Data ◽ Machine Learning Techniques ◽ Support Vector ◽ Analysis Tool ◽ Data Set ◽ Learning Techniques

This article presents an innovative approach to sentiment analysis (SA). The proposed system handles Romanized or abbreviated text and spelling variations in the text. The training data set of 3,000 movie reviews and tweets was manually labeled by native speakers of Hindi into three classes: positive, negative, and neutral. The system uses the WEKA (Waikato Environment for Knowledge Analysis) tool to convert the string data into numerical matrices and applies three machine learning techniques: Naive Bayes (NB), J48, and support vector machine (SVM). The system was tested on 100 movie reviews and tweets; SVM performed best among the classifiers, with an accuracy of 68% for movie reviews and 82% for tweets. The results are very promising, and the system can be used in emerging applications such as SA of product reviews and social media analysis. It can also serve other cultural and social purposes, such as predicting or countering riots.
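
As a rough illustration of what converting string data into numerical matrices means, the following sketch builds a plain bag-of-words count matrix. It does not use WEKA, and the abstract does not say which WEKA filter or weighting the authors applied, so this is only an assumed stand-in. The two Romanized example sentences are invented.

```python
def bag_of_words(docs):
    """Return a sorted vocabulary and one row of term counts per document."""
    vocab = sorted({token for doc in docs for token in doc.lower().split()})
    column = {token: i for i, token in enumerate(vocab)}
    matrix = []
    for doc in docs:
        row = [0] * len(vocab)
        for token in doc.lower().split():
            row[column[token]] += 1  # count every occurrence of the token
        matrix.append(row)
    return vocab, matrix


# Invented Romanized examples, not taken from the study's data set.
vocab, matrix = bag_of_words(["movie acha hai", "movie bura hai hai"])
print(vocab)   # ['acha', 'bura', 'hai', 'movie']
print(matrix)  # [[1, 0, 1, 1], [0, 1, 2, 1]]
```

Each row can then be passed to a classifier such as NB, J48, or SVM. Spelling variants of the same word end up in different columns, which is why the normalization step mentioned in the abstract matters.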

Sentiment Analysis with Social Media Analytics, Methods, Process, and Applications

Advances in Business Information Systems and Analytics - Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence ◽ 10.4018/978-1-5225-2031-3.ch011 ◽ 2017 ◽ pp. 192-208

Author(s): Karteek Ramalinga Ponnuru ◽ Rashik Gupta ◽ Shrawan Kumar Trivedi

Keyword(s): Social Media ◽ Support Vector Machine ◽ Sentiment Analysis ◽ Support Vector ◽ Social Media Analytics ◽ Nearest Neighbour ◽ Huge Amount ◽ Word Clouds

Firms are turning to social media analytics to learn what people are really saying about them and their products. With the huge amount of buzz created online about anything and everything, social media has become the platform for understanding what the public as a whole says about a particular product. The process of converting all this talk into valuable information is called sentiment analysis: identifying and categorizing a piece of text as positive or negative in order to understand the sentiment of its author. This chapter takes the reader from basic techniques such as word clouds, commonality clouds, dendrograms, and comparison clouds to advanced algorithms such as K Nearest Neighbour, Naïve Bayes, and Support Vector Machine.

PUBLIC’S SENTIMENT ANALYSIS ON SHOPEE-FOOD SERVICE USING LEXICON-BASED AND SUPPORT VECTOR MACHINE

Jurnal Riset Informatika ◽ 10.34288/jri.v4i1.287 ◽ 2021 ◽ Vol 4 (1) ◽ pp. 1-8

Author(s): Shafira Shalehanny ◽ Agung Triayudi ◽ Endah Tri Esti Handayani

Keyword(s): Social Media ◽ Support Vector Machine ◽ Sentiment Analysis ◽ Food Service ◽ Support Vector ◽ Accuracy Score ◽ Online Business ◽ Data Source ◽ Sentiment Score ◽ Processing Language

Technology keeps evolving with the times. Social media is now part of everyone's daily life and has become a place where people write their opinions, whether as reviews of or responses to products and services they have used. Twitter is one of the popular social media platforms in Indonesia; according to Statista data, it has reached 17.55 million users. For the online business sector, knowing the sentiment score is very important for growing the business. Using machine learning, Natural Language Processing (NLP), and text mining to determine the real meaning of the opinion words given by customers is called sentiment analysis. Two methods are used for data testing: Lexicon Based and Support Vector Machine (SVM). The data for the sentiment analysis were collected using the keywords ‘ShopeeFood’ and ‘syopifud’. The analysis yields an accuracy of 87%, a precision of 81%, a recall of 75%, and an f1-score of 78%.
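
The reported f1-score can be checked against the reported precision and recall, since F1 is their harmonic mean. The short sketch below is only an arithmetic check on the rounded figures from the abstract; the function name is illustrative.

```python
def f1_from(precision, recall):
    """F1 is the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Rounded values as reported in the abstract.
f1 = f1_from(0.81, 0.75)
print(round(f1, 2))  # 0.78
```

The result agrees with the stated 78%, so the three figures are mutually consistent.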

Effects of kernels and the proportion of training data on the accuracy of SVM sentiment analysis in lecturer evaluation

IAES International Journal of Artificial Intelligence (IJ-AI) ◽ 10.11591/ijai.v9.i4.pp734-743 ◽ 2020 ◽ Vol 9 (4) ◽ pp. 734

Author(s): Daniel Febrian Sengkey ◽ Agustinus Jacobus ◽ Fabian Johanes Manoppo

Keyword(s): Support Vector Machine ◽ Sentiment Analysis ◽ Statistical Methods ◽ Statistical Test ◽ The Other ◽ Training Data ◽ Support Vector ◽ Linear Kernel ◽ Linear Polynomial ◽ Accuracy Data

Support vector machine (SVM) is a well-known method for supervised learning in sentiment analysis, and many studies have used SVM to classify the sentiments in lecturer evaluations. SVM has various parameters that can be tuned and kernels that can be chosen to improve classifier accuracy. However, not all options have been explored. Therefore, in this study we compared four SVM kernels (radial, linear, polynomial, and sigmoid) to discover how each kernel influences the accuracy of the classifier. To make a proper assessment, we used our labeled dataset of students' evaluations of their lecturers. The dataset was split into one part for training the classifier and another for testing the model. In addition, we used several training:testing ratios, with the training proportion ranging from 0.5 to 0.95 in increments of 0.05. Because the dataset was split randomly, the splitting-training-testing process was repeated 1,000 times for each kernel and splitting ratio. At the end of the experiment we therefore had 40,000 accuracy values. We then applied statistical methods to see whether the differences were significant. Based on the statistical test, we found that in this particular case the linear kernel has significantly higher accuracy than the other kernels. However, there is a tradeoff: the results become more varied as a higher proportion of the data is used for training.
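
The figure of 40,000 accuracy values follows from the experimental grid: 4 kernels, 10 split ratios, and 1,000 repetitions each. The sketch below enumerates that grid and shows one way a seeded random split could be drawn; the helper `random_split` and the placeholder data are assumptions for illustration, and no classifier is trained here.

```python
import random
from itertools import product

KERNELS = ["radial", "linear", "polynomial", "sigmoid"]
RATIOS = [round(0.50 + 0.05 * i, 2) for i in range(10)]  # 0.50, 0.55, ..., 0.95
REPEATS = 1000


def random_split(items, train_ratio, rng):
    """Shuffle a copy of the items and cut it into training and testing parts."""
    shuffled = list(items)
    rng.shuffle(shuffled)
    cut = round(len(shuffled) * train_ratio)
    return shuffled[:cut], shuffled[cut:]


# One run per (kernel, ratio, repetition) combination.
runs = list(product(KERNELS, RATIOS, range(REPEATS)))
print(len(runs))  # 40000

# Placeholder data standing in for the labeled evaluations.
train, test = random_split(range(100), 0.8, random.Random(0))
print(len(train), len(test))  # 80 20
```

In a full experiment, each run would train an SVM with the given kernel on `train` and record its accuracy on `test`, producing one value per entry in `runs`.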

Detection Of Spam Comments On Instagram Using Complementary Naïve Bayes

IJCCS (Indonesian Journal of Computing and Cybernetics Systems) ◽ 10.22146/ijccs.47046 ◽ 2019 ◽ Vol 13 (3) ◽ pp. 263

Author(s): Nur Azizul Haqimi ◽ Nur Rokhman ◽ Sigit Priyanta

Keyword(s): Social Media ◽ Support Vector Machine ◽ Test Data ◽ Training Data ◽ Classification Method ◽ Support Vector ◽ Test Results ◽ Imbalanced Dataset ◽ Web Based ◽ F Measure

Instagram (IG) is a web-based and mobile social media application where users can share photos or videos using the available features. Uploaded photos or videos, with captions that explain them, can attract spam comments, that is, comments that are not relevant to the caption and photo. A problem that arises when identifying spam is that non-spam comments far outnumber spam comments, which leads to an imbalanced dataset. Dataset balance can influence the performance of a classification method. This research focuses on applying the Complementary Naïve Bayes (CNB) method to imbalanced datasets for the detection of Instagram spam comments. The study used TF-IDF weighting, with Support Vector Machine (SVM) as the comparison classifier. Based on the test results with 2500 training data and 100 test data on the imbalanced dataset (25% spam and 75% non-spam), CNB achieved 92% accuracy, 86% precision, and 93% f-measure, whereas SVM achieved 87% accuracy, 79% precision, and 88% f-measure. In conclusion, the CNB method is more suitable for detecting spam comments in cases of imbalanced datasets.
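
To see why accuracy alone is a weak yardstick on an imbalanced dataset, consider a trivial baseline that labels every comment as non-spam. The sketch below computes the usual metrics from confusion-matrix counts; treating spam as the positive class and assuming 25 spam and 75 non-spam comments in a 100-comment test set are illustrative assumptions, not figures confirmed by the abstract.

```python
def metrics(tp, fp, fn, tn):
    """Return (accuracy, precision, recall, f_measure) from confusion counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    return accuracy, precision, recall, f_measure


# Baseline that never predicts spam: all 25 spam comments are missed.
print(metrics(tp=0, fp=0, fn=25, tn=75))  # (0.75, 0.0, 0.0, 0.0)
```

The baseline reaches 75% accuracy while detecting no spam at all, which is why studies on imbalanced data report precision and f-measure alongside accuracy.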

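Studi Komparatif Metode Ekstraksi Fitur pada Analisis Sentimen Maskapai Penerbangan Menggunakan Support Vector Machine dan Maximum Entropy [Comparative Study of Feature Extraction Methods in Airline Sentiment Analysis Using Support Vector Machine and Maximum Entropy]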

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽ 10.29207/resti.v3i3.1159 ◽ 2019 ◽ Vol 3 (3) ◽ pp. 402-407 ◽ Cited By ~ 1

Author(s): Mona Cindo ◽ Dian Palupi Rini ◽ Ermatita

Keyword(s): Social Media ◽ Support Vector Machine ◽ Sentiment Analysis ◽ Maximum Entropy ◽ Entropy Method ◽ Support Vector ◽ Machine Method ◽ N Gram ◽ Almost All

Almost all companies use social media to improve their product services and to provide after-sales services that let customers review the quality of their products. Twitter has become an important source for tracking sentiment. Sentiment analysis is one of the most popular research topics today; with it, companies can analyze customer satisfaction in order to improve their services. This study analyzes airline sentiment with five different feature types (pragmatic, lexical n-gram, POS, sentiment, and LDA) using the Support Vector Machine and Maximum Entropy methods. The best result was obtained with the Maximum Entropy method using all extracted features, with an accuracy of 92.7%; the Support Vector Machine method reached an accuracy of 89.2%.
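
Of the five feature types, lexical n-grams are the simplest to illustrate. The following sketch extracts word n-grams with whitespace tokenization; the tokenization rule and the example sentence are assumptions, since the abstract does not describe the authors' preprocessing.

```python
def word_ngrams(text, n):
    """Return the list of contiguous n-word sequences in the text."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Invented example sentence, not taken from the study's data.
print(word_ngrams("flight was delayed again", 2))
# ['flight was', 'was delayed', 'delayed again']
```

Bigrams such as "was delayed" keep some word order that single-word features lose, which is one reason n-gram features are commonly combined with other feature types.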
