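ANALISIS SENTIMEN PADA PEMERINTAHAN TERPILIH PADA PILPRES 2019 DI TWITTER MENGGUNAKAN ALGORITME NAÏVE BAYES [Sentiment Analysis of the Elected Government in the 2019 Presidential Election on Twitter Using the Naïve Bayes Algorithm]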

Abstract: The 2019 presidential election became one of the most discussed topics on Twitter. On social media, people voiced opinions about the pair of candidates they supported. This research predicts public sentiment toward the candidates for President and Vice President of the Republic of Indonesia. The data consist of tweets on the @jokowi Twitter account, collected with the Tweepy library in Python 2.7. Sentiment is classified into two classes, positive and negative. Models were built with five weighting schemes, Unigram, Bigram, Trigram, N-Gram (1-2), and N-Gram (1-3), using the Naïve Bayes algorithm in the Weka application, on a dataset of 646 sentences. The best results were obtained with Unigram weighting: 81.4% accuracy, 81.5% precision, and 81.3% recall, with a run time of 0.3 s.

Keywords: classification, naïve bayes, 2019 presidential election, twitter, unigram
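The five weighting schemes named in the abstract differ only in which word n-gram lengths are kept as features. The following is a minimal Python sketch of that idea, not the authors' Weka configuration: the function name `ngrams`, the mapping of scheme names to (min, max) lengths, and the example sentence are all assumptions made for illustration.

```python
def ngrams(tokens, n_min, n_max):
    """Return all contiguous word n-grams with n_min <= n <= n_max."""
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams

# Assumed mapping from the scheme names in the abstract to (min, max) lengths.
SCHEMES = {
    "Unigram": (1, 1),
    "Bigram": (2, 2),
    "Trigram": (3, 3),
    "N-Gram (1-2)": (1, 2),
    "N-Gram (1-3)": (1, 3),
}

# Made-up example sentence, not taken from the study's dataset.
tokens = "semoga sukses pak presiden".split()
features = {name: ngrams(tokens, lo, hi) for name, (lo, hi) in SCHEMES.items()}

for name, grams in features.items():
    print(name, grams)
```

A four-word sentence yields four unigrams, three bigrams, and two trigrams, so the combined schemes produce larger and sparser feature sets than Unigram alone.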

Application of Naïve Bayes Algorithm in Sentiment Analysis of Filipino, English and Taglish Facebook Comments

International Journal of Management and Humanities (regular issue), 2020, Vol 4 (5), pp. 73-77. DOI: 10.35940/ijmh.e0524.014520

Keyword(s): Social Media, Sentiment Analysis, Language Processing, Opinion Mining, Naïve Bayes, Product Reviews, Documentary Data, The Social, Bayes Algorithm

Content on the World Wide Web has grown rapidly in recent years; it holds a vast and continuously growing body of multimedia resources, particularly documentary data. One major source of such content is the social media platform Facebook, where users (netizens) actively share opinions about topics and posts, whether or not these relate to them directly. The large volume of accessible documentary data on social media opens research directions in opinion mining. A netizen's comment on a particular post can be either negative or positive. This study examines whether a netizen's comment on a specific Facebook post is positive or negative, that is, how she or he feels about the topic; this can be measured with sentiment analysis. Sentiment analysis combines natural language processing with text analytics to extract useful information from data. The study is based on product reviews written by Filipinos in Filipino, English, and Taglish (mixed Filipino and English). To categorize comments effectively, the Naïve Bayes algorithm was implemented in the developed web system.

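Pengaruh N-Gram terhadap Klasifikasi Buku menggunakan Ekstraksi dan Seleksi Fitur pada Multinomial Naïve Bayes [The Effect of N-Grams on Book Classification Using Feature Extraction and Selection with Multinomial Naïve Bayes]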

JURNAL MEDIA INFORMATIKA BUDIDARMA, 2021, Vol 5 (1), pp. 264. DOI: 10.30865/mib.v5i1.2672

Author(s): Esti Mulyani, Fachrul Pralienka Bani Muhamad, Kurnia Adi Cahyanto

Keyword(s): Naïve Bayes, Automatic Classification, Main Task, Test Results, Book Title, Feature Extraction And Selection, N Gram, Bayes Algorithm

A main task of libraries is processing library materials by classifying books according to a defined scheme. Dewey Decimal Classification (DDC) is the method most commonly used worldwide to classify (label) books in libraries. Its advantages are that it is universal and systematic. However, it becomes inefficient given the large number of books a library must classify, and because labels must track updates to the DDC. An automatic classification system can address this problem, and it can be built by applying text mining. In this study, words in book titles are turned into features with N-Grams (Unigram, Bigram, Trigram), and the generated features then undergo feature selection. Book titles are classified with the Multinomial Naïve Bayes algorithm. The study examines the effect of Unigram, Bigram, and Trigram features on book title classification using feature extraction and selection with Multinomial Naïve Bayes. The test results show that Unigram has the highest accuracy, 74.4%.
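To make the pipeline in this abstract concrete, here is a minimal sketch of Multinomial Naïve Bayes over title n-grams, written in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the class and function names, the add-one (Laplace) smoothing, and the toy titles and labels are all hypothetical, and the feature selection step described in the abstract is omitted.

```python
import math
from collections import Counter, defaultdict

def title_features(title):
    """Unigram and bigram features from a lower-cased, whitespace-split title."""
    tokens = title.lower().split()
    bigrams = [" ".join(tokens[i:i + 2]) for i in range(len(tokens) - 1)]
    return tokens + bigrams

class MultinomialNB:
    """Multinomial Naive Bayes with add-one smoothing (an assumed choice)."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        for feats, label in zip(docs, labels):
            self.feature_counts[label].update(feats)
        self.vocab = {f for counts in self.feature_counts.values() for f in counts}
        return self

    def predict(self, feats):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for f in feats:
                if f in self.vocab:  # features unseen in training are ignored
                    score += math.log((counts[f] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

# Hypothetical toy data; the real study uses DDC classes and a larger corpus.
titles = [
    ("introduction to linear algebra", "math"),
    ("calculus for engineers", "math"),
    ("history of ancient rome", "history"),
    ("a history of modern europe", "history"),
]
model = MultinomialNB().fit(
    [title_features(t) for t, _ in titles], [label for _, label in titles]
)
prediction = model.predict(title_features("linear algebra and calculus"))
print(prediction)
```

Switching between Unigram, Bigram, and Trigram experiments would only change `title_features`; the classifier itself stays the same.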

THE IMPLEMENTATION OF THE MACHINE LEARNING ALGORITHM FOR THE SENTIMENT ANALYSIS OF INDONESIA’S 2019 PRESIDENTIAL ELECTION

IIUM Engineering Journal, 2021, Vol 22 (1), pp. 78-92. DOI: 10.31436/iiumej.v22i1.1532

Author(s): GA Buntoro, R Arifin, GN Syaifuddiin, A Selamat, O Krejcar, ...

Keyword(s): Machine Learning, Social Media, Sentiment Analysis, Presidential Election, Naïve Bayes, Learning Algorithm, Machine Learning Algorithm, Presidential Candidates, N Gram

In 2019, citizens of Indonesia took part in the democratic process of electing a new president, vice president, and various legislative candidates. The 2019 Indonesian presidential election was very tense in terms of the candidates' campaigns in cyberspace, especially on social media sites such as Facebook, Twitter, Instagram, Google+, Tumblr, and LinkedIn. Indonesians used social media platforms to express positive, neutral, and negative opinions about the presidential candidates. Social media users campaigned for their chosen candidates, from regents, governors, and legislators up to presidential candidates, via the Internet and online media. The aim of this paper is therefore to conduct sentiment analysis on the candidates in the 2019 Indonesian presidential election based on Twitter datasets. The study used opinions expressed by Indonesians on Twitter with hashtags (#) containing "Jokowi and Prabowo." Pre-processing consisted of comment selection, data cleansing, text parsing, sentence normalization, and tokenization of the Indonesian-language text, followed by determination of class attributes. The Twitter posts were then classified with a Naïve Bayes Classifier (NBC) and a Support Vector Machine (SVM), with the goal of reaching the highest possible accuracy. The study helps the community examine opinions on Twitter that carry positive, neutral, or negative sentiment. Performing sentiment analysis on the candidates with these non-conventional processes saved cost, time, and effort. The combination of the SVM algorithm and alphabetic tokenization produced the highest accuracy, 79.02%, while the combination of the NBC algorithm and N-gram tokenization produced the lowest, 44.94%.
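The cleansing and tokenization steps listed in the abstract can be sketched with a few regular expressions. This is a simplified illustration, not the paper's pipeline: the function name `preprocess`, the specific cleaning rules (dropping URLs and @mentions), and the reading of "alphabetic tokenization" as "keep only runs of letters" are assumptions, and the example tweet is made up.

```python
import re

def preprocess(tweet):
    """Case-fold, strip URLs and @mentions, then keep alphabetic tokens only."""
    text = tweet.lower()
    text = re.sub(r"https?://\S+", " ", text)  # remove links
    text = re.sub(r"@\w+", " ", text)          # remove user mentions
    # Assumed meaning of alphabetic tokenization: digits, '#', and
    # punctuation act as separators and are discarded.
    return re.findall(r"[a-z]+", text)

# Made-up example tweet for illustration.
tokens = preprocess("Dukung @jokowi! #Pilpres2019 https://t.co/abc")
print(tokens)
```

The resulting token list is what a classifier such as NBC or SVM would receive after vectorization.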

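Klasifikasi Rating Otomatis pada Dokumen Teks Ulasan Produk Elektronik Menggunakan Metode N-gram dan Naïve Bayes [Automatic Rating Classification of Electronic Product Review Text Documents Using the N-gram and Naïve Bayes Methods]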

Jurnal Informatika Universitas Pamulang, 2020, Vol 5 (3), pp. 295. DOI: 10.32493/informatika.v5i3.6110

Author(s): Rahmawan Bagus Trianto, Andri Triyono, Dhika Malita Puspita Arum

Keyword(s): Feature Extraction, Naïve Bayes, Automatic Classification, Lack Of Information, N Gram, Bayes Algorithm, Online Product Ratings, Product Description

Online product feedback usually consists of a descriptive review and a numeric rating, as is the case at the Lazada online store. A descriptive review can give other potential buyers a clearer picture than a rating alone. In practice, however, the descriptive review and the rating given often do not match, which leaves both sellers and potential buyers with too little information. This study proposes automatic classification of buyers' descriptive reviews so that descriptive reviews and ratings agree. The classification uses the Naive Bayes algorithm with n-gram feature extraction and TF-IDF word weighting. The best results were obtained with Bigram feature extraction: 94.06% accuracy, 91.73% recall, and 90.71% precision. With this accuracy, the model can serve as a reference for classifying product description reviews, so that feedback between sellers and buyers works well.
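TF-IDF weighting, mentioned in this abstract, scales each term's count by how rare the term is across documents. The sketch below uses one common variant, raw term frequency times log(N / document frequency); the abstract does not say which variant the authors used, so the formula, the function name `tfidf`, and the toy reviews are assumptions.

```python
import math
from collections import Counter

def tfidf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(doc))  # count each term once per document
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights

# Toy reviews, invented for illustration; tokens could equally be bigrams.
reviews = [
    ["good", "product"],
    ["bad", "product"],
    ["good", "good", "seller"],
]
weights = tfidf(reviews)
print(weights[2])
```

Terms that occur in few reviews, such as "seller" here, receive a higher weight per occurrence than terms spread across many reviews.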

Indonesian language email spam detection using N-gram and Naïve Bayes algorithm

Bulletin of Electrical Engineering and Informatics, 2020, Vol 9 (5), pp. 2012-2019. DOI: 10.11591/eei.v9i5.2444

Author(s): Yustinus Vernanda, Seng Hansun, Marcel Bonar Kristanda

Keyword(s): Data Exchange, Naïve Bayes, Bayesian Filtering, Spam Filter, N Gram, Bayes Algorithm, Rest Api, Email Spam, F Measure

Indonesia ranks eighth among the world's countries as a source of spam. A web-based spam filter service exposed as a REST API can detect Indonesian-language email spam on an email server or in various email server applications. Through the REST API, applications exchange data in JSON format using standard HTTP methods. A commonly used type of spam filter is Bayesian filtering, in which the Naïve Bayes algorithm serves as the classifier. In this study, the N-gram method is used to increase the accuracy of the Naïve Bayes implementation. The combined N-gram and Naïve Bayes approach was successfully implemented, with accuracy ranging from 0.615 to 0.94, precision from 0.566 to 0.924, recall from 0.96 to 1.00, and F-measure from 0.721 to 0.942. The best configuration is the 5-gram method, with accuracy 0.94, precision 0.924, recall 0.96, and F-measure 0.942.
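The reported best F-measure can be cross-checked against the reported precision and recall, assuming the abstract uses the standard balanced F-measure (the harmonic mean of the two). The function name below is chosen for illustration.

```python
def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

# Precision and recall reported for the 5-gram configuration.
best = f_measure(0.924, 0.96)
print(round(best, 3))
```

Under that assumption the computed value rounds to 0.942, matching the figure reported for the 5-gram method.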

CLASSIFICATION OF CUSTOMER COMPLAINTS ON INSTAGRAM COMMENTS USING NAÏVE BAYES ALGORITHM WITH N-GRAM FEATURE EXTENSION

Jurnal Techno Nusa Mandiri, 2020, Vol 17 (2), pp. 109-116. DOI: 10.33480/techno.v17i2.1632

Author(s): Fachri Amsury, Nanang Ruhyana, Irwansyah Saputra, Daning Nur Sulistyowati

Keyword(s): Social Media, Text Mining, Electronic Mail, Naïve Bayes, Customer Complaints, N Gram, Bayes Algorithm, And Performance, Usual Process

Customer complaints can serve as a form of self-evaluation for a company: from them, the company can identify its weaknesses and fix them. Complaints are now submitted in many ways, not only by telephone but also by e-mail, through online forums that companies create to collect complaints, suggestions, and criticism from consumers, and especially on social media, where users freely express opinions about the delivery services they use. Instagram is a social medium oriented toward images, but it also carries text in captions and comments. This motivates a study of customer complaints from shipping service users on the Instagram account of a delivery service company. The proposed solution is text mining classification using Naïve Bayes with the SMOTE technique and N-Gram feature extraction, following the usual text mining process. Naïve Bayes with SMOTE reached an accuracy of 88.54% before N-Gram was applied; with N-Gram features, accuracy rose by 1.44 percentage points to 89.98%, on a dataset of 776 preprocessed Instagram comment records.
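SMOTE, named in this abstract, balances classes by creating synthetic minority-class samples on the line segments between a minority sample and its nearest minority neighbours. The following is a simplified sketch of that interpolation idea only, not the tool or settings the authors used: the function name `smote_like`, the parameters `k` and `seed`, and the 2-D toy points are all assumptions.

```python
import random

def smote_like(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a randomly
    chosen minority point and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of base by squared Euclidean distance,
        # excluding base itself.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, other)))
    return synthetic

# Hypothetical 2-D feature vectors for the minority class.
minority = [(1.0, 0.0), (2.0, 1.0), (3.0, 0.0)]
new_points = smote_like(minority, n_new=4)
print(new_points)
```

Because each synthetic point lies between two real minority points, it stays inside the region the minority class already occupies. In a text setting the vectors would be n-gram or TF-IDF features, not 2-D points.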

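Algoritma Multinomial Naïve Bayes Untuk Klasifikasi Sentimen Pemerintah Terhadap Penanganan Covid-19 Menggunakan Data Twitter [Multinomial Naïve Bayes Algorithm for Classifying Sentiment Toward the Government's Handling of Covid-19 Using Twitter Data]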

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 2021, Vol 5 (4), pp. 820-826. DOI: 10.29207/resti.v5i4.3146

Author(s): Yuyun, Nurul Hidayah, Supriadi Sahibu

Keyword(s): Social Media, Naïve Bayes, Confusion Matrix, Weighted Average, Public Sentiment, The Social, Bayes Algorithm, Special Approach, Class Labels

Information about Covid-19 is currently spreading rapidly, not only through electronic media but also through user posts on social media. Because the text users post varies greatly, a special approach is needed to classify these posts. This research aims to classify public sentiment toward the handling of COVID-19. The data were obtained from the social media application Twitter. The study uses a variant of the Naïve Bayes algorithm, Multinomial Naïve Bayes, to optimize the classification results. Three class labels are used: positive, negative, and neutral. The process starts with text preprocessing (cleaning, case folding, tokenization, filtering, and stemming), followed by TF-IDF weighting. The classification results are evaluated with a confusion matrix, measuring accuracy, precision, and recall. The tests give a weighted average of 74% for precision, recall, and accuracy, which the authors describe as a fair level of classification performance.
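The evaluation step can be illustrated by computing accuracy and support-weighted precision and recall from a three-class confusion matrix. This is a sketch under assumptions: the function name `weighted_metrics` is chosen for illustration and the matrix values are invented, not the paper's results.

```python
def weighted_metrics(matrix):
    """matrix[i][j] = number of items of true class i predicted as class j.
    Returns accuracy plus precision and recall averaged over classes,
    weighted by each class's support (its number of true items)."""
    n_classes = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n_classes))
    precision = recall = 0.0
    for i in range(n_classes):
        support = sum(matrix[i])
        predicted = sum(matrix[r][i] for r in range(n_classes))
        p_i = matrix[i][i] / predicted if predicted else 0.0
        r_i = matrix[i][i] / support if support else 0.0
        precision += p_i * support / total
        recall += r_i * support / total
    return {"accuracy": correct / total, "precision": precision, "recall": recall}

# Invented counts for the classes positive, negative, neutral (in that order).
confusion = [
    [8, 1, 1],
    [2, 6, 2],
    [1, 1, 8],
]
metrics = weighted_metrics(confusion)
print(metrics)
```

With support weighting, the weighted recall equals accuracy by construction, which is consistent with the abstract reporting the same value for both.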

Sentiment Analysis Of Government Policy On Corona Case Using Naive Bayes Algorithm

IJCCS (Indonesian Journal of Computing and Cybernetics Systems), 2021, Vol 15 (1), pp. 55. DOI: 10.22146/ijccs.60718

Author(s): Auliya Rahman Isnain, Nurman Satya Marga, Debby Alita

Keyword(s): Machine Learning, Feature Extraction, Naïve Bayes, Text Processing, New Normal, Public Sentiment, N Gram, Bayes Algorithm, Economic Stabilization

The Indonesian government enforced the New Normal rule to maintain economic stability while restraining the spread of the virus during the Covid-19 pandemic. The policy became a hot topic on Twitter, drawing both positive and negative opinions. This research applies text mining and text processing with machine learning, using the Naive Bayes Classifier. The objective is to determine whether public sentiment toward the New Normal policy is positive or negative, and to measure the performance of TF-IDF feature extraction and N-gram features with the Naive Bayes method. With TF-IDF, Naive Bayes reached 81% accuracy, 78% precision, 91% recall, and 84% F1-score. The highest results came from Naive Bayes with trigram features: 84% accuracy, 84% precision, 86% recall, and 85% F1-score. Naive Bayes with trigram feature extraction thus performs fairly well in classifying public tweet data.

Emotion Identification between POMS and Multinomial Naive Bayes Algorithm Using Twitter API

International Journal of Computer Sciences and Engineering, 2019, Vol 7 (7), pp. 14-19. DOI: 10.26438/ijcse/v7i7.1419. Cited By ~ 1

Author(s): Asharani S Dandoti, Sunil M Sangve

Keyword(s): Naïve Bayes, Emotion Identification, Bayes Algorithm

Algorithm Comparation of Naive Bayes and Support Vector Machine based on Particle Swarm Optimization in Sentiment Analysis of Freight Forwarding Services

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 2020, Vol 4 (2), pp. 362-369. DOI: 10.29207/resti.v4i2.1840

Author(s): Sharazita Dyah Anggita, Ikmah

Keyword(s): Support Vector Machine, Sentiment Analysis, Naïve Bayes, Support Vector, The Public, Svm Algorithm, Bayes Algorithm, Freight Forwarding, Improved Accuracy

Public demand for freight forwarding services is growing along with online marketplaces. Users express opinions about these services through many channels, one of which is Twitter. Sentiment analysis shows whether an opinion leans positive or negative. Methods applicable to sentiment analysis include the Naive Bayes algorithm and the Support Vector Machine (SVM). This research implements both algorithms, each optimized with Particle Swarm Optimization (PSO), and tests them by tuning the PSO parameters for each classifier. PSO-based optimization increased the accuracy of Naive Bayes by 15.11%, and that of SVM with the sigmoid kernel by 1.74%.
