scholarly journals Application of Naïve Bayes Algorithm in Sentiment Analysis of Filipino, English and Taglish Facebook Comments

The World Wide Web has boosted its content for the past years, it has a vast amount of multimedia resources that continuously grow specifically in documentary data. One of the major contributors of documentary contents can be evidently found on the social media called Facebook. People or netizens on Facebook are actively sharing their opinion about a certain topic or posts that can be related to them or not. With the huge amount of accessible documentary data that are seen on the so-called social media, there are research trends that can be made by the researchers in the field of opinion mining. A netizen’s comment on a particular post can either be a negative or a positive one. This study will discuss the opinion or comment of a netizen whether it is positive or negative or how she/he feels about a specific topic posted on Facebook; this is can be measured by the use of Sentiment Analysis. The combination of the Natural Language Processing and the analytics in textual form is also known as Sentiment Analysis that is use to the extraction of data in a useful manner. This study will be based on the product reviews of Filipinos in Filipino, English and Taglish (mixed Filipino and English) languages. To categorize a comment effectively, the Naïve Bayes Algorithm was implemented to the developed web system.

Author(s):  
Mir Habeebullah Shah Quadri ◽  
R. K. Selvakumar

Both sellers and buyers heavily depend on the opinions of customers in purchasing and selling products online. When it comes to text-based data, sentiment analysis of user reviews has become a prominent facet of machine learning. Text data is generally unstructured which makes opinion mining very challenging. A wide array of pre-processing and post-processing techniques need to be applied. But the major challenge is selecting the right classifier for the job. Naïve Bayes algorithm is a commonly used machine learning classifier when it comes to opinion mining and sentiment analysis. The focus of this survey is to observe and analyze the performance of Naïve Bayes algorithm in sentiment analysis of user reviews online. Recent research from a wide array of use-cases such as sentiment analysis of movie reviews, product reviews, book reviews, blog posts, microblogs and other sources of data have been taken into account. The results show that Naïve Bayes algorithm performs exceptionally well with accuracies between 75% to 99% across the board.


2020 ◽  
Vol 1 (2) ◽  
pp. 61-66
Author(s):  
Febri Astiko ◽  
Achmad Khodar

This study aims to design a machine learning model of sentiment analysis on Indosat Ooredoo service reviews on social media twitter using the Naive Bayes algorithm as a classifier of positive and negative labels. This sentiment analysis uses machine learning to get patterns an model that can be used again to predict new data.


2019 ◽  
Vol 15 (2) ◽  
pp. 247-254
Author(s):  
Heru Sukma Utama ◽  
Didi Rosiyadi ◽  
Dedi Aridarma ◽  
Bobby Suryo Prakoso

Analysis of the odd even-numbered sentiment systems in Bekasi toll using the Naïve Bayes Algorithm, is a process of understanding, extracting, and processing textual data automatically from social media. The purpose of this study was to determine the level of accuracy, recall and precision of opinion mining generated using the Naïve Bayes algorithm to provide information community sentiment towards the effectiveness of the odd system of Bekasi tiolls on social media. The research method used in this study was to do text mining in comments-comments regarding posts regarding even odd oddities on Bekasi toll on Twitter, Instagram, Youtube and Facebook. The steps taken are starting from preprocessing, transformation, datamining and evaluation, followed by information gaon feature selection, select by weight and applying NB Algorithm model. The results obtained from the study using the NB model are obtained Confusion Matrix result, namely accuracy of 79,55%, Precision of 80,51%, and Sensitivity or Recall of 80,91%. Thus this study concludes that the use of Support Vector Machine Algorithms can analyze even odd sentiments on the Bekasi toll road.


2019 ◽  
Vol 5 (2) ◽  
pp. 227-234
Author(s):  
Riska Aryanti ◽  
Atang Saepudin ◽  
Eka Fitriani ◽  
Rifky Permana ◽  
Dede Firmansyah Saefudin

Congestion major cities in Indonesi caused by the proliferation of the use of private vehicles. Some expressing he thinks about busway user through the social media and other web site, This opinion can be used as a sentiment analysis to see if the user busway proposes a review of positive or negative. The results of the analysis sentiment can help in the sight of and evaluate the use of busway, also expected to improve and transjakarta facility from so they tend to have an opinion positive. Based on the results of the analysis, sentiment it is hoped people will switch to using the will of course will reduce congestion. In the study also added the stages preprocesing by using the framework gataframework to complete the process that cannot be done on tools rapidminer. The methodology that was used in this research was it is anticipated that analysis the sentiment of the by the application of an genetic algorithm for an election features with an algorithm naive bayes. From the results of the testing to the case in research it is found that classification algorithm naive bayes based genetic algorithm having the kind of accuracy that good enough 88,55 % and value of auc reached 0,813 % with the level of the diagnosis classifications good. So that in this research classification algorithm naive bayes based genetic algorithm can be recommended as algorithms classifications good enough to analyze the busway user sentimen. Based on analysis is expected to private transport users will switch to using the busway will reduce congestion


2020 ◽  
Vol 1 (1) ◽  
pp. 19-26
Author(s):  
Rakhmi Khalida ◽  
Siti Setiawati

Abstract   The Government of Indonesia took steps to change the system to improve public services in traffic violations by implementing the e-ticketing system. This system is a solution for disciplining motorized motorists from committing traffic violations. The existence of e-ticketing is also a solution to prevent the delinquency of law enforcers from illegal levies, peace terms in place, to accountability of fines. In this study, sentiment analysis of the e-ticketing system or opinion mining to classify the variety of public comments that give a positive, negative or neutral impression. Twitter social media is one of the objects to express opinions because it is user friendly, updated topics, and openly accesses tweets. Opinions on Twitter are collected, then the preprocessing stage is performed, then the selection of information gain features helps reduce noise caused by irrelevant labels, the next step is the classification of sentiments with the Naïve Bayes algorithm and finally polarity sentiments. This research resulted in an accuracy of 41.82%, a precision of 50.51% and a recall of 45.45%.   Keywords: Sentiment analysis, E-ticketing, Information Gain, Naive Bayes   Abstrak   Pemerintah Indonesia melakukan langkah perubahan untuk memperbaiki sistem pelayanan publik dalam pelanggaran berlalu-lintas yaitu dengan menerapkan sistem e-Tilang. Sistem ini menjadi solusi mendisiplinkan para pengendara kendaraan bermotor dari banyaknya melakukan pelanggaran berlalu-lintas. Keberadaan e-Tilang juga menjadi solusi mencegah kenakalan penegak hukum dari pungutan liar, istilah damai ditempat, hingga akuntabilitas uang denda. Dalam penelitian ini melakukan analisis sentimen tentang sistem e-Tilang atau opinion mining untuk mengelompokan ragam komentar masyarakat yang memberikan kesan positif, negatif atau netral. Media sosial Twitter menjadi salah satu objek untuk menyampaikan opini karena user friendly, topik ter-update, dan terbuka mengakses tweet. Opini pada twitter dikumpulkan, lalu dilakukan tahapan preprocessing, selanjutnya dengan seleksi fitur information gain membantu mengurangi noise yang disebabkan oleh label-label yang tidak relevan, tahap selanjutnya adalah klasifikasi sentimen dengan algoritma Naïve Bayes dan terakhir sentimen polarity. Penelitian ini menghasilkan accuracy 41,82%, presisi 50,51% dan recall 45,45%.   Kata kunci: Analisis sentimen, E-Tilang, Information Gain, Naive Bayes


Author(s):  
Amira M. Idrees ◽  
Fatma Gamal Eldin ◽  
Amr Mansour Mohsen ◽  
Hesham Ahmed Hassan

Every successful business aims to know how customers feel about its brands, services, and products. People freely express their views, ideas, sentiments, and opinions on social media for their day-to-day activities, for product reviews, for surveys, and even for their public opinions. This process provides a fortune of valuable resources about the market for any type of business. Unfortunately, it's impossible to manually analyze this massive quantity of information. Sentiment analysis (SA) and opinion mining (OM), as new fields of natural language processing, have the potential benefit of analyzing such a huge amount of data. SA or OM is the computational treatment of opinions, sentiments, and subjectivity of text. This chapter introduces the reader to a survey of different text SA and OM proposed techniques and approaches. The authors discuss in detail various approaches to perform a computational treatment for sentiments and opinions with their strengths and drawbacks.


Various fields like Text Mining, Linguistics, Decision Making and Natural Language Processing together form the basis for Opinion Mining or Sentiment Analysis. People share their feelings, observations and thoughts on social media, which has emerged as a powerful tool for rapidly growing enormous repository of real time discussions and thoughts shared by people. In this paper, we aim to decipher the current popular opinions or emotions from various sources, hence, contributing to sentiment analysis domain. Text from social media, blogs and product reviews are classified according to the sentiment they project. We re-examine the traditional processes of sentiment extraction, to incorporate the increase in complexity and number of the data sources and relevant topics, while re-populating the meaning of sentiment. Working across and within numerous streams of social media, expression of sentiment and classification of polarity is re-examined, thereby redefining and enhancing the realm of sentiment. Numerous social media streams are analyzed to build datasets that are topical for each stream and are later polarized according to their sentiment expression. In conclusion, defining a sentiment and developing tools for its analysis in real time of human idea exchange is the motive.


2021 ◽  
Vol 10 (3) ◽  
pp. 426-431
Author(s):  
Wiyanto Wiyanto ◽  
Zulita Setyaningsih

The Pandemic Covid-19  in Indonesia in 2020 had an impact on Termination of Employment (PHK), this has received various public opinions on social media. At a time when the poverty rate is high and unemployment increases every year, it becomes a factor of public disapproval of Termination of Employment (PHK). It is necessary to classify public opinion into a negative opinion or a positive opinion on this issue. The purpose of this study is to analyze the sentiment towards layoffs to determine negative or positive opinions using the Naïve Bayes algorithm by adding feature selection. The research stages consist of data collection, text preprocessing, feature selection, and application of algorithms. The testing process in this study uses the Rapid Miner application. The test results in this study using the Naive Bayes Algorithm, the accuracy value is 93.57% and for addition to the Naïve Bayes + PSO feature selection, the accuracy value is 93.71%. The best accuracy value in sentiment analysis of layoffs in the covid-19 pandemic is the addition of the PSO feature selection in the Naïve Bayes Algorithm, which is 0.14% better.


Sign in / Sign up

Export Citation Format

Share Document