Sentiment Analysis System for Myanmar News using K Nearest Neighbor and Naïve Bayes

Sentiment Analysis about E-Commerce from Tweets Using Decision Tree, K-Nearest Neighbor, and Naïve Bayes

2018 International Conference on Orange Technologies (ICOT) ◽

10.1109/icot.2018.8705796 ◽

2018 ◽

Cited By ~ 2

Author(s):

Achmad Bayhaqy ◽

Sfenrianto Sfenrianto ◽

Kaman Nainggolan ◽

Emil R. Kaburuan

Keyword(s):

Decision Tree ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

K Nearest Neighbor

Download Full-text

Analisis Sentimen Masyarakat Terhadap COVID-19 Pada Media Sosial Twitter

Journal of Dinda : Data Science, Information Technology, and Data Analytics ◽

10.20895/dinda.v1i1.180 ◽

2021 ◽

Vol 1 (1) ◽

pp. 42-51

Author(s):

Ardianne Luthfika Fairuz ◽

Rima Dias Ramadhani ◽

Nia Annisa Ferani Tanjung

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Sentiment Analysis ◽

Language Processing ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

K Nearest Neighbor

Akhir tahun 2019 lalu dunia digemparkan oleh munculnya suatu penyakit yang disebabkan oleh virus SARS-CoV-2 yang merupakan jenis virus terbaru dari coronavirus. Penyakit ini dikenal dengan nama COVID-19. Penyebaran penyakit ini terbilang cukup luas dan cepat. Dalam waktu singkat penyakit ini mulai menyebar ke segala penjuru dunia tak terkecuali Indonesia. Dengan tingkat penyebaran yang begitu tinggi dan belum ditemukannya vaksin untuk COVID-19, menyebabkan kekacauan di tengah masyarakat. Hal ini mempengaruhi banyak sektor kehidupan masyarakat. Tak sedikit masyarakat yang aktif bersosial media dan menuliskan pendapat, opini serta pemikirannya di platform media sosial seperti Twitter. Terjadinya pandemi ini mendorong masyarakat untuk menuliskan opini, pemikiran serta pendapatnya terhadap COVID-19 pada media sosial Twitter. Dibutuhkan suatu model sentiment analysis untuk mengklasifikasi tweet masyarakat di Twitter menjadi positif dan negatif. Sentiment analysis merupakan bagian dari Natural Language Processing yang membuat sebuah sistem guna mengenali serta mengekstraksi opini dalam bentuk teks. Pada penelitian ini digunakan algoritma Naive Bayes dan K-Nearest Neighbor untuk digunakan dalam membangun model sentiment analysis terhadap tweet pengguna Twitter terhadap COVID-19. Didapatkan akurasi sebesar 85% untuk algoritma Naïve Bayes dan 82% untuk algoritma K-Nearest Neighbor pada nilai k=6, 8, dan 14.

Download Full-text

Sentiment Analysis of the Enforcement of PSBB Part II in Jakarta

JELIKU (Jurnal Elektronik Ilmu Komputer Udayana) ◽

10.24843/jlk.2020.v09.i02.p13 ◽

2020 ◽

Vol 9 (2) ◽

pp. 259

Author(s):

Gede Putra Aditya Brahmantha ◽

I Wayan Santiyasa

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Web Crawling ◽

K Nearest Neighbor ◽

Bayes Method ◽

The Public ◽

Naive Bayes Method

In addition to communicating, Social Media is a place to issue opinions by the public on many things that are currently taking place, Twitter is one of these social medias that is widely used in conveying opinions regardless of whether these opinions are negative, positive, or even neutral. Tweets data about the Enforcement of PSBB Part II in Jakarta were obtained as many as 200 opinions using web crawling then advanced to the preprocessing stage before being classified using the K-Nearest Neighbor and Multinomial Naive Bayes algorithms. In 3 tests, the highest accuracy was 65.00% for K-Nearest Neighbor and the highest accuracy was 85.00% for Multinomial Naive Bayes method.

Download Full-text

Sentiment Analysis of Tweets Using Naïve Bayes, KNN, and Decision Tree

International Journal of Organizational and Collective Intelligence ◽

10.4018/ijoci.2020100103 ◽

2020 ◽

Vol 10 (4) ◽

pp. 35-49

Author(s):

Kadda Zerrouki ◽

Reda Mohamed Hamou ◽

Abdellatif Rahmoun

Keyword(s):

Decision Tree ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

K Nearest Neighbor ◽

Use Of Social Media ◽

The Masses

Making use of social media for analyzing the perceptions of the masses over a product, event, or a person has gained momentum in recent times. Out of a wide array of social networks, the authors chose Twitter for their analysis as the opinions expressed there are concise and bear a distinctive polarity. Sentiment analysis is an approach to analyze data and retrieve sentiment that it embodies. The paper elaborately discusses three supervised machine learning algorithms—naïve bayes, k-nearest neighbor (KNN), and decision tree—and compares their overall accuracy, precision, as well as recall values, f-measure, number of tweets correctly classified, number of tweets incorrectly classified, and execution time.

Download Full-text

Sentiment analysis on film review in Gujarati language using machine learning

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v12i1.pp1030-1039 ◽

2022 ◽

Vol 12 (1) ◽

pp. 1030

Author(s):

Parita Shah ◽

Priya Swaminarayan ◽

Maitri Patel

Keyword(s):

Sentiment Analysis ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

K Nearest Neighbor ◽

K Nearest Neighbors ◽

Word Level ◽

Document Frequency ◽

Accuracy Result ◽

Gujarati Language

<span>Opinion analysis is by a long shot most basic zone of characteristic language handling. It manages the portrayal of information to choose the motivation behind the wellspring of the content. The reason might be of a type of gratefulness (positive) or study (negative). This paper offers a correlation between the outcomes accomplished by applying the calculation arrangement using various classifiers for instance K-nearest neighbor and multinomial naive Bayes. These techniques are utilized to assess a significant assessment with either a positive remark or negative remark. The gathered information considered on the grounds of the extremity film datasets and an association with the results accessible proof has been created for a careful assessment. This paper investigates the word level count vectorizer and term frequency inverse document frequency (TF-IDF) influence on film sentiment analysis. We concluded that multinomial Naive Bayes (MNB) classier generate more accurate result using TF-IDF vectorizer compared to CountVectorizer, K-nearest-neighbors (KNN) classifier has the same accuracy result in case of TF-IDF and CountVectorizer.</span>

Download Full-text

Sentiment Analysis About Indonesian Lawyers Club Television Program Using K-Nearest Neighbor, Naïve Bayes Classifier, And Decision Tree

International Journal of New Media Technology ◽

10.31937/ijnmt.v8i1.1965 ◽

2021 ◽

Vol 8 (1) ◽

pp. 50-56

Author(s):

Nico Nathanael Wilim ◽

Raymond Sunardi Oetama

Keyword(s):

Decision Tree ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

K Nearest Neighbor ◽

Naïve Bayes Classifier ◽

Talk Show

Indonesia Lawyers Club (ILC) is a talk show on TVOne that discusses topics around public phenomena, legal issues, crime, and other similar topics. In 2018, ILC won the Panasonic Gobel Awards as the best news talk show program. But in 2019, ILC failed to win the award which was won by Mata Najwa which featured a talk show event that appeared on Trans7. As one of the television shows that has won awards, ILC has pros and cons for its shows from the public. This study applies a sentiment analysis approach to examine public opinion on Twitter about Mata Najwa and ILC in 2018 and 2019. This study applies K-Nearest Neighbor, Naïve Bayes Classifier, and Decision Tree classification algorithm to validate the result. The contribution of this study is to show that public opinion on Twitter can be examined to figure out community sentiment on a tv talk show as well as to confirm the Award winner of tv Talkshow. Index Terms—datamining; Decision Tree; K-NN; Naïve Bayes Classifier; sentiment analysis

Download Full-text

Spatial modeling and susceptibility zonation of landslides using random forest, naïve bayes and K-nearest neighbor in a complicated terrain

Earth Science Informatics ◽

10.1007/s12145-021-00653-y ◽

2021 ◽

Author(s):

Sherif Ahmed Abu El-Magd ◽

Sk Ajim Ali ◽

Quoc Bao Pham

Keyword(s):

Random Forest ◽

Spatial Modeling ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

K Nearest Neighbor

Download Full-text

Combination of Naïve Bayes Classifier and K-Nearest Neighbor (cNK) in the Classification Based Predictive Models

Computer and Information Science ◽

10.5539/cis.v6n3p48 ◽

2013 ◽

Vol 6 (3) ◽

Cited By ~ 5

Author(s):

Elma Zannatul Ferdousy ◽

Md. Mafijul Islam ◽

M. Abdul Matin

Keyword(s):

Predictive Models ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

K Nearest Neighbor ◽

Naïve Bayes Classifier

Download Full-text

Performance of Naïve Bayes, C4.5 and KNN using Breast Cancer, Iris and Hypothyroid Datasets

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8795.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 2193-2197

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Specific Pattern ◽

K Nearest Neighbor ◽

Data Mining Technique ◽

Digital Format ◽

Tree Classifier

Data mining usually specifies the discovery of specific pattern or analysis of data from a large dataset. Classification is one of an efficient data mining technique, in which class the data are classified are already predefined using the existing datasets. The classification of medical records in terms of its symptoms using computerized method and storing the predicted information in the digital format is of great importance in the diagnosis of various diseases in the medical field. In this paper, finding the algorithm with highest accuracy range is concentrated so that a cost-effective algorithm can be found. Here the data mining classification algorithms are compared with their accuracy of finding exact data according to the diagnosis report and their execution rate to identify how fast the records are classified. The classification technique based algorithms used in this study are the Naive Bayes Classifier, the C4.5 tree classifier and the K-Nearest Neighbor (KNN) to predict which algorithm is the best suited for classifying any kind of medical dataset. Here the datasets such as Breast Cancer, Iris and Hypothyroid are used to predict which of the three algorithms is suitable for classifying the datasets with highest accuracy of finding the records of patients with the particular health problems. The experimental results represented in the form of table and graph shows the performance and the importance of Naïve Bayes, C4.5 and K-Nearest Neighbor algorithms. From the performance outcome of the three algorithms the C4.5 algorithm is a lot better than the Naïve Bayes and the K-Nearest Neighbor algorithm.

Download Full-text

RB-Bayes algorithm for the prediction of diabetic in Pima Indian dataset

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i6.pp4866-4872 ◽

2019 ◽

Vol 9 (6) ◽

pp. 4866

Author(s):

Rajni Rajni ◽

Amandeep Amandeep

Keyword(s):

Nearest Neighbor ◽

Naive Bayes ◽

Early Stage ◽

Human Life ◽

Naïve Bayes ◽

Support Vector ◽

Pima Indians ◽

K Nearest Neighbor ◽

Fast Pace ◽

Bayes Algorithm

<p>Diabetes is a major concern all over the world. It is increasing at a fast pace. People can avoid diabetes at an early stage without any test. The goal of this paper is to predict the probability of whether the person has a risk of diabetes or not at an early stage. This would lead to having a great impact on their quality of human life. The datasets are Pima Indians diabetes and Cleveland coronary illness and consist of 768 records. Though there are a number of solutions available for information extraction from a huge datasets and to predict the possibility of having diabetes, but the accuracy of their mining process is far from accurate. For achieving highest accuracy, the issue of zero probability which is generally faced by naïve bayes analysis needs to be addressed suitably. The proposed framework RB-Bayes aims to extract the required information with high accuracy that could survive the problem of zero probability and also configure accuracy with other methods like Support Vector Machine, Naive Bayes, and K Nearest Neighbor. We calculated mean to handle missing data and calculated probability for yes (positive) and no (negative). The highest value between yes and no decide the value for the tuple. It is mostly used in text classification. The outcomes on Pima Indian diabetes dataset demonstrate that the proposed methodology enhances the precision as a contrast with other regulated procedures. The accuracy of the proposed methodology large dataset is 72.9%.</p>

Download Full-text