Tweet Netizen Prediction Using Random Forest, Decision Tree, Naïve Bayes, And Ensemble Algorithm (Case Study The Governor Of DKI Jakarta)

Abstract — The current Governor of DKI Jakarta, even though he has been elected since 2017 is always interesting to talk about or even comment on. Comments that appear come from the media directly or through social media. Twitter has become one of the social media that is often used as a media to comment on elected governors and can even become a trending topic on Twitter social media. Netizens who comment are also varied, some are always Tweeting criticism, some are commenting Positively, and some are only re-Tweeting. In this research, a prediction of whether active Netizens will tend to always lead to Positive or Negative comments will be carried out in this study. Model algorithms used are Decision Tree, Naïve Bayes, Random Forest, and also Ensemble. Twitter data that is processed must go through preprocessing first before proceeding using Rapidminer. In trials using Rapidminer conducted in four trials by dividing into two parts, namely testing data and training data. Comparisons made are 10% testing data: 90% Training data, then 20% testing data: 80% training data, then 30% testing data: 70% training data, and the last is 35% testing data: 65% training data. The average Accuracy for the Decision Tree algorithm is 93.15%, while for the Naïve Bayes algorithm the Accuracy is 91.55%, then for the Random Forest algorithm is 93.41, and the last is the Ensemble algorithm with an Accuracy of 93, 42%. here. Keywords — Decision Tree, Naïve Bayes, Random Forest, Set, Twitter.

Download Full-text

Perbandingan Akurasi, Recall, dan Presisi Klasifikasi pada Algoritma C4.5, Random Forest, SVM dan Naive Bayes

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v5i2.2937 ◽

2021 ◽

Vol 5 (2) ◽

pp. 640

Author(s):

Mulkan Azhari ◽

Zakaria Situmorang ◽

Rika Rosnelly

Keyword(s):

Random Forest ◽

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Random Forest Algorithm ◽

Svm Algorithm ◽

Testing Data ◽

C4.5 Algorithm ◽

Bayes Algorithm ◽

Using Data

In this study aims to compare the performance of several classification algorithms namely C4.5, Random Forest, SVM, and naive bayes. Research data in the form of JISC participant data amounting to 200 data. Training data amounted to 140 (70%) and testing data amounted to 60 (30%). Classification simulation using data mining tools in the form of rapidminer. The results showed that . In the C4.5 algorithm obtained accuracy of 86.67%. Random Forest algorithm obtained accuracy of 83.33%. In SVM algorithm obtained accuracy of 95%. Naive Bayes' algorithm obtained an accuracy of 86.67%. The highest algorithm accuracy is in SVM algorithm and the smallest is in random forest algorithm

Download Full-text

Sentiment Analysis of Social Media Users Using Naïve Bayes, Decision Tree, Random Forest Algorithm: A Case Study of Draft Law on the Elimination of Sexual Violence (RUU PKS)

2019 International Conference on Sustainable Engineering and Creative Computing (ICSECC) ◽

10.1109/icsecc.2019.8907228 ◽

2019 ◽

Author(s):

Khalisa Virra ◽

Rachmadita Andreswari ◽

Muhammad Azani Hasibuan

Keyword(s):

Social Media ◽

Random Forest ◽

Decision Tree ◽

Sexual Violence ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Random Forest Algorithm

Download Full-text

Penerapan Metode Naive Bayes dalam Memprediksi Kepuasan Mahasiswa Terhadap Cara Pengajaran Dosen

Prosiding Seminar Nasional Riset Information Science (SENARIS) ◽

10.30645/senaris.v1i0.34 ◽

2019 ◽

Vol 1 ◽

pp. 287

Author(s):

Desi Ratna Sari ◽

Dedy Hartama ◽

Irfan Sudahri Damanik ◽

Anjar Wanto

Keyword(s):

Higher Education ◽

Student Satisfaction ◽

Teaching Methods ◽

Naive Bayes ◽

Teaching Method ◽

Higher Education Institutions ◽

Naïve Bayes ◽

Training Data ◽

Testing Data ◽

Bayes Algorithm

This research aims to classify in determining student satisfaction with teaching methods at STIKOM Tunas Bangsa. Data obtained from the results of the 2015 and 2016 semester student questionnaires were odd, with a sample of 80 students. Attributes used are 4, namely communication (C1), Building learning atmosphere (C2), Assessment of students (C3) and delivery of material (C4). The method used in this study is the Naïve Bayes Algorithm and is processed using RapidMiner studio 5.3 software to determine student satisfaction with teaching methods. Training data used 100 data while testing data used in manual calculations as much as 5 data. From the results of data testing the five data expressed satisfaction with the way teaching lecturers at STIKOM Tunas Bangsa. While the training data that is processed with RapidMiner has an accuracy of 92.00%. With this analysis, it is expected to be able to help higher education institutions to evaluate the performance of lecturers, especially in evaluating one of the three triharma colleges, namely the teaching method of lecturers.

Download Full-text

Sentiment Analysis of Social Media Twitter with Case of Anti-LGBT Campaign in Indonesia using Naïve Bayes, Decision Tree, and Random Forest Algorithm

Procedia Computer Science ◽

10.1016/j.procs.2019.11.181 ◽

2019 ◽

Vol 161 ◽

pp. 765-772 ◽

Cited By ~ 6

Author(s):

Veny Amilia Fitri ◽

Rachmadita Andreswari ◽

Muhammad Azani Hasibuan

Keyword(s):

Social Media ◽

Random Forest ◽

Decision Tree ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Random Forest Algorithm

Download Full-text

Knowing Personality Traits on Facebook Status Using the Naïve Bayes Classifier

International Journal of Artificial Intelligence & Robotics (IJAIR) ◽

10.25139/ijair.v2i1.2636 ◽

2020 ◽

Vol 2 (1) ◽

pp. 22

Author(s):

Mohammad Zoqi Sarwani ◽

Muhammad Shubkhan Salafudin ◽

Dian Ahkam Sani

Keyword(s):

Social Media ◽

Big Five ◽

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Surrounding Environment ◽

Testing Data

With the development of social media trends among students by using Facebook social media, students can communicate and pour out everything that is felt in the form of status. Personality is the character or various characters of a person - therefore, how a person to adjust to the surrounding environment for the achievement of communication smoothly. In the personality category, many things classify a person's category in the psychologist theory. In this exercise, the Big Five, the psychologist theory, is described in five codes, namely Openness, Conscientiousness, Extraversion, Agreeables, Neuroticism. Naive Bayes Classifier is used to determine the highest probability value with the aim to determine the highest value. The data used are two namely training data and testing data obtained from the Facebook status of students. From the data obtained can be tested in the system that the accuracy value is 88%.

Download Full-text

Prediksi Angka Kelahiran Bayi Pada Desa Tridaya Sakti Dengan Menggunakan Algoritma Naive Bayes

Journal of Students‘ Research in Computer Science ◽

10.31599/jsrcs.v1i2.423 ◽

2020 ◽

Vol 1 (2) ◽

pp. 77-88

Author(s):

Nur Isnaini Parihah ◽

Sari Hartini ◽

Juarni Siregar

Keyword(s):

Data Mining ◽

Population Growth ◽

Naive Bayes ◽

Large Population ◽

Naïve Bayes ◽

Training Data ◽

Birth Rates ◽

Testing Data ◽

Bayes Algorithm ◽

Infant Birth

The birth rate is something that can affect the increase in population growth. Large population is a burden for development. According to Malthus's Theory which states that a large population growth is not the welfare that is obtained but rather poverty will be encountered if the population is not well controlled. The number of baby births in Tridaya Sakti Village is increasing every year. Therefore Data Mining using the Naive Bayes algorithm can help in the calculation of predicting infant birth rates in Tridaya Sakti Village. Data Mining in predicting the number of infant birth rates aims to determine the number of infant birth rates for the coming year using the Naive Bayes algorithm. By looking at the prediction patterns of each variable and testing training data on testing data. It is hoped that the Naive Bayes algorithm can solve the problem in Tridaya Sakti Village in handling and overcoming the calculation of infant birth rates and can help the Tridaya Sakti Village in regulating population growth in the coming years. The results obtained from the data that have been taken and calculated by Data Mining using the Naive Bayes algorithm produce an information that can be used as a reference to find out the number of births. Performance and time in data processing are more effective and efficient as well as more accurate and accurate predictions of the number of baby births. Keywords: Naive Bayes, Birth of a Baby, Prediction Abstrak Angka kelahiran merupakan suatu hal yang dapat mempengaruhi peningkatan pertumbuhan penduduk. Jumlah penduduk yang besar merupakan beban bagi pembangunan. Menurut Teori Malthus yang menyatakan bahwa pertumbuhan jumlah penduduk yang besar bukanlah kesejahteraan yang didapat tapi justru kemelaratan akan ditemui bilamana jumlah penduduk tidak dikendalikan dengan baik. Jumlah angka kelahiran bayi di Desa Tridaya Sakti setiap tahunnya semakin bertambah. Maka dari itu Data Mining dengan menggunakan algoritman Naive Bayes dapat membantu dalam perhitungan memprediksi angka kelahiran bayi di Desa Tridaya Sakti. Data Mining dalam memprediksi jumlah angka kelahiran bayi bertujuan untuk mengetahui jumlah angka kelahiran bayi tahun yang akan mendatang mengunakan algoritma Naive Bayes. Dengan melihat pola prediksi dari setiap variabel dan melakukan pengujian data training terhadap data testing. Diharapkan algoritma Naive Bayes ini dapat menyelesaikan permasalahan di Desa Tridaya Sakti dalam menangani dan mengatasi perhitungan angka kelahiran bayi dan dapat membantu pihak Desa Tridaya Sakti dalam mengatur pertumbuhan jumlah penduduk tahun yang akan mendatang. Hasil yang diperoleh dari data yang sudah diambil dan dihitung dengan Data Mining mengunakan algoritam Naive Bayes menghasilkan sebuah informasi yang dapat digunakan sebagai acuan untuk mengetahui jumlah angka kelahiran bayi. Kinerja dan waktu dalam proses pengolahan data lebih efektif dan efesien serta dari prediksi jumlah kelahiran bayi lebih tepat dan akurat. Kata Kunci: Naive Bayes, Kelahiran Bayi, Prediks

Download Full-text

Tomato Maturity Classification using Naive Bayes Algorithm and Histogram Feature Extraction

Journal of Applied Intelligent System ◽

10.33633/jais.v3i1.1988 ◽

2018 ◽

Vol 3 (1) ◽

pp. 39-48 ◽

Cited By ~ 5

Author(s):

Arya Kusuma ◽

De Rosal Ignatius Moses Setiadi ◽

M. Dalvin Marno Putra

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Bayes Classifier ◽

Nutritional Content ◽

Classification Technique ◽

Computational Speed ◽

Testing Data ◽

Bayes Algorithm ◽

Histogram Feature

Tomatoes have nutritional content that is very beneficial for human health and is one source of vitamins and minerals. Tomato classification plays an important role in many ways related to the distribution and sales of tomatoes. Classification can be done on images by extracting features and then classifying them with certain methods. This research proposes a classification technique using feature histogram extraction and Naïve Bayes Classifier. Histogram feature extractions are widely used and play a role in the classification results. Naïve Bayes is proposed because it has high accuracy and high computational speed when applied to a large number of databases, is robust to isolated noise points, and only requires small training data to estimate the parameters needed for classification. The proposed classification is divided into three classes, namely raw, mature and rotten. Based on the results of the experiment using 75 training data and 25 testing data obtained 76% accuracy

Download Full-text

A COMPARISON OF ACCURACY BETWEEN TWO METHODS: NAЇVE BAYES ALGORITHM AND DECISION TREE-J48 TO PREDICT THE STOCK PRICE OF PT ASTRA INTERNATIONAL Tbk USING DATA FROM INDONESIA STOCK EXCHANGE

Abstract Proceedings International Scholars Conference ◽

10.35974/isc.v7i1.1872 ◽

2019 ◽

Vol 7 (1) ◽

pp. 1244-1258

Author(s):

Joan Yuliana Hutapea ◽

Yusran Timur Samuel ◽

Heima Sitorus

Keyword(s):

Decision Making ◽

Decision Tree ◽

Stock Prices ◽

Stock Price ◽

Naive Bayes ◽

Stock Exchange ◽

Naïve Bayes ◽

Bayes Method ◽

Testing Data ◽

Bayes Algorithm

The ability to predict the stock prices is very important for market players, whether individual or organizational investors. The market players needs to know how to predict, that will help them in their decision making process, whether to buy or to sell its shares, so that it can maximize profits and reduce potential losses due to mistakes in decision making. In accordance to this, the authors conducted a study that aimed to analyze and to compare the accuracy of two (2) methods that is used to predict the stock prices, namely: the Naїve Bayes Method and the Decision Tree-J48 Method. The amount of data used in this study were 1,195 stock datas of PT Astra International Tbk, issued by the IDX, by the period of January 1, 2013 to November 30, 2017. This study uses 7 attributes, namely: Previews, High, Low, Close, Volume, Value, and Frequency. By using the WEKA application the result shows that, the accuracy of the Naïve Bayes Method using 20% of testing data, is 92.0502%, the precision value is 0.920 and the value of recall is 0.961, while the accuracy of the Decision Tree J-48 method, using 20% of testing data, is 98.7448%, with precision value of 0.989 and the value of recall of 0.997. Through this results, it can be concluded that the decision tree J-48 algorithm has a better accuracy results compared to the Naive Bayes algorithm in predicting the stock price of PT. Astra Internasional Tbk.

Download Full-text

Comparison Analysis of K-Nearest Neighbor and Naïve Bayes in Determining Talent of Adolescence

International Journal of Artificial Intelligence Research ◽

10.29099/ijair.v4i1.118 ◽

2020 ◽

Vol 4 (1) ◽

Author(s):

Yessi Jusman ◽

Widdya Rahmalina ◽

Juni Zarman

Keyword(s):

Nearest Neighbor ◽

Naive Bayes ◽

Confusion Matrix ◽

Naïve Bayes ◽

Training Data ◽

K Nearest Neighbor ◽

Combined Training ◽

Testing Data ◽

Bayes Algorithm ◽

Children's Interests

Adolescence always searches for the identity to shape the personality character. This paper aims to use the artificial intelligent analysis to determine the talent of the adolescence. This study uses a sample of children aged 10-18 years with testing data consisting of 100 respondents. The algorithm used for analysis is the K-Nearest Neigbor and Naive Bayes algorithm. The analysis results are performance of accuracy results of both algorithms of classification. In knowing the accurate algorithm in determining children's interests and talents, it can be seen from the accuracy of the data with the confusion matrix using the RapidMiner software for training data, testing data, and combined training and testing data. This study concludes that the K-Nearest Neighbor algorithm is better than Naive Bayes in terms of classification accuracy.

Download Full-text

Analisis Sentimen Berbasis Emoticon pada Komentar Instagram Bahasa Indonesia Menggunakan Naïve Bayes

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v7i3.4094 ◽

2021 ◽

Vol 7 (3) ◽

Author(s):

Kristian Adi Nugraha

Keyword(s):

Social Media ◽

Naive Bayes ◽

Naïve Bayes ◽

Sentiment Classification ◽

Communication Media ◽

Test Results ◽

Negative Comments ◽

Accuracy Result ◽

Bayes Algorithm ◽

Bahasa Indonesia

The usage of social media growing rapidly, especially after the smartphone was invented. Because the number of social media users was quite a lot, companies prefer to promote their products through social media like Instagram. But, unlike TV or radio, social media is a two-way communication media, that makes users can respond directly to the content created by the company. Comments given by users have various types of sentiment, like positive or negative comments. In addition to using text, comments also often contain emoticons to support the message. This study tries to analyzing sentiment based on the usage of emoticons inside them using the Naïve Bayes algorithm. Based on the test results, the accuracy result is quite good, it is about 96.3% correct in sentiment classification.

Download Full-text