scholarly journals A COMPARISON OF ACCURACY BETWEEN TWO METHODS: NAЇVE BAYES ALGORITHM AND DECISION TREE-J48 TO PREDICT THE STOCK PRICE OF PT ASTRA INTERNATIONAL Tbk USING DATA FROM INDONESIA STOCK EXCHANGE

2019 ◽  
Vol 7 (1) ◽  
pp. 1244-1258
Author(s):  
Joan Yuliana Hutapea ◽  
Yusran Timur Samuel ◽  
Heima Sitorus

The ability to predict the stock prices is very important for market players, whether individual or organizational investors.  The market players needs to know how to predict, that will help them in their decision making process, whether to buy or to sell its shares, so that it can maximize profits and reduce potential losses due to mistakes in decision making.  In accordance to this, the authors conducted a study that aimed to analyze and to compare the accuracy of two (2) methods that is used to predict the stock prices, namely: the Naїve Bayes Method and the Decision Tree-J48 Method. The amount of data used in this study were 1,195 stock datas of PT Astra International Tbk, issued by the IDX, by the period of January 1, 2013 to November 30, 2017. This study uses 7 attributes, namely:  Previews, High, Low, Close, Volume, Value, and Frequency. By using the WEKA application the result shows that, the accuracy of the Naïve Bayes Method using 20% of testing data, is 92.0502%, the precision value is 0.920 and the value of recall is 0.961,  while the accuracy of the Decision Tree J-48 method, using 20% of testing data, is 98.7448%, with precision value of 0.989 and the value of recall of 0.997.   Through this results,  it can be concluded that the decision tree J-48 algorithm has a better accuracy results compared to the Naive Bayes algorithm in predicting the stock price of PT. Astra Internasional Tbk.

2020 ◽  
Vol 8 (3) ◽  
pp. 227
Author(s):  
Gede Widiastawan ◽  
I Gusti Agung Gede Arya Kadyanan

Goprint is an Online Printing Marketplace that connects printing services with users who want to print documents quickly without the need to queue. In the span of time from April 2019 to September 2019 it was found that the number of Goprint users reached 407 users, 24 partners, and 256 orders. From transactions that have been carried out by users, not a few orders are often canceled due to ineffective Goprint features or poor partner performance. This causes Goprint users to feel dissatisfied with the services provided by the Goprint application. The Naive Bayes algorithm is one of the algorithms used for classification or grouping of data, but can also be used for decision making. With this algorithm and the problems that occur, the authors make a system to predict the loyalty of Goprint users to anticipate users who stop leaving Goprint because they are not satisfied or loyal users. The data used as training data is 20 and testing data is 10. From the test results it is found that the value of precision is 80%, 100% recall, and 90% accuracy.


SinkrOn ◽  
2020 ◽  
Vol 5 (1) ◽  
pp. 9-20
Author(s):  
Antonius Yadi Kuntoro

Abstract — The current Governor of DKI Jakarta, even though he has been elected since 2017 is always interesting to talk about or even comment on. Comments that appear come from the media directly or through social media. Twitter has become one of the social media that is often used as a media to comment on elected governors and can even become a trending topic on Twitter social media. Netizens who comment are also varied, some are always Tweeting criticism, some are commenting Positively, and some are only re-Tweeting. In this research, a prediction of whether active Netizens will tend to always lead to Positive or Negative comments will be carried out in this study. Model algorithms used are Decision Tree, Naïve Bayes, Random Forest, and also Ensemble. Twitter data that is processed must go through preprocessing first before proceeding using Rapidminer. In trials using Rapidminer conducted in four trials by dividing into two parts, namely testing data and training data. Comparisons made are 10% testing data: 90% Training data, then 20% testing data: 80% training data, then 30% testing data: 70% training data, and the last is 35% testing data: 65% training data. The average Accuracy for the Decision Tree algorithm is 93.15%, while for the Naïve Bayes algorithm the Accuracy is 91.55%, then for the Random Forest algorithm is 93.41, and the last is the Ensemble algorithm with an Accuracy of 93, 42%. here. Keywords — Decision Tree, Naïve Bayes, Random Forest, Set, Twitter.  


2018 ◽  
Vol 7 (1.7) ◽  
pp. 137 ◽  
Author(s):  
Danda Shashank Reddy ◽  
Chinta Naga Harshitha ◽  
Carmel Mary Belinda

Now a day’s many advanced techniques are proposed in diagnosing the tumor in brain like magnetic resonance imaging, computer tomography scan, angiogram, spinal tap and biospy. Based on diagnosis it is easy to predict treatment. All of the types of brain tumor are officially reclassified by the World Health Organization. Brain tumors are of 120 types, almost each tumor is having same symptoms and it is difficult to predict treatment. For this regard we are proposing more accurate and efficient algorithm in predicting the type of brain tumor is Naïve Bayes’ classification and decision tree algorithm. The main focus is on solving tumor classification problem using these algorithms. Here the main goal is to show that the prediction through the decision tree algorithm is simple and easy than the Naïve Bayes’ algorithm.


2020 ◽  
Vol 12 (2) ◽  
pp. 104-107
Author(s):  
Nurhayati . ◽  
Nuraeny Septianti ◽  
Nani Retnowati ◽  
Arief Wibowo

Data processing is imperative for the development of information technology. Almost any field of work has information about data. The data is made use of the analysis of the job. Nowadays, information data is imperatively processed to help workers in making decisions. This study discusses student prediction graduation rates by using the naïve Bayes method. That aims at providing information to college if they can use it properly to utilize the data of students who graduated by processing data mining. Based on the data mining process, steps founded that used producing information, namely predicting student graduation on time. The method of this study is Naïve Bayes with classification techniques. At this study, researchers used a six-phase data mining process of industry crossing standards in data mining known as CRISP-DM. The results of research concluded that the application of the Naive Bayes algorithm uses 4 (four) parameters namely ips, ipk, the number of credits, and graduation by getting an accuracy value of 80.95%.


Energies ◽  
2020 ◽  
Vol 13 (8) ◽  
pp. 2067
Author(s):  
Nilsa Duarte da Silva Lima ◽  
Irenilza de Alencar Nääs ◽  
João Gilberto Mendes dos Reis ◽  
Raquel Baracat Tosi Rodrigues da Silva

The present study aimed to assess and classify energy-environmental efficiency levels to reduce greenhouse gas emissions in the production, commercialization, and use of biofuels certified by the Brazilian National Biofuel Policy (RenovaBio). The parameters of the level of energy-environmental efficiency were standardized and categorized according to the Energy-Environmental Efficiency Rating (E-EER). The rating scale varied between lower efficiency (D) and high efficiency + (highest efficiency A+). The classification method with the J48 decision tree and naive Bayes algorithms was used to predict the models. The classification of the E-EER scores using a decision tree using the J48 algorithm and Bayesian classifiers using the naive Bayes algorithm produced decision tree models efficient at estimating the efficiency level of Brazilian ethanol producers and importers certified by the RenovaBio. The rules generated by the models can assess the level classes (efficiency scores) according to the scale discretized into high efficiency (Classification A), average efficiency (Classification B), and standard efficiency (Classification C). These results might generate an ethanol energy-environmental efficiency label for the end consumers and resellers of the product, to assist in making a purchase decision concerning its performance. The best classification model was naive Bayes, compared to the J48 decision tree. The classification of the Energy Efficiency Note levels using the naive Bayes algorithm produced a model capable of estimating the efficiency level of Brazilian ethanol to create labels.


Author(s):  
Youllia Indrawaty Nurhasanah ◽  
Asep Nana Hermana ◽  
Mahesa Arga Hutama

Sugeno Fuzzy algorithm is one of the algorithms contained on Fuzzy Inference System, that used to describe the condition between the two pieces of the decisions represented in the form of rules IF - THEN, where the output is constant or linear equations. While the Naive Bayes algorithm is an algorithm that uses data classification to a particular class based on the probability of each data class. Both of these algorithms can be implemented on a Decision Support System (DSS) for diet selection, using Fuzzy Sugeno as an additional determinant of energy and Naive Bayes method as decision maker. This is because the need for food intake and diet has become a problem for humans. To prevent excess intake of food it needs dietary adjustments or so-called diet. But in daily life, people sometimes hard to determine the type of diet that is suitable for them. So we need a system that can determine the type of diet that is suitable for a person. The data that used as a reference for decision support are age, daily caloric requirement, Body Mass Index (BMI), blood pressure, cholesterol, uric acid and blood sugar levels. Results of system testing showed from a sample of 30 data there are 26 appropriate data and 4 inappropriate data to determine the type of diet by the system with the success rate of 86.7%.


2021 ◽  
Vol 328 ◽  
pp. 04011
Author(s):  
Alwin Ali ◽  
Amal Khairan ◽  
Firman Tempola ◽  
Achmad Fuad

The amount of rainfall that occurs cannot be determined with certainty, but it can be predicted or estimated. In predicting the potential for rain, data mining techniques can be used by classifying data using the naive Bayes method. Naïve Bayes algorithm is a classification method using probability and statistical methods. The purpose of this study is how to implement the naive Bayes method to predict the potential for rain in Ternate City, and be able to calculate the accuracy of the Naive Bayes method from system created. The highest calculation results with new data with a total of 400 training data and 30 test data, obtained 30 correct data with 100% precision, 100% recall and 100% accuracy and the lowest calculation results with new data with a total of 500 training data and 50 test data, obtained 38 correct data and 12 incorrect data with a percentage of precision 61.29%, recall 100% and accuracy 76%.


2021 ◽  
Vol 5 (1) ◽  
pp. 32
Author(s):  
Hartatik Hartatik

<p>Abstrak :</p><p>Prediksi tentang status kelulusan mahasiswa menjadi persoalan tersendiri di perguruan tinggi. Perguruan tinggi utamanya di era Big Data sangatlah penting untuk melakukan prediksi perilaku akademik mahasiswa aktif sehingga dapat di ketahui kemungkinan mahasiswa bisa studi secara tepat waktu serta dapat diketahui langkah preventive dalam membuat prpgram perencanaan. Salah satu cara yang digunakan adalah teknik data mining yaitu menggunakan Algoritma <em>naive bayes</em>. Algoritma <em>Naive bayes</em> merupakan salah satu metode yang digunakan untuk memprediksi kelulusan mahasiswa.  Peneliti  dalam hal ini menerapkan  metode  <em>Naive bayes</em> menggunakan parameter Indeks prestasi kumulatif( IPK) dan membandingkan dengan menggunakan prediksi <em>naive bayes methods</em> berdasarkan parameter IPK dan sosial parameter yaitu jenis kelamin dan status tinggal. Dalam penelitian ini menggunakan parameter akademis  dan dilakukan optimasi menggunakan parameter sosial yang melekat pada mahasiswa. Berdasarkan hasil evaluasi untuk mendapatkan akurasi, hasil dari penelitian ini mendapatkan nilai akurasi untuk metode <em>Naive bayes</em>  sebesar 75% dan akurasi untuk model prediksi dengan parameter sosial  sebesar 85% dengan selisih akurasi 10%.</p><p>__________________________</p><p>Abstract : </p><p><em>Predictions about a student's graduation status are a problem in college. Major tertiary institutions in the era of Big Data are very important to predict the behavior of active students so that they can find out the possibility of students in a timely manner and can determine preventive steps in making program planning. One method used is data mining techniques using the Naive bayes Algorithm. The Naive bayes algorithm is one of the methods used to predict student graduation. Researchers in this case applied the Naive bayes method using the cumulative achievement index (GPA) parameter and compared using the prediction of the Naive bayes method based on the GPA parameters and social parameters, namely gender and status. This study uses academic parameters and is carried out optimally using social parameters inherent in students. Based on the results of the evaluation to get an accuracy value, the results of this study get an accurate value for the Naive bayes method of 75% and accurate for prediction models with social parameters of 85% with a difference of 10%.</em></p>


2018 ◽  
Vol 7 (4.44) ◽  
pp. 82
Author(s):  
Dyah Ayu Irawati ◽  
Yan Watequlis Syaifudin ◽  
Fabiola Ester Tomasila ◽  
Awan Setiawan ◽  
Erfan Rohadi

Many rabbit keepers or breeders are panics when their rabbit has an illness. This paper proposed an expert diagnostic system application for Android-based rabbit disease using the Naïve Bayes method to determine the illness and Certainty Factor for the trust value of the condition by combining the rate of the trust of users and experts due to diagnose the diseases of the rabbit.The testing was using 65 data learning and 160 data learning to test the naïve Bayes method. Furthermore, the certainty factor is using CF user 1 and its variation.The results obtained for 65 data learning is 53%, while 160 data learning is 73%. With the naïve Bayes method, it can be concluded that the more data learning, the better and more accurate the system. The results of conformity with the testing data obtained from the variative CF user value, namely 53% accordingly, 13% inappropriate, 33% near. The effect of compliance with the sample data collected from the CF value of user 1 is 53% appropriate, 7% inappropriate, 40% is near. With the certainty factor method, it can be concluded that differences in user input values affect the overall CF value. 


Sign in / Sign up

Export Citation Format

Share Document