Classification of Indonesian Government Budget Appropriations or Outlays for Research and Development (GBAORD) Using Decision Tree and Naive Bayes

Author(s):  
Arida Ferti Syafiandini ◽  
Hani Febri Mustika ◽  
Yan Rianto
Energies ◽  
2020 ◽  
Vol 13 (8) ◽  
pp. 2067
Author(s):  
Nilsa Duarte da Silva Lima ◽  
Irenilza de Alencar Nääs ◽  
João Gilberto Mendes dos Reis ◽  
Raquel Baracat Tosi Rodrigues da Silva

The present study aimed to assess and classify energy-environmental efficiency levels to reduce greenhouse gas emissions in the production, commercialization, and use of biofuels certified by the Brazilian National Biofuel Policy (RenovaBio). The parameters of the level of energy-environmental efficiency were standardized and categorized according to the Energy-Environmental Efficiency Rating (E-EER). The rating scale varied between lower efficiency (D) and high efficiency + (highest efficiency A+). The classification method with the J48 decision tree and naive Bayes algorithms was used to predict the models. The classification of the E-EER scores using a decision tree using the J48 algorithm and Bayesian classifiers using the naive Bayes algorithm produced decision tree models efficient at estimating the efficiency level of Brazilian ethanol producers and importers certified by the RenovaBio. The rules generated by the models can assess the level classes (efficiency scores) according to the scale discretized into high efficiency (Classification A), average efficiency (Classification B), and standard efficiency (Classification C). These results might generate an ethanol energy-environmental efficiency label for the end consumers and resellers of the product, to assist in making a purchase decision concerning its performance. The best classification model was naive Bayes, compared to the J48 decision tree. The classification of the Energy Efficiency Note levels using the naive Bayes algorithm produced a model capable of estimating the efficiency level of Brazilian ethanol to create labels.


Author(s):  
Jayalath Bandara Ekanayake

Manual classification of bug reports is time-consuming as the reports are received in large quantities. Alternatively, this project proposed automatic bug prediction models to classify the bug reports. The topics or the candidate keywords are mined from the developer description in bug reports using RAKE algorithm and converted into attributes. These attributes together with the target attribute—priority level—construct the training datasets. Naïve Bayes, logistic regression, and decision tree learner algorithms are trained, and the prediction quality was measured using area under recursive operative characteristics curves (AUC) as AUC does not consider the biasness in datasets. The logistics regression model outperforms the other two models providing the accuracy of 0.86 AUC whereas the naïve Bayes and the decision tree learner recorded 0.79 AUC and 0.81 AUC, respectively. The bugs can be classified without developer involvement and logistic regression is also a potential candidate as naïve Bayes for bug classification.


2021 ◽  
Vol 11 (2) ◽  
pp. 1084-1096
Author(s):  
T. Dinesh

Aim: The main aim of the study proposed is to perform higher classification of fake political news by implementing fake news detectors using machine learning classifiers by comparing their performance. Materials and Methods: By considering two groups such as Decision Tree algorithm and Naive Bayes algorithm. The algorithms have been implemented and tested over a dataset which consists of 44,000 records. Through the programming experiment which is performed using N=10 iterations on each algorithm to identify various scales of fake news and true news classification. Result: After performing the experiment the mean accuracy of 99.6990 by using Decision Tree algorithm and the accuracy of 95.3870 by using Naive Bayes algorithm for fake political news in. There is a statistical significant difference in accuracy for two algorithms is p<0.05 by performing independent samples t-tests. Conclusion: This paper is intended to implement the innovative fake news detection approach on recent Machine Learning Classifiers for prediction of fake political news. By testing the algorithms performance and accuracy on fake political news detection and other issues. The comparison results shows that the Decision Tree algorithm has better performance when compared to Naive Bayes algorithm.


2021 ◽  
Vol 4 (1) ◽  
pp. 63-69
Author(s):  
Baiq Andriska Candra Permana ◽  
◽  
Intan Komala Dewi Patwari ◽  

Diabetes is a group of metabolic diseases which is indicated by the occurrence of hyperglycemia caused by abnormalities in insulin secretion in the body. Many deaths are caused by diabetes, if this disease is not treated immediately, diabetes can cause damage to other organs such as blindness, stores, heart problem and even kidney problem. A best method is needed in classifying diabetes in order to detect diabetes early. Research related to the classification of diabetes using several calcification methods has been done before. In this study, two classification methods were compared, namely decision tree and naïve Bayes. Measurement methods were carried out through cross validation. The results obtained from this study are the best algorithms among the two algorithms to determine diabetes sufferers


2019 ◽  
Vol 15 (2) ◽  
pp. 275-280
Author(s):  
Agus Setiyono ◽  
Hilman F Pardede

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam.  One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.


2019 ◽  
Vol 64 (2) ◽  
pp. 53-71
Author(s):  
Botond Benedek ◽  
Ede László

Abstract Customer segmentation represents a true challenge in the automobile insurance industry, as datasets are large, multidimensional, unbalanced and it also requires a unique price determination based on the risk profile of the customer. Furthermore, the price determination of an insurance policy or the validity of the compensation claim, in most cases must be an instant decision. Therefore, the purpose of this research is to identify an easily usable data mining tool that is capable to identify key automobile insurance fraud indicators, facilitating the segmentation. In addition, the methods used by the tool, should be based primarily on numerical and categorical variables, as there is no well-functioning text mining tool for Central Eastern European languages. Hence, we decided on the SQL Server Analysis Services (SSAS) tool and to compare the performance of the decision tree, neural network and Naïve Bayes methods. The results suggest that decision tree and neural network are more suitable than Naïve Bayes, however the best conclusion can be drawn if we use the decision tree and neural network together.


Sign in / Sign up

Export Citation Format

Share Document