Analisis Perbandingan Akurasi Algoritma Naïve Bayes Dan C4.5 untuk Klasifikasi Diabetes

Diabetes is a metabolic disease in which blood sugar rises high. If blood sugar is not controlled properly, it can cause a variety of critical diseases, one of which is diabetes. The purpose of this study was to find out the results of comparing the performance values of Naïve Bayes and C4.5 algorithms with 7 different scenarios in the classification of diabetes that will be tested for accuracy, precision, and recall performance. The method used in this study is descriptive, and the source of skunder data obtained from the data of diabetic patients available on Kaggle with the format .csv issued by Ishan Dutta as many as 520 data and 17 fields. The tool used for data analysis is Rapidminer for the process of classification and performance testing of Naïve Bayes algorithm and C4.5 Algorithm. Our results showed that the C4.5 algorithm (scenario 4) had good results in the classification of diabetes compared to Naïve Bayes' algorithm (scenario 2) where the performance of the C4.5 algorithm had an accuracy of 99.03%, precision 100%, and recall 98.18%.

Download Full-text

Perbandingan Optimasi Feature Selection pada Naïve Bayes untuk Klasifikasi Kepuasan Airline Passenger

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v5i3.3086 ◽

2021 ◽

Vol 5 (3) ◽

pp. 527-533

Author(s):

Yoga Religia ◽

Amali Amali

Keyword(s):

Feature Selection ◽

Customer Satisfaction ◽

Naive Bayes ◽

Naïve Bayes ◽

Point Of View ◽

Classification Model ◽

Passenger Satisfaction ◽

Airline Passenger ◽

Bayes Algorithm

The quality of an airline's services cannot be measured from the company's point of view, but must be seen from the point of view of customer satisfaction. Data mining techniques make it possible to predict airline customer satisfaction with a classification model. The Naïve Bayes algorithm has demonstrated outstanding classification accuracy, but currently independent assumptions are rarely discussed. Some literature suggests the use of attribute weighting to reduce independent assumptions, which can be done using particle swarm optimization (PSO) and genetic algorithm (GA) through feature selection. This study conducted a comparison of PSO and GA optimization on Naïve Bayes for the classification of Airline Passenger Satisfaction data taken from www.kaggle.com. After testing, the best performance is obtained from the model formed, namely the classification of Airline Passenger Satisfaction data using the Naïve Bayes algorithm with PSO optimization, where the accuracy value is 86.13%, the precision value is 87.90%, the recall value is 87.29%, and the value is AUC of 0.923.

Download Full-text

Analysis and Classification of Danger Level in Android Applications Using Naive Bayes Algorithm

2018 6th International Conference on Information and Communication Technology (ICoICT) ◽

10.1109/icoict.2018.8528733 ◽

2018 ◽

Author(s):

Ridho Alif Utama ◽

Parman Sukarno ◽

Erwid Musthofa Jadied

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Android Applications ◽

Bayes Algorithm ◽

Danger Level

Download Full-text

Classification of diabetic patients records using Naïve Bayes classifier

2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) ◽

10.1109/rteict.2017.8256787 ◽

2017 ◽

Author(s):

K S Thulasi ◽

E S Ninu ◽

Kumar K M Shiva

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Diabetic Patients ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Download Full-text

Classifying the Level of Energy-Environmental Efficiency Rating of Brazilian Ethanol

Energies ◽

10.3390/en13082067 ◽

2020 ◽

Vol 13 (8) ◽

pp. 2067

Author(s):

Nilsa Duarte da Silva Lima ◽

Irenilza de Alencar Nääs ◽

João Gilberto Mendes dos Reis ◽

Raquel Baracat Tosi Rodrigues da Silva

Keyword(s):

Decision Tree ◽

High Efficiency ◽

Rating Scale ◽

Naive Bayes ◽

Naïve Bayes ◽

Environmental Efficiency ◽

Classification Model ◽

Bayes Algorithm ◽

J48 Decision Tree

The present study aimed to assess and classify energy-environmental efficiency levels to reduce greenhouse gas emissions in the production, commercialization, and use of biofuels certified by the Brazilian National Biofuel Policy (RenovaBio). The parameters of the level of energy-environmental efficiency were standardized and categorized according to the Energy-Environmental Efficiency Rating (E-EER). The rating scale varied between lower efficiency (D) and high efficiency + (highest efficiency A+). The classification method with the J48 decision tree and naive Bayes algorithms was used to predict the models. The classification of the E-EER scores using a decision tree using the J48 algorithm and Bayesian classifiers using the naive Bayes algorithm produced decision tree models efficient at estimating the efficiency level of Brazilian ethanol producers and importers certified by the RenovaBio. The rules generated by the models can assess the level classes (efficiency scores) according to the scale discretized into high efficiency (Classification A), average efficiency (Classification B), and standard efficiency (Classification C). These results might generate an ethanol energy-environmental efficiency label for the end consumers and resellers of the product, to assist in making a purchase decision concerning its performance. The best classification model was naive Bayes, compared to the J48 decision tree. The classification of the Energy Efficiency Note levels using the naive Bayes algorithm produced a model capable of estimating the efficiency level of Brazilian ethanol to create labels.

Download Full-text

Analisis Klasifikasi Kanker Payudara Menggunakan Algoritma Naive Bayes

INFORMAL: Informatics Journal ◽

10.19184/isj.v4i3.14170 ◽

2020 ◽

Vol 4 (3) ◽

pp. 117

Author(s):

Hardian Oktavianto ◽

Rahman Puji Handri

Keyword(s):

Breast Cancer ◽

Naive Bayes ◽

Naïve Bayes ◽

World Health ◽

Average Percentage ◽

Average Value ◽

Treatment Measures ◽

Bayes Algorithm ◽

Health Organization

Breast cancer is one of the highest causes of death among women, this disease ranks second cause of death after lung cancer. According to the world health organization, 1 million women get a diagnosis of breast cancer every year and half of them die, in general this is due to early treatment and slow treatment resulting in new cancers being detected after entering the final stage. In the field of health and medicine, machine learning-based classification has been carried out to help doctors and health professionals in classifying the types of cancer, to determine which treatment measures should be performed. In this study breast cancer classification will be carried out using the Naive Bayes algorithm to group the types of cancer. The dataset used is from the Wisconsin breast cancer database. The results of this study are the ability of the Naive Bayes algorithm for the classification of breast cancer produces a good value, where the average percentage of correctly classified data reaches 96.9% and the average percentage of data is classified as incorrect only 3.1%. While the level of effectiveness of classification with naive bayes is high, where the average value of precision and recall is around 0.96. The highest precision and recall values are when the test data uses a percentage split of 40% with the respective values reaching 0.974 and 0.973.

Download Full-text

ANALISIS KINERJA ALGORITMA C4.5 DAN NAÏVE BAYES DALAM MEMPREDIKSI KEBERHASILAN SEKOLAH MENGHADAPI UN

JITK (Jurnal Ilmu Pengetahuan dan Teknologi Komputer) ◽

10.33480/jitk.v5i2.1233 ◽

2020 ◽

Vol 5 (2) ◽

pp. 285-290

Author(s):

Yeni Angraini ◽

Siti Fauziah ◽

Jordi Lasmana Putra

Keyword(s):

High School ◽

Junior High School ◽

Naive Bayes ◽

Naïve Bayes ◽

National Examination ◽

National Examinations ◽

C4.5 Algorithm ◽

Student Graduation ◽

Bayes Algorithm ◽

Almost All

The national exam (UN) is one of the determinants of student graduation, both elementary school, junior high school and even high school. There are many businesses that are carried out by schools to prepare their students to face national examinations. In fact almost all schools provide material deepening to their students for subjects tested at the national examination. Therefore, this study was conducted to determine the level of success of the school in preparing students in facing national examinations. The method used is a decision tree with C4.5 algorithm and naïve Bayes algorithm. From the results of the study, the results of the accuracy of the naïve bayes algorithm were as big as 95,50% , while accuracy using the c4.5 algorithm is equal to 78,50%. Then it can be concluded that the predictions generated from the naïve bayes algorithm are better compared to the c4.5 algorithm .

Download Full-text

Opinion Mining on Culinary Food Customer Satisfaction Using Naïve Bayes Based-on Hybrid Feature Selection

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v15.i1.pp468-475 ◽

2019 ◽

Vol 15 (1) ◽

pp. 468 ◽

Cited By ~ 3

Author(s):

Oman Somantri ◽

Dyah Apriliani

Keyword(s):

Feature Selection ◽

Opinion Mining ◽

Naive Bayes ◽

Information Gain ◽

Naïve Bayes ◽

Classification Model ◽

Consumer Ratings ◽

Bayes Algorithm ◽

Restaurant Owners

<p>Conducting an assessment of consumer sentiments taken from social media in assessing a culinary food gives useful information for everyone who wants to get this information especially for migrants and tourists, in th other hand that information is very valuable for food stall and restaurant owners as information in improvinf food quality. Overcoming this problem, a sentiment analysis classification model using naïve bayes algorithm (NB) was applied to get this information. This problem occurs is the level of accuracy of classification of consumer ratings of culinary food is still not optimal because the weight of values in the data preprocessing process are not optimal. In this paper proposed a hybrid feature selection models to overcome the problems in the process of selecting the feature attributes that have not been optimal by using a combination of information gain (IG) and genetic algorithm (GA) algorithms. The result of this research showed that after the experiment and compared to using others algorithms produce the best of the level occuracy is 93%.</p>

Download Full-text

KLASIFIKASI PENGENALAN BUAH MENGGUNAKAN ALGORITMA NAIVE BAIYES

Jurnal RESISTOR (Rekayasa Sistem Komputer) ◽

10.31598/jurnalresistor.v2i2.434 ◽

2019 ◽

Vol 2 (2) ◽

pp. 83-88

Author(s):

Arif Saputra

Keyword(s):

Fuzzy Logic ◽

Naive Bayes ◽

Naïve Bayes ◽

Development Platform ◽

Apple Varieties ◽

Software Methodology ◽

Average Accuracy ◽

Bayes Algorithm ◽

Conducting Research ◽

And Performance

Manually sorting varieties of apples result in high costs, subjectivity, boredom, and inconsistencies associated with humans. A means is needed to distinguish between types of apples and, therefore, some reliable techniques are necessary to identify varieties quickly and without damage. The purpose of conducting research is to investigate the application and performance for Naive Bayes algorithm for apple varieties. This software methodology involves image acquisition, preprocessing, segmentation and analysis classification varieties for apple. The prototype of Apple's classification system was built using the MATLAB R2017 development platform environment. The results in this study indicate that the estimated average accuracy, sensitivity, precision, and specificity are 81%, 73%, 100%, and 70%, respectively. MLP-Neural shows that performance of the Naive Bayes technique is consistent with Principal, Fuzzy Logic, and Neural analysis with 89%, 91%, 87%, and 82% respectively in terms of accuracy. This study shows that Naif Bayes has excellent potential for identifying nondestructive and accurate apple varieties.

Download Full-text

The Comparison of Data Mining Methods Using C4.5 Algorithm and Naive Bayes in Predicting Heart Disease

Tech-E ◽

10.31253/te.v4i2.543 ◽

2021 ◽

Vol 4 (2) ◽

pp. 44

Author(s):

Rino Rino

Keyword(s):

Data Mining ◽

Heart Disease ◽

Naive Bayes ◽

Naïve Bayes ◽

Data Set ◽

A Value ◽

C4.5 Algorithm ◽

Calculation Results ◽

Mining Methods ◽

Bayes Algorithm

Heart disease is a condition of the presence of fatty deposits in the coronary arteries in the heart which changes the role and shape of the arteries so that blood flow to the heart is obstructed. Data mining methods can predict this disease, some of the methods are C4.5 Algorithm and Naive Bayes which are often used in research.The data set in this research was obtained from the uci machine learning repository site, where the dataset has 3546 records and 13 attributes.The accuracy value of the Naïve Bayes algorithm has a high value of 81.40% compared to the C4.5 algorithm which only has an accuracy value of 79.07%. Based on the calculation results, it can be concluded that the Naïve Bayes Algorithm is a very good clarification because it has a value between 0.709 - 1.00.From conclusion above, the Naïve Bayes algorithm has a higher accuracy value than the C4.5 algorithm so the researchers decided to use the Naïve Bayes algorithm in predicting heart disease.

Download Full-text

Accuracy Enhancement of Diabetic Retinopathy Detection Using Naive Bayes Algorithm

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.37099 ◽

2021 ◽

Vol 9 (VII) ◽

pp. 3648-3652

Author(s):

Mithileshkumar Yadav

Keyword(s):

Diabetic Retinopathy ◽

Vision Loss ◽

Naive Bayes ◽

Naïve Bayes ◽

Automated System ◽

Diabetic Patients ◽

Detection Accuracy ◽

Monitoring Model ◽

Bayes Algorithm ◽

Retinal Fundus

Diabetic retinopathy (DR) is a disease of eye which is caused by diabetes. Sometime the DR leads the diabetic patients to complete vision loss. In this scenario, early identification of DR is more essential to protect the eyesight and provide help for timely treatment. The detection of DR can be done manually by ophthalmologists and can also be done by an automated system. An ophthalmologist is required to analyze and explain retinal fundus images in the manual system, which is a time consuming and very expensive task. While, In the automated system, artificial intelligence is used to perform an significant role in the area of ophthalmology and specifically in the early detection of DR over the traditional detection approaches. Recently, numerous advanced studies related to the identification of DR have been reported, But still research for accurate detection of DR is going on. In this paper, a new diabetic retinopathy monitoring model is proposed by using the Naive Bayes method to improve the accuracy of detection of DR. The model is trained on mixture of two datasets Messidor and Kaggle, and evaluated on the Messidor dataset. By using proposed method detection accuracy is found to be higher than existing methods.

Download Full-text