PENERAPAN DATA MINING MENGGUNAKAN METODE TEKNIK CLASSIFICATION UNTUK MELIHAT POTENSI KEPATUHAN WAJIB PAJAK BUMI DAN BANGUNAN

The goverment implements development in Indonesia, requires substantial funds. The entry of cash from the Land and Building Tax is the most important part for the development of a region, with the results that have been obtained by the regional government can increase regional development with various infrastructures that help the community to carry out various activities and make the area more advanced. One type of tax is the Land and Building Tax (PBB). With the increasing number of taxpayers and data paying contributions directly into the treasury of state finances, the UPT BPPD of SU II Subdistrict of Palembang city did not know how many obedient and non-compliant taxpayers. In this study using data mining techniques, namely classification by applying the Naive Bayes algorithm and getting from the number of taxpayers as many as 1,647 taxpayers with an accuracy of 99.33% which has the potential to not be on time in 16 ulu villages at 0,437 and sub-district households with data of 0.229.

Download Full-text

Predicting heart ailment in patients with varying number of features using data mining techniques

International Journal of Informatics and Communication Technology (IJ-ICT) ◽

10.11591/ijict.v8i1.pp56-62 ◽

2019 ◽

Vol 8 (1) ◽

pp. 56

Author(s):

T R Stella Mary ◽

Shoney Sebastian

Keyword(s):

Data Mining ◽

Heart Disease ◽

Random Forest ◽

Naive Bayes ◽

Heart Diseases ◽

Naïve Bayes ◽

Bayes Classifier ◽

Data Mining Techniques ◽

Using Data ◽

Almost All

Data mining can be defined as a process of extracting unknown, verifiable and possibly helpful data from information. Among the various ailments, heart ailment is one of the primary reason behind death of individuals around the globe, hence in order to curb this, a detailed analysis is done using Data Mining. Many a times we limit ourselves with minimal attributes that are required to predict a patient with heart disease. By doing so we are missing on a lot of important attributes that are main causes for heart diseases. Hence, this research aims at considering almost all the important features affecting heart disease and performs the analysis step by step with minimal to maximum set of attributes using Data Mining techniques to predict heart ailments. The various classification methods used are Naïve Bayes classifier, Random Forest and Random Tree which are applied on three datasets with different number of attributes but with a common class label. From the analysis performed, it shows that there is a gradual increase in prediction accuracies with the increase in the attributes irrespective of the classifiers used and Naïve Bayes and Random Forest algorithms comparatively outperforms with these sets of data.

Download Full-text

Predicting Heart Ailment in Patients with Varying number of Features using Data Mining Techniques

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i4.pp2675-2681 ◽

2019 ◽

Vol 9 (4) ◽

pp. 2675

Author(s):

T R Stella Mary ◽

Shoney Sebastian

Keyword(s):

Data Mining ◽

Heart Disease ◽

Random Forest ◽

Naive Bayes ◽

Heart Diseases ◽

Naïve Bayes ◽

Bayes Classifier ◽

Data Mining Techniques ◽

Using Data ◽

Almost All

Data mining can be defined as a process of extracting unknown, verifiable and possibly helpful data from information. Among the various ailments, heart ailment is one of the primary reason behind death of individuals around the globe, hence in order to curb this, a detailed analysis is done using Data Mining. Many a times we limit ourselves with minimal attributes that are required to predict a patient with heart disease. By doing so we are missing on a lot of important attributes that are main causes for heart diseases. Hence, this research aims at considering almost all the important features affecting heart disease and performs the analysis step by step with minimal to maximum set of attributes using Data Mining techniques to predict heart ailments. The various classification methods used are Naïve Bayes classifier, Random Forest and Random Tree which are applied on three datasets with different number of attributes but with a common class label. From the analysis performed, it shows that there is a gradual increase in prediction accuracies with the increase in the attributes irrespective of the classifiers used and Naïve Bayes and Random Forest algorithms comparatively outperforms with these sets of data.

Download Full-text

Machine Learning for Cryptographic Algorithm Identification

Journal of Information Security and Cryptography (Enigma) ◽

10.17648/enig.v3i1.55 ◽

2016 ◽

Vol 3 (1) ◽

pp. 3 ◽

Cited By ~ 2

Author(s):

Flávio Barbosa ◽

Arthur Vidal ◽

Flávio Mello

Keyword(s):

Machine Learning ◽

Data Mining ◽

Multilayer Perceptron ◽

Naive Bayes ◽

Naïve Bayes ◽

Data Mining Techniques ◽

Cryptographic Algorithm ◽

Cryptographic Algorithms ◽

Using Data ◽

Confusion Matrices

This paper aims to study encrypted text files in order to identify their encoding algorithm. Plain texts were encoded with distinct cryptographic algorithms and then some metadata were extracted from these codifications. Afterward, the algorithm identification is obtained by using data mining techniques. Firstly, texts in Portuguese, English and Spanish were encrypted using DES, Blowfish, RSA, and RC4 algorithms. Secondly, the encrypted files were submitted to data mining techniques such as J48, FT, PART, Complement Naive Bayes, and Multilayer Perceptron classifiers. Charts were created using the confusion matrices generated in step two and it was possible to perceive that the percentage of identification for each of the algorithms is greater than a probabilistic bid. There are several scenarios where algorithm identification reaches almost 97, 23% of correctness.

Download Full-text

Prediksi Kelulusan Mahasiswa menggunakan Algoritma Naive Bayes (Studi Kasus 5 PTS di Banda Aceh)

Jurnal JTIK (Jurnal Teknologi Informasi dan Komunikasi) ◽

10.35870/jtik.v3i2.77 ◽

2019 ◽

Vol 3 (2) ◽

pp. 59

Author(s):

Munawir Munawir ◽

Taufiq Iqbal

Keyword(s):

Data Mining ◽

Research Method ◽

Naive Bayes ◽

Naïve Bayes ◽

Data Mining Algorithm ◽

Standard Data ◽

Industry Standard ◽

Bayes Algorithm ◽

Using Data ◽

Banda Aceh

The e-questionnaire application that researchers built using CodeIgniter and React-Js This study aims to data mining by using rapidminer tools to collect student data from the Feeder application page from the class of 2010-2014 which is assumed that the student class has been declared graduated in 2018. The data was collected from 5 (five) Private Universities in the City Banda Aceh. then by observing the graduation level using data mining can bring a considerable contribution to educational institutions, in an effort to improve curriculum competency in Higher Education, it is expected that the results of data mining can make reference to curriculum standards as a form of graduate competency improvement. The research method uses the Cross-Industry Standard Process for Data Mining (CRISP-DM) which is used as a standard data mining process as well as a research method with stages starting from Business understanding, data understanding, data preparation, modeling, evaluation, and deployment. The results showed that the data mining algorithm for graduation prediction based on the selected pass accuracy attribute revealed that the prediction level was uniform with the algorithm used, Naïve Bayes, prediction accuracy was 84%. The data attributes that were found to have significantly influenced the classification process were the GPA and Study Length. The results obtained that students who graduated by 60% are students who are educated in ASM Nusantara and AMIK Indonesia, while in Banda Aceh STIES and Serambi University Mecca the prediction of graduation is 52%. Another thing is different from STIA Iskandar Thani where the prediction of graduating is only 48% and not passing on time is 52%. The results of this prediction can reveal and become a recommendation for prospective students or academics to increase the quantity of graduates and increase student confidence in tertiary institutions.Keywords:Prediction, Student Graduation, Naive Bayes Algorithm.

Download Full-text

IMPLEMENTASI DATA MINING UNTUK MEMPREDIKSI PEMESANAN DRIVER GO-JEK ONLINE DENGAN MENGGUNAKAN METODE NAIVE BAYES (STUDI KASUS: PT. GO-JEK INDONESIA)

KOMIK (Konferensi Nasional Teknologi Informasi dan Komputer) ◽

10.30865/komik.v2i1.972 ◽

2018 ◽

Vol 2 (1) ◽

Author(s):

Delisman Laia ◽

Efori Buulolo ◽

Matias Julyus Fika Sirait

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Transportation Industry ◽

Data Set ◽

Data Mining Algorithms ◽

Taxi Service ◽

Bayes Algorithm ◽

Using Data

PT. Go-Jek Indonesia is a service company. Go-jek online is a technology-based motorcycle taxi service that leads the transportation industry revolution. Predictions on ordering go-jek drivers using data mining algorithms are used to solve problems faced by the company PT. Go-Jek Indonesia to predict the level of ordering of online go-to drivers. In determining the crowded and lonely time. The proposed method is Naive Bayes. Naive Bayes algorithm aims to classify data in certain classes. The purpose of this study is to look at the prediction patterns of each of the attributes contained in the data set by using the naive algorithm and testing the training data on testing data to see whether the data pattern is good or not. what will be predicted is to collect the data of the previous driver ordering, which is based on the day, time for one month. The Naive Bayes algorithm is used to predict the ordering of online go-to-go drivers that will be experienced every day by seeing each order such as morning, afternoon and evening. The results of this study are to make it easier for the company to analyze the data of each go-jek driver booking in taking policies to ensure that both drivers and consumers or customers.Keywords: Go-jek Driver, Data Mining, Naive Bayes

Download Full-text

Prediction of Student Performance System using Machine Learning Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f3427.049620 ◽

2020 ◽

Vol 9 (6) ◽

pp. 32-37

Keyword(s):

Data Mining ◽

Student Performance ◽

Naive Bayes ◽

Educational Data Mining ◽

Estimation Method ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Prediction Algorithm ◽

Data Mining Techniques ◽

Bayes Algorithm

Educational organizations are unique and play the utmost significant role in the development of any country. In the Educational database, due to the enormous volume of data for predicting student's achievement becomes more complicated. To upgrade a student's performance and triumph is more efficient in a practical way using Educational Data Mining Techniques. Data Mining Techniques could deliver favor and brunt to educators and academic institutions. The student's data ((i.e.) Name,10th %,12th cut off, CGPA, No of arrears, etc.) are gathered. Then, the datasets are imported into the Anaconda Navigator. Then, analysis and classification based on attributes of the students and the schemes are performed. Then using the prediction algorithm Naïve Bayes what are all the features the particular student is eligible for are predicted as placed. The student's input that has disparate data about their past and present academics report and then apply the Naïve Bayes algorithm using Anaconda Navigator to search the student's achievement for placement. A proposed methodology based on a classification approach to finding an improved estimation method for predicting the placement for students. This project can find the association for academic achievement of each particular student and their placement achievement in campus selection.

Download Full-text

KLASIFIKASI NAÏVE BAYES UNTUK MENDIAGNOSIS PENYAKIT PNEUMONIA PADA ANAK BALITA (STUDI KASUS : UPTD PUSKESMAS SUKARAJA SUKABUMI)

KLIK - KUMPULAN JURNAL ILMU KOMPUTER ◽

10.20527/klik.v6i3.202 ◽

2019 ◽

Vol 6 (3) ◽

pp. 241

Author(s):

Ami Rahmawati ◽

Dede Wintana ◽

Satia Suhada ◽

Gunawan Gunawan ◽

Hamdun Sulaiman

Keyword(s):

Data Mining ◽

Indirect Effects ◽

Naive Bayes ◽

Developed Countries ◽

Naïve Bayes ◽

Physical Damage ◽

The World ◽

Difficulty Breathing ◽

Bayes Algorithm ◽

Using Data

Pneumonia is a contagious infectious disease that is the leading cause of death in toddlers in the world. In developed countries, there are 4 million cases each year, totaling 156 million cases of pneumonia every year worldwide. Pneumonia is caused by, among others, bacteria, viruses, fungi, exposure to chemicals or physical damage from the lungs, as well as indirect effects from other diseases. Pneumonia is characterized by symptoms of coughing and / or difficulty breathing such as rapid breathing, and pulling the lower chest wall inward. Therefore, early detection of pneumonia in children under five is very necessary in order to be able to prevent and cope with the disease into a serious stage as the purpose of this study is to diagnose pneumonia in toddlers using data mining classification, the naïve Bayes algorithm. Of the 118 cases consisting of 113 cases of patients diagnosed with pneumonia and 5 cases of patients who were not diagnosed with pneumonia, an accuracy value of 98% was obtained, so it can be interpreted that the naïve bayes algorithm has a good correlation with the attributes contained in the dataset.Keywords: Naïve Bayes Algorithm, Pneumonia.Pneumonia adalah penyakit infeksi menular yang merupakan penyebab utama kematian pada balita di dunia. Di negara maju terdapat 4 juta kasus setiap tahun hingga total di seluruh dunia ada 156 juta kasus pneumonia anak balita setiap tahun. Pneumonia antara lain disebabkan oleh bakteri, virus, jamur, pajanan bahan kimia atau kerusakan fisik dari paru-paru, maupun pengaruh tidak langsung dari penyakit lain. Pneumonia ditandai dengan gejala batuk dan atau kesulitan bernapas seperti napas cepat, dan tarikan dinding dada bagian bawah ke dalam. Oleh Karena itu, deteksi dini penyakit pneumonia pada anak balita sangat diperlukan agar dapat mencegah dan menanggulangi penyakit tersebut kedalam tahap yang serius seperti tujuan penelitian ini yaitu untuk mendiagnosis penyakit pneumonia pada anak balita menggunakan klasifikasi data mining yaitu algoritma naïve bayes. Dari 118 kasus yang terdiri dari 113 kasus pasien yang terdiagnosis pneumonia dan 5 kasus pasien yang tidak terdiagnosis pneumonia maka diperoleh nilai akurasi sebesar 98%, sehingga dapat diartikan bahwa algoritma naïve bayes memiliki korelasi yang baik dengan atribut yang terdapat pada dataset.Keywords: Naïve Bayes Algorithm, Pneumonia.

Download Full-text

KLASIFIKASI SMS SPAM MENGGUNAKAN SUPPORT VECTOR MACHINE

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i2.693 ◽

2019 ◽

Vol 15 (2) ◽

pp. 275-280

Author(s):

Agus Setiyono ◽

Hilman F Pardede

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Spam Detection ◽

Support Vector Machine Algorithm ◽

Data Mining Techniques ◽

To Receive

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam. One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.

Download Full-text

Algoritma Naïve Bayes Untuk Memprediksi Kredit Macet Pada Koperasi Simpan Pinjam

Jurnal Informatika Upgris ◽

10.26877/jiu.v4i2.2919 ◽

2019 ◽

Vol 4 (2) ◽

Author(s):

Diah Puspitasari ◽

Syifa Sintia Al Khautsar ◽

Wida Prima Mustika

Keyword(s):

Data Mining ◽

Predictive Value ◽

Naive Bayes ◽

False Negative ◽

False Negative Rate ◽

True Positive Rate ◽

Naïve Bayes ◽

Data Mining Technique ◽

Application Form ◽

Using Data

Cooperatives are a forum that can help people, especially small and medium-sized communities. Cooperatives play an important role in the economic growth of the community such as the price of basic commodities which are relatively cheap and there are also cooperatives that offer borrowing and storing money for the community. Constraints that have been felt by this cooperative are that borrowers find it difficult to repay loan installments, causing bad credit. Because the cooperative in conducting credit analysis is carried out in a personal manner, namely by filling out the loan application form along with the requirements and conducting a field survey. Therefore there is a need for an evaluation to be carried out in lending to borrowers. To minimize these problems, it is necessary to detect customer criteria that are used to predict bad loans and to determine whether or not the elites are eligible to take credit using data mining. The data mining technique used is classification with the Naive Bayes method. Based on testing the accuracy of the resulting model obtained accuracy level of 59%, sensitivity (True Positive Rate (TP Rate) or Recall) of 46.80%, specificity (False Negative Rate (FN Rate or Precision) of 69.81%, Positive Predictive Value (PPV) of 57.89%, and Negative Predictive Value (NPV) of 59.67%.

Download Full-text

Prediksi Tingkat Kelulusan Tepat Waktu Mahasiswa Menggunakan Algoritma Naïve Bayes pada Universitas XYZ

Jurnal ULTIMATICS ◽

10.31937/ti.v12i2.1715 ◽

2020 ◽

Vol 12 (2) ◽

pp. 104-107

Author(s):

Nurhayati . ◽

Nuraeny Septianti ◽

Nani Retnowati ◽

Arief Wibowo

Keyword(s):

Data Mining ◽

Information Technology ◽

Data Processing ◽

Naive Bayes ◽

Naïve Bayes ◽

Bayes Method ◽

Processing Data ◽

Student Graduation ◽

Phase Data ◽

Bayes Algorithm

Data processing is imperative for the development of information technology. Almost any field of work has information about data. The data is made use of the analysis of the job. Nowadays, information data is imperatively processed to help workers in making decisions. This study discusses student prediction graduation rates by using the naïve Bayes method. That aims at providing information to college if they can use it properly to utilize the data of students who graduated by processing data mining. Based on the data mining process, steps founded that used producing information, namely predicting student graduation on time. The method of this study is Naïve Bayes with classification techniques. At this study, researchers used a six-phase data mining process of industry crossing standards in data mining known as CRISP-DM. The results of research concluded that the application of the Naive Bayes algorithm uses 4 (four) parameters namely ips, ipk, the number of credits, and graduation by getting an accuracy value of 80.95%.

Download Full-text