PENERAPAN DATA MINING DAN ALGORITMA NAÏVE BAYES UNTUK PEMILIHAN KONSENTRASI MAHASISWA MENGGUNAKAN METODE KLASIFIKASI

Muhammad Farid Satrio Wibowo; Nila Feby Puspitasari; Barka Satya

doi:10.24076/joism.2022v3i2.680

PENERAPAN DATA MINING DAN ALGORITMA NAÏVE BAYES UNTUK PEMILIHAN KONSENTRASI MAHASISWA MENGGUNAKAN METODE KLASIFIKASI

Journal of Information System Management (JOISM) ◽

10.24076/joism.2022v3i2.680 ◽

2022 ◽

Vol 3 (2) ◽

pp. 39-45

Author(s):

Muhammad Farid Satrio Wibowo ◽

Nila Feby Puspitasari ◽

Barka Satya

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes

Pemilihan konsentrasi atau minat studi merupakan hal yang tidak mudah dilakukan oleh seorang mahasiswa pada sebuah jurusan di Perguruan Tinggi. Mahasiswa akan berupaya memilih konsentrasi yang menurut mereka paling tepat dan sesuai dengan kompetensi dan minat studi, karena konsentrasi yang dipilih akan mempengaruhi minat belajar, prestasi, lama studi dan juga berpengaruh terhadap Indeks Prestasi Akademik (IPK) mahasiswa. Pentingnya memilih sebuah konsentrasi penjurusan bagi mahasiswa pada Institusi Perguruan Tinggi, maka perlu dibangun suatu model yang dapat membantu mahasiswa dalam memilih konsentrasi sesuai dengan kompetensi dan minat studi mahasiswa. Oleh karena itu, peneliti akan melakukan penelitian dengan membuat sistem untuk pemilihan konsentrasi mahasiswa menggunakan algoritma Naïve Bayes dengan metode klasifikasi. Untuk membantu dalam mengambil keputusan pemilihan konsentrasi, penelitian ini menggunakan teknik data mining sebagai proses pencarian pola yang diinginkan dalam sebuah database yang besar. Hasil pengujian yang telah dilakukan terhadap sample dataset sebanyak 1534 data menggunakan Algoritma Naïve Bayes, diperoleh bahwa hasil prediksi untuk menentukan konsentrasi memiliki nilai akurasi sebesar 84.27%. Variabel berpengaruh terhadap tingkat akurasi yang di hasilkan. Ukuran variabel yang sempit atau sedikit menyebabkan hasil akurasi yang kurang baik, tetapi ukuran variabel yang luas dapat menghasilkan akurasi ouput yang lebih optimal

Download Full-text

KLASIFIKASI SMS SPAM MENGGUNAKAN SUPPORT VECTOR MACHINE

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i2.693 ◽

2019 ◽

Vol 15 (2) ◽

pp. 275-280

Author(s):

Agus Setiyono ◽

Hilman F Pardede

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Spam Detection ◽

Support Vector Machine Algorithm ◽

Data Mining Techniques ◽

To Receive

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam. One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.

Download Full-text

Evaluasi Telemarketing Kartu Kredit Bank Menggunakan Algoritma Genetika untuk Seleksi Fitur dan Naive Bayes

Jurnal Aplikasi Pelayaran dan Kepelabuhanan ◽

10.30649/japk.v10i1.71 ◽

2020 ◽

Vol 10 (1) ◽

pp. 12

Author(s):

Ekka Pujo Ariesanto Akhmad

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Standard Process ◽

Industry Standard

Bagian pemasaran bank sudah menampung data dari nasabah atau pelanggan bank dengan cara memasarkan atau mensosialisasikan kartu kredit lewat telepon (telemarketing). Evaluasi telemarketing kartu kredit yang sudah dilakukan bank masih kurang membawa hasil dan berdaya guna. Salah satu cara yang tepat untuk evaluasi laporan telemarketing kartu kredit bank adalah menggunakan teknik data mining. Tujuan penggunaan data mining untuk mengetahui kecenderungan dan pola nasabah yang berpeluang untuk berlangganan kartu kredit yang ditawarkan bank. Metode penelitian menggunakan Cross Industry Standard Process for Data Mining (CRISP-DM) dengan Algoritma Genetika untuk Seleksi Fitur (GAFS) dan Naive Bayes (NB). Hasil penelitian menunjukkan jumlah atribut pada dataset telemarketing kartu kredit bank sejumlah 15 atribut terdiri dari 14 atribut biasa dan 1 atribut spesial. Dataset telemarketing bank mengandung data berdimensi tinggi, sehingga diterapkan metode GAFS. Setelah menerapkan metode GAFS diperoleh 7 atribut optimal terdiri dari 6 atribut biasa dan 1 atribut spesial. Enam atribut biasa meliputi pekerjaan, balance, rumah, pinjaman, durasi, poutcome. Sedangkan atribut spesial adalah target. Hasil penelitian menunjukkan algoritma NB mempunyai nilai akurasi 86,71%. Algoritma GAFS dan NB meningkatkan nilai akurasi menjadi 90,27% untuk prediksi nasabah bank yang mengambil kartu kredit.

Download Full-text

Prediction of benign and malignant breast cancer using data mining techniques

Journal of Algorithms & Computational Technology ◽

10.1177/1748301818756225 ◽

2018 ◽

Vol 12 (2) ◽

pp. 119-126 ◽

Cited By ~ 43

Author(s):

Vikas Chaurasia ◽

Saurabh Pal ◽

BB Tiwari

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Low Income ◽

Prediction Models ◽

Naive Bayes ◽

Naïve Bayes ◽

Low Income Countries ◽

Breast Cancer Dataset ◽

Cancer Dataset ◽

Rbf Network

Breast cancer is the second most leading cancer occurring in women compared to all other cancers. Around 1.1 million cases were recorded in 2004. Observed rates of this cancer increase with industrialization and urbanization and also with facilities for early detection. It remains much more common in high-income countries but is now increasing rapidly in middle- and low-income countries including within Africa, much of Asia, and Latin America. Breast cancer is fatal in under half of all cases and is the leading cause of death from cancer in women, accounting for 16% of all cancer deaths worldwide. The objective of this research paper is to present a report on breast cancer where we took advantage of those available technological advancements to develop prediction models for breast cancer survivability. We used three popular data mining algorithms (Naïve Bayes, RBF Network, J48) to develop the prediction models using a large dataset (683 breast cancer cases). We also used 10-fold cross-validation methods to measure the unbiased estimate of the three prediction models for performance comparison purposes. The results (based on average accuracy Breast Cancer dataset) indicated that the Naïve Bayes is the best predictor with 97.36% accuracy on the holdout sample (this prediction accuracy is better than any reported in the literature), RBF Network came out to be the second with 96.77% accuracy, J48 came out third with 93.41% accuracy.

Download Full-text

Analisa Komparasi Algoritma Decision Tree C4.5 dan Naïve Bayes untuk Prediksi Churn Berdasarkan Kelas Pelanggan Retail

International Journal of Natural Science and Engineering ◽

10.23887/ijnse.v3i3.23113 ◽

2019 ◽

Vol 3 (3) ◽

pp. 103

Author(s):

Ni Wayan Wardani ◽

Ni Kadek Ariasih

Keyword(s):

Data Mining ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes

Pelanggan adalah salah satu aset utama bagi perusahaan ritel. Perusahaan harus dapat mengenali bagaimana karakter pelanggan mereka sehingga mereka dapat mempertahankan pelanggan yang sudah ada agar tidak berhenti membeli dan pindah ke perusahaan ritel yang bersaing (churn). Salah satu model yang tepat untuk mengenali karakter pelanggan adalah model RFM (Recency, Frekuensi, Moneter). Model RFM mampu menghasilkan kelas pelanggan dan di setiap kelas pelanggan dapat dianalisis atau diprediksi dengan konsep data mining apakah pelanggan tetap sebagai pelanggan atau churn. Data yang digunakan berasal dari data pelanggan dan data penjualan di UD. Mawar Sari. Kelas pelanggan UD Mawar Sari yang dihasilkan dari model RFM adalah Dormant, Everyday, Golden dan Superstar. Konsep data mining dengan membangun model prediksi dalam penelitian ini menggunakan algoritma Decision Tree C4.5 dan Naïve Bayes. Di semua kelas pelanggan kinerja Algoritma Naïve Bayes lebih baik daripada Algoritma Decision Tree C4.5 dengan Recall 95,92%, Precision 84,15%, dan Accuracy 83,49% dan kelas pelanggan yang memiliki potensi churn tinggi adalah Dormant B, Dormant E, dan Dormant F.Kata Kunci: Prediksi Churn, RFM, C4.5, Naïve Bayes

Download Full-text

Algoritma Naïve Bayes Untuk Memprediksi Kredit Macet Pada Koperasi Simpan Pinjam

Jurnal Informatika Upgris ◽

10.26877/jiu.v4i2.2919 ◽

2019 ◽

Vol 4 (2) ◽

Author(s):

Diah Puspitasari ◽

Syifa Sintia Al Khautsar ◽

Wida Prima Mustika

Keyword(s):

Data Mining ◽

Predictive Value ◽

Naive Bayes ◽

False Negative ◽

False Negative Rate ◽

True Positive Rate ◽

Naïve Bayes ◽

Data Mining Technique ◽

Application Form ◽

Using Data

Cooperatives are a forum that can help people, especially small and medium-sized communities. Cooperatives play an important role in the economic growth of the community such as the price of basic commodities which are relatively cheap and there are also cooperatives that offer borrowing and storing money for the community. Constraints that have been felt by this cooperative are that borrowers find it difficult to repay loan installments, causing bad credit. Because the cooperative in conducting credit analysis is carried out in a personal manner, namely by filling out the loan application form along with the requirements and conducting a field survey. Therefore there is a need for an evaluation to be carried out in lending to borrowers. To minimize these problems, it is necessary to detect customer criteria that are used to predict bad loans and to determine whether or not the elites are eligible to take credit using data mining. The data mining technique used is classification with the Naive Bayes method. Based on testing the accuracy of the resulting model obtained accuracy level of 59%, sensitivity (True Positive Rate (TP Rate) or Recall) of 46.80%, specificity (False Negative Rate (FN Rate or Precision) of 69.81%, Positive Predictive Value (PPV) of 57.89%, and Negative Predictive Value (NPV) of 59.67%.

Download Full-text

APPLICATION OF NAIVE BAYES CLASSIFIER ALGORITHM IN DETERMINING NEW STUDENT ADMISSION PROMOTION STRATEGIES

Journal of Information Systems and Informatics ◽

10.33557/journal-isi.v1i1.2 ◽

2019 ◽

Vol 1 (1) ◽

pp. 14-28

Author(s):

Ahmad Haidar Mirza

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Statistical Techniques ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Promotion Strategies ◽

Effectiveness And Efficiency ◽

New Student

Data Mining is a process that uses statistical techniques, mathematics, artificial intelligence, machine learning to extract and identify useful information and related knowledge from large databases. Data mining is the process of finding new patterns in data by filtering large amounts of data. Data mining uses pattern recognition technology that is similar to statistical techniques and mathematical techniques. The patterns found can provide useful information for generating economic benefits, effectiveness and efficiency. Algorithm Naive Bayes Classifier is one method of data mining that can be used to support effective and efficient promotion strategies. The Naive Bayes Classifier algorithm is used to predict the interest of the study based on the calculations performed. The data used are new student registration data from 2014 until 2016 at Bina Darma University. The results of this study are new models that are expected to provide important information can be used to assist the Marketing Team of Bina Darma University Palembang in policy making and implementation of appropriate marketing strategy. The results obtained are expected to help to support the promotion strategies that impact on the effectiveness and efficiency of promotion and increase the number of new students who will register.

Download Full-text

Performance of Naïve Bayes, C4.5 and KNN using Breast Cancer, Iris and Hypothyroid Datasets

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8795.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 2193-2197

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Specific Pattern ◽

K Nearest Neighbor ◽

Data Mining Technique ◽

Digital Format ◽

Tree Classifier

Data mining usually specifies the discovery of specific pattern or analysis of data from a large dataset. Classification is one of an efficient data mining technique, in which class the data are classified are already predefined using the existing datasets. The classification of medical records in terms of its symptoms using computerized method and storing the predicted information in the digital format is of great importance in the diagnosis of various diseases in the medical field. In this paper, finding the algorithm with highest accuracy range is concentrated so that a cost-effective algorithm can be found. Here the data mining classification algorithms are compared with their accuracy of finding exact data according to the diagnosis report and their execution rate to identify how fast the records are classified. The classification technique based algorithms used in this study are the Naive Bayes Classifier, the C4.5 tree classifier and the K-Nearest Neighbor (KNN) to predict which algorithm is the best suited for classifying any kind of medical dataset. Here the datasets such as Breast Cancer, Iris and Hypothyroid are used to predict which of the three algorithms is suitable for classifying the datasets with highest accuracy of finding the records of patients with the particular health problems. The experimental results represented in the form of table and graph shows the performance and the importance of Naïve Bayes, C4.5 and K-Nearest Neighbor algorithms. From the performance outcome of the three algorithms the C4.5 algorithm is a lot better than the Naïve Bayes and the K-Nearest Neighbor algorithm.

Download Full-text

PENERAPAN ALGORITMA NAÏVE BAYES UNTUK MEMPREDIKSI KEPUTUSAN CALON NASABAH DAN NASABAH TETAPBANK BRI SYARIAH MENERIMA PENAWARAN PROGRAM DEPOSITO BERJANGKA

Jurnal Teknologi dan Informasi ◽

10.34010/jati.v8i1.906 ◽

2018 ◽

Vol 8 (1) ◽

Author(s):

Wahyu Nurjaya WK ◽

Yusrina Adani

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Data Preparation ◽

Standard Process ◽

Industry Standard

Bank BRI Syariah memiliki banyak produk yang menarik untuk ditawarkan kepada calon nasabah maupun nasabah tetap berupa produk jangka panjang atau jangka pendek, yang menawarkan banyak keuntungan bagi nasabah itu sendiri. Salah satu produknya adalah Deposito berjangka yang merupakan produk investasi dengan menyimpan uang dan penarikanya hanya bisa dilakukan pada kurun waktu tertentu yang telah di janjikan oleh pihak bank dengan persetujuan nasabah. Dengan telemarketing yang baik oleh pihak bank maka diharapkan calon nasabah dan nasabah tetap mengetahui produk ini.Telemarketing adalah salah satu cara dalam mempromosikan produk-produk atau jasa layanan yang ada di bank. Seorang telemarketing bank harus dapat membuat target nasabah, nasabah mana yang berpotensi untuk meningkatkan deposito dengan melihat data-data nasabah bank yang telah tersimpan dalam database. Dikarenakan database nasabah sangat besar, maka tidak mungkin untuk mencari pola prediksi calon nasabah atau nasabah tetap yang berminat untuk program Deposito dengan cara konvensional.Berdasarkan hal tersebut, pengelolaan data yang sangat besar bisa diatasi dengan memanfaatkan Data Mining yaitu proses iteratif dan interaktif untuk menentukan pola atau model baru yang sempurna, bermanfaat dan dapat dimengerti dalam suatu database yang sangat besar. Data Mining berisi pencarian trend pola yang diinginkan dalam database besar untuk membantu pengambilan keputusan diwaktu yang akan datang. Dengan menggunakan Data Mining diharapkan dapat mengoptimasikan proses prediksi data nasabah oleh seorang telemarketing, sehingga dia mampu menawarkan deposito dengan target calon nasabah atau nasabah tetap yang tepat sasaran. Adapun Teknik Klasifikasi Data Mining menggunakan algoritma Naïve Bayes. Naïve Bayes bekerja sangat efektif saat diuji pada dataset yang besar untuk menentukan pola dimasa lalu dan mencari fungsi yang akan menjadi pola penilaian data dimasa yang akan datang. Untuk mencapai hasil yang diharapkan metode CRISP-DM (Cross Industry Standard Process for Data Mining) sangat cocok sebagai solusi, melalui proses business understanding, data understanding, data preparation, modeling, evaluation dan deployment. Dengan ini hasil prediksi akan lebih akurat, sehingga untuk target telemarketing produk Deposito Bank BRI Syariah akan tepat sasaran.

Download Full-text

KLASIFIKASI KARAKTERISTIK KONSUMEN SEPEDA MOTOR MERK T DI JAWA BARAT MENGGUNAKAN METODE NAÏVE BAYES CLASSIFIER PADA DATA MINING

Jurnal Ilmiah Matematika dan Pendidikan Matematika ◽

10.20884/1.jmp.2017.9.2.2864 ◽

2017 ◽

Vol 9 (2) ◽

pp. 37

Author(s):

Jaka Aulia Pratama ◽

Zulhanif Zulhanif ◽

Yadi Suprijadi

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Model Classification ◽

Automatic Transmissions ◽

Market Planning ◽

Customer Classification

PT. JKL has a role as a main dealer of T’s brand are handling three types of motorcycle products in West Java. These are type of Sport, CUB, and Scooter(Automatic Transmissions). The company records the buyer of T’s brand motorcycle in the Customer Database (CDB). CDB collected from 2011 to 2013 yielded information of consumer characteristics which is necessary in market planning. Consumer characteristics are classified into two groups: Repeated Order and New Customer. Classification methods used in the study of Data Mining is the Naïve Bayes Classifier. Model classification is done by calculating the conditional probability to choose the greatest value of probability. The accuracy of the classification is 83% and the error classification is 17%.

Download Full-text

PENERAPAN DATA MINING UNTUK MENGKLASIFIKASI TINGKAT BAHAYA POLUTAN PM10 DI KOTA BANJARBARU

Technologia: Jurnal Ilmiah ◽

10.31602/tji.v11i3.3288 ◽

2020 ◽

Vol 11 (3) ◽

pp. 176

Author(s):

Nur Arminarahmah ◽

Mirza Yogi Kurniawan ◽

Al Fath Riza Kholdani

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Pm 10

AbstrakSalah satu aplikasi Data Mining adalah klasifikasi, kami menggunakan algoritma Naive Bayes untuk mengklasifikasikan tingkat bahaya polutan PM 10 untuk menghasilkan model yang digunakan untuk mendapatkan informasi tentang kondisi polutan PM10. Aplikasi awal menggunakan data dari wilayah kota Banjarbaru karena ada kabut asap dalam beberapa tahun terakhir di Kalimantan Selatan dan studi kasus penelitian ini di kota Banjarbaru karena telah meningkat sehingga mengganggu visibilitas dan menghambat semua kegiatan baik kantor maupun sekolah. Proses klasifikasi dimulai dengan mengolah data PM10 menggunakan atribut suhu, kelembaban dan waktu kemudian data yang diperoleh diklasifikasikan menggunakan algoritma naif bayes untuk menghasilkan kelas tingkat bahaya dari 0 - 4 dengan tingkat akurasi keberhasilan klasifikasi 60%.Kata Kunci : Datamining, Naive Bayes,PM10

Download Full-text