Customer Segmentation berdasarkan Usia, Jumlah Kredit dan Lama Kredit Nasabah di Bank XYZ menggunakan Model K-Means Clustering

Prosiding Seminar Nasional Universitas Ma Chung ◽

10.33479/snumc.v1i.228 ◽

2021 ◽

Vol 1 ◽

pp. 101-116

Author(s):

Moch Rizky Wijaya ◽

Gigih Satriyo Wibowo

Keyword(s):

Data Mining ◽

Internet Banking ◽

Customer Segmentation ◽

Silhouette Coefficient ◽

Cluster 2

Pelanggan internet banking tumbuh sangat cepat. Segmentasi nasabah dapat diterapkan berdasarkan data internet banking. Clustering adalah teknik data mining tanpa pengawasan yang dapat digunakan untuk segmentasi pelanggan. Penelitian ini membangun model clustering pada data profil nasabah bank berdasarkan data kredit sehinagga didapatkan segemntasi nasabah yang nantinya digunakan sebagai landasan keputusan untuk melakukan startegi pemasaran. Metode clustering menggunakan metode K-Means dengan validasi cluster menggunakan metode Silhouette coefficient. Berdasarkan Silhouette coefficient didapatkan nilai terbaik untuk 3 cluster yaitu cluster 0, 1, dan 2. Hasil cluster dengan k-means terbagi menjadi 3 cluster yaitu cluster 0 – rata-rata jumlah kredit lebih rendah, durasi pendek, dan pelanggan usia tua, cluster 1 – rata-rata jumlah kredit tinggi, durasi panjang, dan pelanggan paruh baya atau usai pertengahan, dan cluster 2 - rata-rata jumlah kredit lebih rendah, durasi pendek, dan pelanggan usia muda. Hasil segementasi tersebut dapat dijadikan acuan untuk melakukan startegi pemasaran kedepan.

Download Full-text

COMPARATIVE STUDY OF FUZZY C-MEANS AND K-MEANS ALGORITHM FOR GROUPING CUSTOMER POTENTIAL IN BRAND LIMBACK

Jurnal Riset Informatika ◽

10.34288/jri.v3i4.241 ◽

2021 ◽

Vol 3 (4) ◽

pp. 327-334

Author(s):

Difa Lazuardi Aditya ◽

Devi Fitrianah

Keyword(s):

Optimal Number ◽

Customer Segmentation ◽

Cluster Validation ◽

Fuzzy C Means ◽

Monetary Analysis ◽

A Value ◽

Silhouette Coefficient ◽

Potential Customers ◽

Cluster 2 ◽

Optimal Number Of Clusters

The customer is a stakeholder for a business, to maintain and increase customer enthusiasm and develop it for the company's performance, it is necessary to do customer segmentation which aims to find out potential customers. This study uses purchase transaction data from Brand Limback customers in the period 2020. The use of RFM (Recency, Frecuency, Monetary) analysis helps in determining the attributes used for customer segmentation. To determine the optimal number of clusters from the RFM dataset, the Elbow method is applied. The datasets generated from RFM are grouped using the Fuzzy C-Means and K-Means algorithms, the two algorithms will compare the quality in the formation of clusters using the Silhoutte Coefficient and Davies-Bouldin Index methods. Customer segmentation from the RFM dataset that has been clustered produces 7 optimal clusters, namely Cluster 0 is a bronze customer. Cluster 1 is a silver customer. Cluster 2 is a gold customer. Cluster 3 is a platinum customer. Cluster 4 is a diamond customer. Cluster 5 is a super customer, and cluster 6 is a superstar customer. The cluster validation of k-means using the silhouette coefficient produces a value of 0.934 while the Davies bouldin index produces a value of 0.155 and the validation results of the fuzzy c-means algorithm using the silhouette coefficient produces a value of 0.921 while the Davies bouldin index produces a value of 0.145.

Download Full-text

Data Mining dalam Pengelompokan Penyakit Pasien dengan Metode K-Medoids

Jurnal Informasi & Teknologi ◽

10.37034/jidt.v3i4.150 ◽

2021 ◽

pp. 181-189

Author(s):

Dwi Utari Iswavigra ◽

Sarjon Defit ◽

Gunadi Widi Nurcahyo

Keyword(s):

Data Mining ◽

Silhouette Coefficient ◽

Cluster 2 ◽

Diabetes Melitus

Penyakit merupakan suatu kondisi di mana pikiran dan tubuh mengalami semacam gangguan dan ke tidaknyamanan bagi yang mengalaminya. Semakin hari, jumlah pasien di Puskesmas Kuok semakin meningkat dengan beragam jenis penyakit yang berbeda-beda. Peningkatan jumlah pasien mengharuskan staff Puskesmas Kuok harus selalu memperbaharui data rekam medis pasien. Data rekam medis pasien ini berupa laporan yang berisikan jumlah pasien dan penyakit yang diderita. Berdasarkan data tersebut, pihak Puskesmas perlu untuk mengetahui informasi tentang penyakit yang paling rentan dan banyak diderita pasien. Penelitian ini bertujuan untuk mengelompokkan data penyakit pasien untuk mengetahui penyakit yang paling banyak diderita oleh pasien di Puskesmas Kuok Kabupaten Kampar. Pengelompokan data penyakit pasien dilakukan dengan tahapan Data Mining Clustering dan dilanjutkan dengan tahapan metode K-Medoids. Selanjutnya, di lakukan pengujian cluster menggunakan Silhouette Coefficient. Hasil penelitian ini menunjukkan bahwa pada cluster 1 penyakit yang paling banyak diderita oleh pasien adalah penyakit Diabetes Melitus tidak bergantung Insulin (tipe II) dengan total kasus sebanyak 435 kasus. Pada cluster 2, penyakit yang paling banyak diderita oleh pasien adalah penyakit Hipertensi Esensial (Primer) dengan total kasus sebanyak 2785 kasus. Untuk cluster ke 3, penyakit yang paling banyak diderita oleh pasien adalah penyakit Vulnus Laseratum, Punctum, dengan total kasus sebanyak 328 kasus. Dari hasil cluster yang diperoleh, maka didapatkan hasil pengujian Silhouette Coeficient sebesar 0,900033674.

Download Full-text

Penerapan Teknik Data Mining untuk Clustering Armada pada PT. Siaga Transport Indonesia Menggunakan Metode k-Means

EXPLORE ◽

10.35200/explore.v9i1.116 ◽

2019 ◽

Vol 9 (1) ◽

pp. 8

Author(s):

Nuzul Hikmah ◽

Dyah Ariyanti ◽

Melista Sugesti

Keyword(s):

Data Mining ◽

Silhouette Coefficient ◽

Cluster 2

PT. Siaga Transport Indonesia adalah perusahaan yang bergerak dibidang expedisi. Armada menjadi aset penting untuk perusahaan yang bergerak di bidang ini. Beratnya muatan yang diangkut, jauhnya jarak yang ditempuh, dan usia armada menjadi faktor penyebab kerusakan armada. Hal ini berpengaruh terhadap potensi armada. Untuk itu diperlukan analysis clustering menggunakan metode k-means untuk mengetahui armada yang potensial dan yang kurang potensial. Clustering merupakan teknik pengelompokkan objek yang didasarkan pada kemiripan antar objek dalam satu cluster. K-means merupakan metode clustering yang mempunyai kemampuan mengelompokkan data dalam jumlah cukup besar dengan waktu komputasi yang relatif cepat dan efisien. Dalam penelitian ini sistem akan mengelompokkan 33 armada dengan menggunakan variabel biaya operasional, tahun kendaraan, produktivitas armada dan kilometer armada menjadi 3 kelompok dengan kategori cluster armada berpotensi rendah, sedang dan tinggi. Dari hasil akhir sistem diketahui cluster 1 atau armada dengan potensi rendah ada 7 armada, cluster 2 atau armada dengan potensi sedang ada 14 armada, cluster 3 atau armada dengan potensi tinggi ada 12 armada. Pengujian dilakukan dengan metode silhouette coefficient dengan rentang variasi nilai yang dihasilkan yaitu dibawah nol sampai dengan 0,9.

Download Full-text

Penerapan Data Mining Untuk Prediksi Penjualan Mobil Menggunakan Metode K-Means Clustering

Jurnal Nasional Komputasi dan Teknologi Informasi (JNKTI) ◽

10.32672/jnkti.v3i3.2428 ◽

2020 ◽

Vol 3 (3) ◽

pp. 187-201

Author(s):

Sufajar Butsianto ◽

Nindi Tya Mayangwulan

Keyword(s):

Data Mining ◽

Clustering Data ◽

Cluster 2

Penggunaan mobil di Indonesia setiap tahunnya selalu meningkat dan membuat perusahaan otomotif berlomba-lomba dalam peningkatan penjualannya. Tujuan dari penelitian ini untuk mengelompokan data penjualan kedalam sebuah cluster dengan metode Data Mining Algoritma K-Means Clustering. Data Penjualan nantinya akan dikelompokan berdasarkan kemiripan data tersebut sehingga data dengan karakteristik yang sama akan berada dalam satu cluster. Atribut yang digunakan adalah brand dan penjualan. Cluster yang terbentuk setelah dilakukan proses K-Means Clustering terbagi menjadi tiga cluster yaitu Cluster 0 jumlah anggota 235 dengan presentase 26% dikategorikan Laris, Cluster 1 jumlah anggota 604 dengan presentase 67% dikategorikan Kurang Laris, dan Cluster 2 jumlah angota 61 dengan presentase 7% dikategorikan Paling Laris, dari proses clustering diatas dapat diperoleh validasi DBI (Davies Bouldin Index) dengan nilai 0,341

Download Full-text

Analisis Data Sebaran Bandwidth Menggunakan Algoritma Dbscan Untuk Menentukan Tingkat Kebutuhan Bandwidth Di Kabupaten Purwakarta

Rabit : Jurnal Teknologi dan Sistem Informasi Univrab ◽

10.36341/rabit.v5i2.1388 ◽

2020 ◽

Vol 5 (2) ◽

pp. 130-137

Author(s):

Teguh Iman Hermanto ◽

Yusuf Muhyidin

Keyword(s):

Data Mining ◽

Data Selection ◽

Cluster 2

Berdasarkan data yang tercatat pada tahun 2018 terdapat 43 organisasi perangkat daerah di kabupaten Purwakarta yang sudah mendapatkan bandwidth internet. Setiap organisasi perangkat daetah yang telah mendapatkan bandwidth mempunyai tingkat kebutuhan yang berbeda – beda ,namun saat ini jumlah pembagian bandwidth dan tingkat kebutuhan belum dapat dikelompokan. Tujuan dari penelitian ini untuk menetukan tingkat kebutuhan bandwidth di Purwakarta dengan cara melakukan analisis data mining terhadap data yang ada menggunakan algoritma DBSCAN sehingga akan terbentuk cluster yang yang dibagi berdasarkan tingkat kebutuhan. Pada penelitian ini metode analisis yang digunakan yaitu SEMMA (Sample, Explore, Modify, Model, Assess) tahapan SEMMA meliputi Data Selection, Pre-processing / cleaning, Transformation, Data Mining dan Assess / Evaluation. Hasil dari analisis menggunakan nilai minpts = 5 dan nilai epsilon = 3. Cluster yang terbentuk yaitu sebanyak 2 cluster, cluster 1 terdapat sebanyak 15 organisasi perangkat daerah dengan tingkat kebutuhan bandwidth rendah dan cluster 2 terdapat 21 organisasi perangkat daerah dengan tingkat kebutuhan bandwidth sedang, dan Noise terdapat 7 organisasi perangkat daerah dengan kebutuhan bandwidth yang terlalu tinggi.

Download Full-text

Clustering Fasilitas Kesehatan Berdasarkan Kecamatan Di Karawang Dengan Algoritma K-Means

BINA INSANI ICT JOURNAL ◽

10.51211/biict.v8i1.1488 ◽

2021 ◽

Vol 8 (1) ◽

pp. 83

Author(s):

Bagus Muhammad Islami ◽

Cepy Sukmayadi ◽

Tesa Nur Padilah

Keyword(s):

Data Mining ◽

Developing Countries ◽

Data Processing ◽

Health Problems ◽

Health Facilities ◽

Microsoft Excel ◽

Two Factors ◽

Clustering Data ◽

Index Value ◽

Cluster 2

Abstrak: Masalah kesehatan yang ada di dalam masyarakat terutama di negara- negara berkembang seperti Indonesia dipengaruhi oleh dua faktor yaitu aspek fisik dan aspek non fisik. Berdasarkan data yang diperoleh dari karawangkab.bps.go.id data dibagi menjadi 3 cluster yaitu sedikit, sedang dan terbanyak. Algoritma yang digunakan adalah K-Means cluster yang diimplementsikan menggunakan Microsoft Excel dan Rapidminer Studio. Hasil pengolahan data fasilitas kesehatan di karawang menghasilkan 3 cluster dengan cluster 1 yang mempunyai fasilitas kesehatan sedikit sebanyak 23 kecamatan, cluster 2 yang mempunyai fasilitas kesehatan sedang sebanyak 5 kecamatan dan cluster 3 yang mempunyai fasilitas kesehatan terbanyak terdapat 2 kecamatan. Kinerja yang dihasilkan dari algoritma K-means menghasilkan nilai Davies Boildin Index sebesar 0,109. Kata kunci: clustering, data mining, fasilitas kesehatan, K-Means. Abstract: Health problems that exist in society, especially in developing countries like Indonesia, are built by two factors, namely physical and non-physical aspects. Based on data obtained from karawangkab.bps.go.id the data is divided into 3 clusters, namely the least, medium and the most. The algorithm used is the K-Means cluster which is implemented using Microsoft Excel and Rapidminer Studio. The results of data processing of health facilities in Karawang produce 3 clusters with cluster 1 which has 23 sub-districts of health facilities, cluster 2 which has medium health facilities as many as 5 districts and cluster 3 which has the most health facilities in 2 districts. The performance resulting from the K-means algorithm results in a Davies Boildin Index value of 0.109. Keywords: clustering, data mining, health facilities, K-Means.

Download Full-text

Customers Grouping Using Data Mining Techniques in the Food Distribution Industry (A Case Study)

SRPH Journal of Applied management and Agile Organisation ◽

10.47176/sjamao.3.1.1 ◽

2020 ◽

Keyword(s):

Data Mining ◽

Customer Segmentation ◽

Food Distribution ◽

Data Mining Techniques ◽

Customer Lifetime ◽

Rfm Model ◽

Marketing Activities ◽

Using Data ◽

Production Company

Significant data development has required organizations to use a tool to understand the relationships between data and make various appropriate decisions based on the information obtained. Customer segmentation and analysis of their behavior in the manufacturing and distribution industries according to the purposefulness of marketing activities and effective communication and with customers has a particular importance. Customer segmentation using data mining techniques is mainly based on the variables of recency purchase (R), frequency of purchase (F) and monetary value of purchase (M) in RFM model. In this article, using the mentioned variables, twelve customer groups related to the BTB (business to business) of a food production company, are grouped. The grouping in this study is evaluated based on the K-means algorithm and the Davies-Bouldin index. As a result, customer grouping is divided into three groups and, finally the CLV (customer lifetime value) of each cluster is calculated, and appropriate marketing strategies for each cluster have been proposed.

Download Full-text

Goods Stock Management using the K-Means Algorithm Method

Jurnal Teknologi ◽

10.35134/jitekin.v9i2.15 ◽

2020 ◽

Vol 10 (1) ◽

pp. 22-45

Author(s):

Dhio Saputra

Keyword(s):

Data Mining ◽

Data Processing ◽

Test Results ◽

Stock Management ◽

Clustering Method ◽

Sales Data ◽

Using Data ◽

Cluster 2

The grouping of Mazaya products at PT. Bougenville Anugrah can still do manuals in calculating purchases, sales and product inventories. Requires time and data. For this reason, a research is needed to optimize the inventory of Mazaya goods by computerization. The method used in this research is K-Means Clustering on sales data of Mazaya products. The data processed is the purchase, sales and remaining inventory of Mazaya products in March to July 2019 totaling 40 pieces. Data is grouped into 3 clusters, namely cluster 0 for non-selling criteria, cluster 1 for best-selling criteria and cluster 2 for very best-selling criteria. The test results obtained are cluster 0 with 13 data, cluster 1 with 25 data and cluster 2 with 2 data. So to optimize inventory is to multiply goods in cluster 2, so as to save costs for management of Mazayaproducts that are not available. K-Means clustering method can be used for data processing using data mining in grouping data according to criteria.

Download Full-text

Predicting Browsers and Purchasers of Hotel Websites

Cornell Hospitality Quarterly ◽

10.1177/1938965512468225 ◽

2012 ◽

Vol 54 (1) ◽

pp. 38-48 ◽

Cited By ~ 16

Author(s):

Edmond H. C. Wu ◽

Rob Law ◽

Brianda Jiang

Keyword(s):

Data Mining ◽

Hong Kong ◽

Predictive Power ◽

Decision Rules ◽

Customer Segmentation ◽

Marketing Strategies ◽

Weight Of Evidence ◽

Marketing Resources ◽

Hotel Websites ◽

Analytical Power

A study of the online browsing and purchasing habits of some 1,400 outbound travelers in Hong Kong demonstrates the analytical power of weight-of-evidence (WOE) data mining. The WOE approach allows analysts to identify and transform the variables with the most predictive power regarding the likelihood of tourists’ online preferences and decisions. The study found that just over one-third of the respondents browsed hotel-related websites, and about half of those browsers had booked a room on those sites. Browsers in Hong Kong tended to be young, well educated, and well traveled. Those who used the hotel websites for purchases were, of course, part of the browser group, and were likewise relatively well educated. However, one unexpected variable set off those who used the websites for a hotel purchase, the length of their most recent trip. One possible reason is that long-haul tourists want to be sure of their accommodations, or this may reflect hotels’ free-night offers. The convenient use of model-based customer segmentation and decision rules would help hospitality practitioners effectively manage their marketing resources and activities, and enhance information-based marketing strategies to attract target customers.

Download Full-text

Data Mining for Potential Customer Segmentation in the Marketing Bank Dataset

JUITA Jurnal Informatika ◽

10.30595/juita.v9i1.7983 ◽

2021 ◽

Vol 9 (1) ◽

pp. 25

Author(s):

Maulida Ayu Fitriani ◽

Dany Candra Febrianto

Keyword(s):

Data Mining ◽

Random Forest ◽

Direct Marketing ◽

Class Imbalance ◽

Processing Technique ◽

Customer Segmentation ◽

Class Imbalance Problem ◽

Imbalance Problem ◽

Mining Methods ◽

Bank Marketing

Direct marketing is an effort made by the Bank to increase sales of its products and services, but the Bank sometimes has to contact a customer or prospective customer more than once to ascertain whether the customer or prospective customer is willing to subscribe to a product or service. To overcome this ineffective process several data mining methods are proposed. This study compares several data mining methods such as Naïve Bayes, K-NN, Random Forest, SVM, J48, AdaBoost J48 which prior to classification the SMOTE pre-processing technique was done in order to eliminate the class imbalance problem in the Bank Marketing dataset instance. The SMOTE + Random Forest method in this study produced the highest accuracy value of 92.61%.

Download Full-text