Efficient Data-Mining Algorithm for Predicting Heart Disease Based on an Angiographic Test

Background: The computerised classification and prediction of heart disease can be useful for medical personnel for the purpose of fast diagnosis with accurate results. This study presents an efficient classification method for predicting heart disease using a data-mining algorithm. Methods: The algorithm utilises the weighted support vector machine method for efficient classification of heart disease based on a binary response that indicates the presence or absence of heart disease as the result of an angiographic test. The optimal values of the support vector machine and the Radial Basis Function kernel parameters for the heart disease classification were determined via a 10-fold cross-validation method. The heart disease data was partitioned into training and testing sets using different percentages of the splitting ratio. Each of the training sets was used in training the classification method while the predictive power of the method was evaluated on each of the test sets using the Monte-Carlo cross-validation resampling technique. The effect of different percentages of the splitting ratio on the method was also observed. Results: The misclassification error rate was used to compare the performance of the method with three selected machine learning methods and was observed that the proposed method performs best over others in all cases considered. Conclusion: Finally, the results illustrate that the classification algorithm presented can effectively predict the heart disease status of an individual based on the results of an angiographic test.

Download Full-text

Using Stratified Sample and Grid Search to Improve Disease Prediction Accuracy of SVM

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.295-298.644 ◽

2013 ◽

Vol 295-298 ◽

pp. 644-647 ◽

Cited By ~ 1

Author(s):

Yu Kai Yao ◽

Hong Mei Cui ◽

Ming Wei Len ◽

Xiao Yun Chen

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Classification Accuracy ◽

Prediction Accuracy ◽

Support Vector ◽

Disease Prediction ◽

Data Mining Algorithm ◽

Grid Search ◽

Mining Algorithm ◽

Stratified Sample

SVM (Support Vector Machine) is a powerful data mining algorithm, and is mainly used to finish classification or regression tasks. In this literature, SVM is used to conduct disease prediction. We focus on integrating with stratified sample and grid search technology to improve the classification accuracy of SVM, thus, we propose an improved algorithm named SGSVM: Stratified sample and Grid search based SVM. To testify the performance of SGSVM, heart-disease data from UCI are used in our experiment, and the results show SGSVM has obvious improvement in classification accuracy, and this is very valuable especially in disease prediction.

Download Full-text

Data Mining Algorithm and the Effectiveness of Mathematics Classroom Teaching based on Support Vector Machine

International Journal of Database Theory and Application ◽

10.14257/ijdta.2016.9.11.15 ◽

2016 ◽

Vol 9 (11) ◽

pp. 163-174 ◽

Cited By ~ 1

Author(s):

Tang Qiang

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Mathematics Classroom ◽

Classroom Teaching ◽

Support Vector ◽

Data Mining Algorithm ◽

Mining Algorithm

Download Full-text

Prediction of Heart Disease Using 2-Tier SVM Data Mining Algorithm

International Journal of Advanced Research in Big Data Management System ◽

10.21742/ijarbms.2017.1.1.02 ◽

2017 ◽

Vol 1 (2) ◽

Cited By ~ 1

Keyword(s):

Data Mining ◽

Heart Disease ◽

Data Mining Algorithm ◽

Mining Algorithm

Download Full-text

Analisis Perbandingan Kernel Algoritma Support Vector Machine dalam Mengklasifikasikan Skripsi Teknik Informatika berdasarkan Abstrak

JOINS (Journal of Information System) ◽

10.33633/joins.v5i2.3715 ◽

2020 ◽

Vol 5 (2) ◽

pp. 240-249

Author(s):

Anggri Liani

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Text Mining ◽

Cross Validation ◽

Support Vector

Mahasiswa memiliki kewajiban menyelesaikan skripsi untuk menyelesaikan pendidikan jenjang S-1, namun justru menentukan topik skripsi adalah kesulitan pertama mahasiswa dalam pembuatan skripsi yang menjadi salah satu faktor mahasiswa lulus terlambat, Dengan melakukan pengklasifikasian skripsi berdasarkan abstrak dapat membantu mahasiswa dalam mencari referensi untuk menentukan topik skripsi. Metodelogi yang digunakan adalah proses text Mining dengan proses case folding, tekonizing, filtering, stemming, TF-IDF, data mining dan evaluation. Pembagian data menggunakan rasio 80% data latih dan 20% data uji. Pengklasifikasian menggunakan algoritma Support Vector Machine (SVM). Algoritma support vector machine (SVM) adalah salah satu algoritma pengklasifikasian yang memiliki beberapa kernel yaitu liniear dan 3 kernel yang paling dipertimbangkan. Validasi data menggunakan cross validation dengan 10-fold. Tingkat akurasi didapatkan 81%, presisi 82% dan recall 81% pada kernel liniear.

Download Full-text

Analisis Sentimen Twitter terhadap Tokoh Publik dengan Algoritma Naive Bayes dan Support Vector Machine

Simetris Jurnal Teknik Mesin Elektro dan Ilmu Komputer ◽

10.24176/simet.v11i2.4568 ◽

2021 ◽

Vol 11 (2) ◽

pp. 626-636

Author(s):

Tanthy Tawaqalia Widowati ◽

Mujiono Sadikin

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Cross Validation ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Fold Cross Validation

Salah satu media sosial yang berkembang adalah Twitter. Media sosial Twitter mempermudah masyarakat untuk bebas berpendapat melalui cuitan atau biasa disebut dengan tweets. Netizen dengan bebas menyampaikan opini pribadinya untuk topik apapun, termasuk persepsi terhadap tokoh publik. Artikel ini menyajikan hasil penelitian dan analisis sentimen masyarakat (netizen) terhadap tokoh publik, Nadiem Makariem sebagai Menteri Kementerian Pendidikan dan Kebudayaan baru. Penelitian ini menggunakan teknik data mining yang bertujuan untuk membandingkan hasil klasifikasi dari opini masyarakat yang dituliskan di Twitter. Dataset yang digunakan berasal dari tweets dengan kata kunci ”nadiem makariem”, ”kemendikbud” dan ”pak nadiem”. Tools RapidMiner digunakan untuk membantu tahap pre-processing dan klasifikasi menggunakan dua metode yaitu, Naive Bayes dan Support Vector Machine dengan evaluasi k-fold cross-validation. Dari hasil ujicoba diketahui bahwa untuk kasus yang diteliti, metode Naive Bayes menghasilkan kinerja yang lebih baik dengan accuracy 91.48%, precision 89.28% dan recall 91.58%.

Download Full-text

ANALISIS SENTIMEN GOJEK PADA MEDIA SOSIAL TWITTER DENGAN KLASIFIKASI SUPPORT VECTOR MACHINE (SVM

Jurnal Gaussian ◽

10.14710/j.gauss.v9i3.28932 ◽

2020 ◽

Vol 9 (3) ◽

pp. 376-390

Author(s):

Nur Fitriyah ◽

Budi Warsito ◽

Di Asih I Maruddani

Keyword(s):

Support Vector Machine ◽

Cross Validation ◽

Classification Model ◽

Support Vector ◽

Test Results ◽

Machine Method ◽

Support Vector Machine Method ◽

Rbf Kernel ◽

Negative Sentiment ◽

Fold Cross Validation

Appearance of PT Aplikasi Karya Anak Bangsa or as known as Gojek since 2015 give a convenience facility to people in Indonesia especially in daily activities. Sentiment analysis on Twitter social media can be the option to see how Gojek users respond to the services that have been provided. The response was classified into positive sentiment and negative sentiment using Support Vector Machine method with model evaluation 10-fold cross validation. The kernel used is the linear kernel and the RBF kernel. Data labeling can be done with manually and sentiment scoring. The test results showed that the RBF kernel gets overall accuracy and the highest kappa accuracy on manual data labeling and sentiment scoring. On manual data labeling, the overall accuracy is 79.19% and kappa accuracy is 16.52%. While the labeling of data with sentiment scoring obtained overall accuracy of 79.19% and kappa accuracy of 21%. The greater overall accuracy value and kappa accuracy obtained, the better performance of the classification model. Keywords: Gojek, Twitter, Support Vector Machine, overall accuracy, kappa accuracy

Download Full-text

Perbandingan Metode Naïve Bayes Dan Support Vector Machine Dalam Klasifikasi Penyakit Diabetes Melitus

Journal of Information Technology Ampera ◽

10.51519/journalita.volume1.isssue3.year2020.page133-143 ◽

2020 ◽

Vol 1 (3) ◽

pp. 133-143

Author(s):

Hilda Apriyani ◽

Kurniati Kurniati

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Cross Validation ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Diabetes Melitus

Diabetes melitus merupakan penyakit kronis yang terjadi akibat kadar glukosa didalam darah yang terlalu tinggi sehingga tidak adanya insulin. Dalam kurun waktu data di Rumah Sakit Islam Siti Khadijah Palembang yang dipengaruhi oleh jumlah dari pasien yang melakukan pemeriksaan kesehatan seperti penyakit diabetes melitus sehingga berpengaruh dalam hal klasifikasi data yang akan menyulitkan pihak rumah sakit. Maka dengan memanfaatkan data mining, pengklasifikasian untuk menentukan pasien yang telah melakukan pemeriksaan termasuk penderita penyakit diabetes atau tidak. Dengan adanya permasalahan tersebut maka penulis melakukan analisis perbandingan dari dua algoritma yaitu algoritma naïve bayes dan algoritma support vector machine untuk klasifikasi penyakit diabetes dengan menggunakan alat bantu WEKA dengan tools options Cross Validation dan Confussion Matrix dengan hasil akurasi tertinggi yaitu algoritma support vector machine dengan kernel polynomial yang hasilnya 96.2704% dan tingkat error sebanyak 3.7296% dapat disimpulkan algoritma yang akurat dalam klasifikasi penyakit diabetes yaitu algoritma support vector machine dengan kernel polynomial.

Download Full-text

Implementation and Analysis of the Performance of EDTA (Enhanced Decision Tree Data Mining Algorithm) for diagnosis of Angioplasty and Stents for Heart Disease Treatment

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i4.541543 ◽

2018 ◽

Vol 6 (4) ◽

pp. 541-543

Author(s):

Amarjeet Kaur ◽

◽

Ashok Jetawat2 ◽

...

Keyword(s):

Data Mining ◽

Heart Disease ◽

Decision Tree ◽

Data Mining Algorithm ◽

Disease Treatment ◽

Mining Algorithm ◽

Tree Data

Download Full-text

Analysis of rainfall classification over Tanah Laut disrict based on global climate indicators using support vector machine method

Journal of Physics Conference Series ◽

10.1088/1742-6596/2106/1/012009 ◽

2021 ◽

Vol 2106 (1) ◽

pp. 012009

Author(s):

N Hayah ◽

O Soesanto ◽

M A Rahman

Keyword(s):

Support Vector Machine ◽

Global Climate ◽

Climatic Conditions ◽

Training Data ◽

Classification Method ◽

Support Vector ◽

Rainfall Forecasting ◽

Machine Method ◽

Svm Classification ◽

The Relationship

Abstract The Support Vector Machine (SVM) classification method can be applied in various fields, one of which is meteorology and climatology in rainfall forecasting. Thus, a study was conducted by classifying rainfall to recognize the relationship between global phenomena and rainfall and the results of applying the classification using the SVM method to rainfall in the Tanah Laut Regency. The analysis is carried out using the SVM Multiclass concept with 4 categories of rainfall classification: low, medium, high, and Extreme. The kernel used in SVM is the RBF kernel with optimization parameters used, namely Cost (C) 1,5,10,15 and Gamma (γ) 1,5,10,15. The dataset formed is based on the annual period, climatic conditions, and seasonality. The Spearman Rank correlation test describes the relationship between global phenomena and rainfall with a correlation range of (−0.1456 ) − (0.43144) for the entire dataset. The implementation of the SVM classification method shows that the Cost (C) 10 and Gamma (γ) ≥ 5 parameters obtained the highest accuracy of 100% on the training data. In contrast, in testing the data testing, the accuracy was good, namely the accuracy of 78.00% in La Nina and 81.38% in seasonal periods.

Download Full-text

ANALISIS PERBANDINGAN ALGORITMA NAIVE BAYES DAN SUPPORT VECTOR MACHINE DALAM MENGKLASIFIKASIKAN JUMLAH PEMBACA ARTIKEL ONLINE

JIKA (Jurnal Informatika) ◽

10.31000/.v2i2.1521 ◽

2019 ◽

Vol 2 (2) ◽

Author(s):

Umbar Riyanto

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Cross Validation ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector

PT. Linktone Indonesia merupakan salah satu perusahaan yang bergerak dalam bidang portal berita online. Semakin banyaknya portal berita online di Indonesia, para penulis yang ada di PT. Linktone Indonesia harus dapat bersaing, agar artikel yang mereka publish mendapatkan jumlah pembaca yang maksimal. Jumlah pembaca pada sebuah artikel tidaklah menentu, dan sulit untuk diprediksi. Banyaknya jumlah artikel yang dimiliki, maka dapat dilakukan penelitian data mining untuk mengklasifikasi jumlah pembaca artikel. Terdapat beberapa algoritma dalam teknik klasifikasi, akan tetapi tidak semua algoritma memiliki kinerja dan tingkat keakuratan yang baik dalam mengklasifikasi jumlah pembaca artikel. Penelitian ini membandingkan dua algoritma klasifikasi antara Naive Bayes, Support Vector Machine dan Bagging pada tiap algoritma. Peneliti membagi menjadi 5 dataset dan menggunakan tools WEKA dengan tools options K-Folds Cross Validation dan Confussion Matrix. Hasil penelitian ini, dengan jumlah dataset 7111 record. Bagging kurang memperbaiki hasil klasifikasi dengan jumlah dataset yang besar dan memerlukan waktu pembuatan model yang sangat lama dengan klasifikasi Support Vector Machine. Sementara itu Naive Bayes dalam segi waktu pembuatan model mendapatkan waktu yang paling cepat.

Download Full-text