A Data-Driven Heart Disease Prediction Model Through K-Means Clustering-Based Anomaly Detection

2021 ◽  
Vol 2 (2) ◽  
Author(s):  
Rony Chowdhury Ripan ◽  
Iqbal H. Sarker ◽  
Syed Md. Minhaz Hossain ◽  
Md. Musfique Anwar ◽  
Raza Nowrozy ◽  
...  
Author(s):  
Xiaoming Yuan ◽  
Jiahui Chen ◽  
Kuan Zhang ◽  
Yuan Wu ◽  
Tingting Yang

Author(s):  
Tssehay Admassu Assegie

<span>In this study, the author proposed k-nearest neighbor (KNN) based heart disease prediction model. The author conducted an experiment to evaluate the performance of the proposed model. Moreover, the result of the experimental evaluation of the predictive performance of the proposed model is analyzed. To conduct the study, the author obtained heart disease data from Kaggle machine learning data repository. The dataset consists of 1025 observations of which 499 or 48.68% is heart disease negative and 526 or 51.32% is heart disease positive. Finally, the performance of KNN algorithm is analyzed on the test set. The result of performance analysis on the experimental results on the Kaggle heart disease data repository shows that the accuracy of the KNN is 91.99%</span>


BMJ Open ◽  
2014 ◽  
Vol 4 (5) ◽  
pp. e005025 ◽  
Author(s):  
Sun Ha Jee ◽  
Yangsoo Jang ◽  
Dong Joo Oh ◽  
Byung-Hee Oh ◽  
Sang Hoon Lee ◽  
...  

2021 ◽  
Vol 1 (4) ◽  
pp. 268-280
Author(s):  
Bamanga Mahmud , , , Ahmad ◽  
Ahmadu Asabe Sandra ◽  
Musa Yusuf Malgwi ◽  
Dahiru I. Sajoh

For the identification and prediction of different diseases, machine learning techniques are commonly used in clinical decision support systems. Since heart disease is the leading cause of death for both men and women around the world. Heart is one of the essential parts of human body, therefore, it is one of the most critical concerns in the medical domain, and several researchers have developed intelligent medical devices to support the systems and further to enhance the ability to diagnose and predict heart diseases. However, there are few studies that look at the capabilities of ensemble methods in developing a heart disease detection and prediction model. In this study, the researchers assessed that how to use ensemble model, which proposes a more stable performance than the use of base learning algorithm and these leads to better results than other heart disease prediction models. The University of California, Irvine (UCI) Machine Learning Repository archive was used to extract patient heart disease data records. To achieve the aim of this study, the researcher developed the meta-algorithm. The ensemble model is a superior solution in terms of high predictive accuracy and diagnostics output reliability, as per the results of the experiments. An ensemble heart disease prediction model is also presented in this work as a valuable, cost-effective, and timely predictive option with a user-friendly graphical user interface that is scalable and expandable. From the finding, the researcher suggests that Bagging is the best ensemble classifier to be adopted as the extended algorithm that has the high prediction probability score in the implementation of heart disease prediction.


Author(s):  
Rony Chowdhury Ripan ◽  
Iqbal H. Sarker ◽  
Md. Hasan Furhad ◽  
Md Musfique Anwar ◽  
Mohammed Moshiul Hoque

This paper presents an effective heart disease prediction model through detecting the anomalies, also known as outliers, in healthcare data using the unsupervised K-means clustering algorithm. Most existing approaches for detecting anomalies are based on constructing profiles of normal instances. However, such techniques require an adequate number of normal profiles to justify those models. Our proposed model first evaluates an \textit{optimal} value of K using Silhouette method. Next, it intends to locate anomalies that are far from a certain threshold distance with respect to their clusters. Finally, the five most popular classification techniques such as K-Nearest Neighbor (KNN), Random Forest (RF), Support Vector Machines (SVM), Naive Bayes (NB), and Logistic Regression (LR) are applied to build the resultant prediction model. The effectiveness of the proposed methodology is justified using a benchmark dataset of heart disease.


Author(s):  
B.V. Chowdary ◽  
Jaina Kedarnath ◽  
Rachamalla Vyshnavi ◽  
Valluri Lavakush ◽  
Chavula Shashidhar

Sign in / Sign up

Export Citation Format

Share Document