Heart Disease Risk Level Prediction: Knitting Machine Learning Classifiers

Machine learning is often perceived as a sophisticated technology accessible only to highly trained experts. This perception prevents many physicians and biologists from using it in their research. The goal of this paper is to dispel this outdated perception. We argue that recent developments in automated machine learning enable biomedical researchers to quickly build competitive machine learning classifiers without in-depth knowledge of the underlying algorithms. We study the case of predicting the risk of cardiovascular diseases. To support our claim, we compare automated machine learning against a graduate student on several important metrics, including the total time required to build machine learning models and the final classification accuracy on unseen test datasets. In particular, the graduate student spends one month manually building multiple machine learning classifiers and tuning their parameters with scikit-learn, a popular machine learning library, to obtain the ones that perform best on two publicly available datasets. We run an automated machine learning library called auto-sklearn on the same datasets. Our experiments find that automated machine learning takes one hour to produce classifiers that perform better than the ones built by the graduate student in one month. More importantly, building these classifiers requires only a few lines of standard code. Our findings are expected to change the way physicians see machine learning and to encourage wide adoption of Artificial Intelligence (AI) techniques in clinical domains.

Prediction of Heart Disease Risk Using Machine Learning with Correlation-based Feature Selection and Optimization Techniques

10.1109/icsc53193.2021.9673490 ◽

2021 ◽

Author(s):

Karna Vishnu Vardhana Reddy ◽

Irraivan Elamvazuthi ◽

Azrina Abd Aziz ◽

Sivajothi Paramasivam ◽

Hui Na Chua ◽

...

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Heart Disease ◽

Disease Risk ◽

Optimization Techniques ◽

Correlation Based Feature Selection ◽

Heart Disease Risk

Elimination and Backward Selection of Features (P-Value Technique) In Prediction of Heart Disease by Using Machine Learning Algorithms

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i6.5765 ◽

2021 ◽

Vol 12 (6) ◽

pp. 2650-2665

Author(s):

Ritu Aggrawal, Saurabh Pal

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Machine Learning Algorithms ◽

Model Verification ◽

P Value ◽

Accuracy Score ◽

Data Set ◽

Machine Learning Classifiers ◽

Backward Elimination ◽

Learning Classifiers

Background: Early prediction of cardiovascular disease can help guide lifestyle changes for high-risk patients, thereby reducing complications. We propose a coronary heart disease data set analysis technique to predict a person's risk based on their clinical history. The methods introduced may be integrated into multiple uses, such as developing decision support systems, developing a risk management network, and assisting experts and clinical staff. Methods: We employed the Framingham Heart Study dataset, which is publicly available on Kaggle, to train several machine learning classifiers, namely logistic regression (LR), K-nearest neighbor (KNN), Naïve Bayes (NB), decision tree (DT), random forest (RF) and gradient boosting classifier (GBC), for disease prediction. The p-value method was used for feature elimination, and the selected features were used for further prediction. Various thresholds are used with different classifiers to make predictions. To estimate the precision of the classifiers, the ROC curve, confusion matrix and AUC value are considered for model verification. The six classifiers are compared on their performance in predicting chronic heart disease (CHD). Results: After applying the p-value backward elimination statistical method to the 10-year CHD data set, 6 significant features were selected from 14 features with p <0.5. Among the machine learning classifiers, GBC has the highest accuracy score, 87.61%. Conclusions: Combining statistical methods, such as p-value backward elimination, with machine learning classifiers improves the accuracy of the classifier and shortens the running time of the machine.
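The verification measures named in this abstract (confusion matrix at a chosen threshold, and AUC) can be illustrated with a minimal sketch. It is not the authors' code: the labels and risk scores below are hypothetical, the function names are made up for illustration, and the AUC is computed with the pairwise-ranking definition rather than by tracing the ROC curve.

```python
from typing import List, Tuple


def confusion_matrix(y_true: List[int], scores: List[float],
                     threshold: float) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn), predicting positive when score >= threshold."""
    tp = fp = tn = fn = 0
    for label, score in zip(y_true, scores):
        predicted = 1 if score >= threshold else 0
        if predicted == 1 and label == 1:
            tp += 1
        elif predicted == 1 and label == 0:
            fp += 1
        elif predicted == 0 and label == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def roc_auc(y_true: List[int], scores: List[float]) -> float:
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(y_true, scores) if y == 1]
    negatives = [s for y, s in zip(y_true, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical ground-truth labels (1 = disease) and predicted risk scores.
y_true = [0, 0, 1, 1, 0, 1, 0, 1]
scores = [0.10, 0.40, 0.35, 0.80, 0.20, 0.70, 0.55, 0.90]

for threshold in (0.3, 0.5):
    tp, fp, tn, fn = confusion_matrix(y_true, scores, threshold)
    accuracy = (tp + tn) / len(y_true)
    print(f"threshold={threshold}: tp={tp} fp={fp} tn={tn} fn={fn} "
          f"accuracy={accuracy:.3f}")

print(f"AUC={roc_auc(y_true, scores):.3f}")
```

Lowering the threshold trades false negatives for false positives, while the AUC summarizes ranking quality independently of any single threshold, which is one reason to report both.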

Incidence, prevalence and coronary heart disease risk level in known Type 2 diabetes: a sentinel practice network study in the Basque Country, Spain

Diabetologia ◽

10.1007/s00125-003-1137-1 ◽

2003 ◽

Vol 46 (7) ◽

pp. 899-909 ◽

Cited By ~ 43

Author(s):

J. M. Arteagoitia ◽

M. I. Larrañaga ◽

J. L. Rodriguez ◽

I. Fernandez ◽

J. A. Piniés

Keyword(s):

Type 2 Diabetes ◽

Coronary Heart Disease ◽

Heart Disease ◽

Coronary Heart Disease Risk ◽

Disease Risk ◽

Basque Country ◽

Risk Level ◽

Heart Disease Risk ◽

Practice Network

Heart Disease Risk Prediction Using Machine Learning Classifiers with Attribute Evaluators

Applied Sciences ◽

10.3390/app11188352 ◽

2021 ◽

Vol 11 (18) ◽

pp. 8352

Author(s):

Karna Vishnu Vardhana Reddy ◽

Irraivan Elamvazuthi ◽

Azrina Abd Aziz ◽

Sivajothi Paramasivam ◽

Hui Na Chua ◽

...

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Risk Prediction ◽

Disease Risk ◽

Prediction Method ◽

Chi Squared ◽

Attribute Evaluators ◽

Roc Area ◽

Heart Disease Risk ◽

Validation Testing

Cardiovascular diseases (CVDs) kill about 20.5 million people every year. Early prediction can help people to change their lifestyles and to ensure proper medical treatment if necessary. In this research, ten machine learning (ML) classifiers from different categories, such as Bayes, functions, lazy, meta, rules, and trees, were trained for efficient heart disease risk prediction using the full set of attributes of the Cleveland heart dataset and the optimal attribute sets obtained from three attribute evaluators. The performance of the algorithms was appraised using a 10-fold cross-validation testing option. Finally, we tuned the number of nearest neighbors, the hyperparameter ‘k’, in the instance-based (IBk) classifier. The sequential minimal optimization (SMO) classifier achieved an accuracy of 85.148% using the full set of attributes and the highest accuracy, 86.468%, using the optimal attribute set obtained from the chi-squared attribute evaluator. Meanwhile, the meta classifier bagging with logistic regression (LR) provided the highest ROC area of 0.91 using both the full and optimal attribute sets obtained from the ReliefF attribute evaluator. Overall, the SMO classifier stood as the best prediction method compared to other techniques, and IBk achieved an 8.25% accuracy improvement by tuning the hyperparameter ‘k’ to 9 with the chi-squared attribute set.
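Tuning ‘k’ under 10-fold cross-validation, as this abstract describes, can be sketched in plain Python. This is an illustrative assumption, not the authors' setup: it does not use the IBk implementation or the Cleveland dataset, the two-feature data are synthetic, and all function names are hypothetical.

```python
import random
from collections import Counter


def knn_predict(train_x, train_y, query, k):
    """Majority vote among the k training points nearest to `query`."""
    def sq_dist(i):
        return sum((a - b) ** 2 for a, b in zip(train_x[i], query))
    nearest = sorted(range(len(train_x)), key=sq_dist)[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]


def fold_indices(n, folds=10):
    """Split indices 0..n-1 into `folds` disjoint test folds (strided)."""
    return [list(range(start, n, folds)) for start in range(folds)]


def cv_accuracy(x, y, k, folds=10):
    """Fraction of points classified correctly when each fold is held out once."""
    correct = 0
    for test_idx in fold_indices(len(x), folds):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(x)) if i not in held_out]
        train_x = [x[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct += sum(knn_predict(train_x, train_y, x[i], k) == y[i]
                       for i in test_idx)
    return correct / len(x)


# Synthetic stand-in data: two overlapping Gaussian classes, 50 points each.
rng = random.Random(0)
data = [((rng.gauss(c * 1.5, 1.0), rng.gauss(c * 1.5, 1.0)), c)
        for c in (0, 1) for _ in range(50)]
rng.shuffle(data)  # shuffle so the strided folds mix both classes
x = [features for features, _ in data]
y = [label for _, label in data]

# Odd candidate values avoid tied votes in a two-class problem.
candidates = [1, 3, 5, 7, 9]
results = {k: cv_accuracy(x, y, k) for k in candidates}
best_k = max(candidates, key=lambda k: results[k])
print(results)
print("best k:", best_k)
```

Each candidate ‘k’ is scored on held-out folds only, so the selected value reflects generalization to unseen points instead of fit to the training data.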

Proficiency Assessment of Machine Learning Classifiers: An Implementation for the Prognosis of Breast Tumor and Heart Disease Classification

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2020.0111170 ◽

2020 ◽

Vol 11 (11) ◽

Author(s):

Talha Ahmed Khan ◽

Kushsairy A. ◽

Shahzad Nasim ◽

Muhammad Alam ◽

Zeeshan Shahid ◽

...

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Breast Tumor ◽

Disease Classification ◽

Machine Learning Classifiers ◽

Learning Classifiers ◽

Proficiency Assessment

Heart Disease Risk Level Prediction: Knitting Machine Learning Classifiers

A Comparative Analysis of Machine Learning Classifiers for Robust Heart Disease Prediction

Applying Machine Learning Classifiers on ECG Dataset for Predicting Heart Disease

Identifying the Predictive Capability of Machine Learning Classifiers for Designing Heart Disease Detection System

FA-1D-CNN Implementation to Improve Diagnosis of Heart Disease Risk Level

Physician-Friendly Machine Learning: A Case Study with Cardiovascular Disease Risk Prediction

Prediction of Heart Disease Risk Using Machine Learning with Correlation-based Feature Selection and Optimization Techniques

Elimination and Backward Selection of Features (P-Value Technique) In Prediction of Heart Disease by Using Machine Learning Algorithms

Incidence, prevalence and coronary heart disease risk level in known Type 2 diabetes: a sentinel practice network study in the Basque Country, Spain

Heart Disease Risk Prediction Using Machine Learning Classifiers with Attribute Evaluators

Proficiency Assessment of Machine Learning Classifiers: An Implementation for the Prognosis of Breast Tumor and Heart Disease Classification
