Prediction of Cardiovascular Disease using Machine Learning Algorithms

Background/Aim: Healthcare is an unavoidable assignment to be done in human life. Cardiovascular sickness is a general class for a scope of infections that are influencing heart and veins. The early strategies for estimating the cardiovascular sicknesses helped in settling on choices about the progressions to have happened in high-chance patients which brought about the decrease of their dangers. Methods: In the proposed research, we have considered informational collection from kaggle and it doesn't require information pre-handling systems like the expulsion of noise data, evacuation of missing information, filling default esteems if applicable and classification of attributes for prediction and decision making at different levels. The performance of the diagnosis model is obtained by using methods like classification, accuracy, sensitivity and specificity analysis. This paper proposes a prediction model to predict whether a people have a cardiovascular disease or not and to provide an awareness or diagnosis on that. This is done by comparing the accuracies of applying rules to the individual results of Support Vector Machine, Random forest, Naive Bayes classifier and logistic regression on the dataset taken in a region to present an accurate model of predicting cardiovascular disease. Results: The machine learning algorithms under study were able to predict cardiovascular disease in patients with accuracy between 58.71% and 77.06%. Conclusions: It was shown that Logistic Regression has better Accuracy (77.06 %) when compared to different Machine-learning Algorithms.

Download Full-text

Prediction of Prostate Cancer using Machine Learning Algorithms

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.e6754.018520 ◽

2020 ◽

Vol 8 (5) ◽

pp. 5353-5362

Keyword(s):

Prostate Cancer ◽

Machine Learning ◽

Logistic Regression ◽

Random Forest ◽

Missing Values ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Cancer Disease ◽

The Individual

Background/Aim: Prostate cancer is regarded as the most prevalent cancer in the word and the main cause of deaths worldwide. The early strategies for estimating the prostate cancer sicknesses helped in settling on choices about the progressions to have happened in high-chance patients which brought about the decrease of their dangers. Methods: In the proposed research, we have considered informational collection from kaggle and we have done pre-processing tasks for missing values .We have three missing data values in compactness attribute and two missing values in fractal dimension were replaced by mean of their column values .The performance of the diagnosis model is obtained by using methods like classification, accuracy, sensitivity and specificity analysis. This paper proposes a prediction model to predict whether a people have a prostate cancer disease or not and to provide an awareness or diagnosis on that. This is done by comparing the accuracies of applying rules to the individual results of Support Vector Machine, Random forest, Naive Bayes classifier and logistic regression on the dataset taken in a region to present an accurate model of predicting prostate cancer disease. Results: The machine learning algorithms under study were able to predict prostate cancer disease in patients with accuracy between 70% and 90%. Conclusions: It was shown that Logistic Regression and Random Forest both has better Accuracy (90%) when compared to different Machine-learning Algorithms.

Download Full-text

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset

International Journal of Computer Science and Mobile Computing ◽

10.47760/ijcsmc.2021.v10i03.002 ◽

2021 ◽

Vol 10 (3) ◽

pp. 14-25

Author(s):

Parilkumar Shiroya

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Logistic Regression ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbor

Download Full-text

Tremor Identification Using Machine Learning in Parkinson's Disease

Early Detection of Neurological Disorders Using Machine Learning Systems - Advances in Medical Technologies and Clinical Practice ◽

10.4018/978-1-5225-8567-1.ch008 ◽

2019 ◽

pp. 128-151

Author(s):

Angana Saikia ◽

Vinayak Majhi ◽

Masaraf Hussain ◽

Sudip Paul ◽

Amitava Datta

Keyword(s):

Machine Learning ◽

Parkinson’S Disease ◽

Support Vector Machine ◽

Parkinson's Disease ◽

Discriminant Analysis ◽

Learning Algorithms ◽

The Body ◽

Machine Learning Algorithms ◽

Support Vector

Tremor is an involuntary quivering movement or shake. Characteristically occurring at rest, the classic slow, rhythmic tremor of Parkinson's disease (PD) typically starts in one hand, foot, or leg and can eventually affect both sides of the body. The resting tremor of PD can also occur in the jaw, chin, mouth, or tongue. Loss of dopamine leads to the symptoms of Parkinson's disease and may include a tremor. For some people, a tremor might be the first symptom of PD. Various studies have proposed measurable technologies and the analysis of the characteristics of Parkinsonian tremors using different techniques. Various machine-learning algorithms such as a support vector machine (SVM) with three kernels, a discriminant analysis, a random forest, and a kNN algorithm are also used to classify and identify various kinds of tremors. This chapter focuses on an in-depth review on identification and classification of various Parkinsonian tremors using machine learning algorithms.

Download Full-text

FEASIBILITY OF MACHINE LEARNING METHODS FOR SEPARATING WOOD AND LEAF POINTS FROM TERRESTRIAL LASER SCANNING DATA

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-2-w4-157-2017 ◽

2017 ◽

Vol IV-2/W4 ◽

pp. 157-164 ◽

Cited By ~ 6

Author(s):

D. Wang ◽

M. Hollaus ◽

N. Pfeifer

Keyword(s):

Machine Learning ◽

Laser Scanning ◽

Learning Algorithms ◽

Terrestrial Laser Scanning ◽

Gaussian Mixture ◽

Machine Learning Algorithms ◽

Support Vector ◽

Area Index ◽

Training Samples

Classification of wood and leaf components of trees is an essential prerequisite for deriving vital tree attributes, such as wood mass, leaf area index (LAI) and woody-to-total area. Laser scanning emerges to be a promising solution for such a request. Intensity based approaches are widely proposed, as different components of a tree can feature discriminatory optical properties at the operating wavelengths of a sensor system. For geometry based methods, machine learning algorithms are often used to separate wood and leaf points, by providing proper training samples. However, it remains unclear how the chosen machine learning classifier and features used would influence classification results. To this purpose, we compare four popular machine learning classifiers, namely Support Vector Machine (SVM), Na¨ıve Bayes (NB), Random Forest (RF), and Gaussian Mixture Model (GMM), for separating wood and leaf points from terrestrial laser scanning (TLS) data. Two trees, an <i>Erytrophleum fordii</i> and a <i>Betula pendula</i> (silver birch) are used to test the impacts from classifier, feature set, and training samples. Our results showed that RF is the best model in terms of accuracy, and local density related features are important. Experimental results confirmed the feasibility of machine learning algorithms for the reliable classification of wood and leaf points. It is also noted that our studies are based on isolated trees. Further tests should be performed on more tree species and data from more complex environments.

Download Full-text

Machine learning in the diagnosis of Myocardial Infarction with Non-Obstructive Coronary Arteries

European Heart Journal ◽

10.1093/eurheartj/ehab724.3067 ◽

2021 ◽

Vol 42 (Supplement_1) ◽

Author(s):

M J Espinosa Pascual ◽

P Vaquero Martinez ◽

V Vaquero Martinez ◽

J Lopez Pais ◽

B Izquierdo Coronel ◽

...

Keyword(s):

Machine Learning ◽

Myocardial Infarction ◽

Support Vector Machine ◽

Logistic Regression ◽

Random Forest ◽

Obstructive Coronary Artery Disease ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Classification Model ◽

Support Vector

Abstract Introduction Out of all patients admitted with Myocardial Infarction, 10 to 15% have Myocardial Infarction with Non-Obstructive Coronaries Arteries (MINOCA). Classification algorithms based on deep learning substantially exceed traditional diagnostic algorithms. Therefore, numerous machine learning models have been proposed as useful tools for the detection of various pathologies, but to date no study has proposed a diagnostic algorithm for MINOCA. Purpose The aim of this study was to estimate the diagnostic accuracy of several automated learning algorithms (Support-Vector Machine [SVM], Random Forest [RF] and Logistic Regression [LR]) to discriminate between people suffering from MINOCA from those with Myocardial Infarction with Obstructive Coronary Artery Disease (MICAD) at the time of admission and before performing a coronary angiography, whether invasive or not. Methods A Diagnostic Test Evaluation study was carried out applying the proposed algorithms to a database constituted by 553 consecutive patients admitted to our Hospital with Myocardial Infarction. According to the definitions of 2016 ESC Position Paper on MINOCA, patients were classified into two groups: MICAD and MINOCA. Out of the total 553 patients, 214 were discarded due to the lack of complete data. The set of machine learning algorithms was trained on 244 patients (training sample: 75%) and tested on 80 patients (test sample: 25%). A total of 64 variables were available for each patient, including demographic, clinical and laboratorial features before the angiographic procedure. Finally, the diagnostic precision of each architecture was taken. Results The most accurate classification model was the Random Forest algorithm (Specificity [Sp] 0.88, Sensitivity [Se] 0.57, Negative Predictive Value [NPV] 0.93, Area Under the Curve [AUC] 0.85 [CI 0.83–0.88]) followed by the standard Logistic Regression (Sp 0.76, Se 0.57, NPV 0.92 AUC 0.74 and Support-Vector Machine (Sp 0.84, Se 0.38, NPV 0.90, AUC 0.78) (see graph). The variables that contributed the most in order to discriminate a MINOCA from a MICAD were the traditional cardiovascular risk factors, biomarkers of myocardial injury, hemoglobin and gender. Results were similar when the 19 patients with Takotsubo syndrome were excluded from the analysis. Conclusion A prediction system for diagnosing MINOCA before performing coronary angiographies was developed using machine learning algorithms. Results show higher accuracy of diagnosing MINOCA than conventional statistical methods. This study supports the potential of machine learning algorithms in clinical cardiology. However, further studies are required in order to validate our results. FUNDunding Acknowledgement Type of funding sources: None. ROC curves of different algorithms

Download Full-text

Comparison of common machine learning algorithms trained with multi-zone models for identifying the location and strength of indoor pollutant sources

Indoor and Built Environment ◽

10.1177/1420326x20931576 ◽

2020 ◽

pp. 1420326X2093157

Author(s):

Yu Huang ◽

Zhi Gao ◽

Hongguang Zhang

Keyword(s):

Machine Learning ◽

Meteorological Parameters ◽

Human Life ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Identification Accuracy ◽

Sensor Data ◽

Support Vector ◽

Accurate Identification ◽

Pollutant Sources

The accurate identification of the characteristics of pollutant sources can effectively prevent the loss of human life and property damage caused by the sudden release of harmful chemicals in emergency situations. Machine learning algorithms, artificial neural network (ANN), support vector machine (SVM), k-nearest neighbour (KNN) and naive Bayesian (NB) classification can be used to identify the location of pollutant sources with limited sensor data inputs. In this study, the identification accuracy of the four above-mentioned machine learning algorithms was investigated and compared, considering the different sensor layouts, eigenvector inputs, meteorological parameters and number of samples. The results show that the collection of pollutant concentrations over an extended period of time could improve identification accuracy. Additional sensors were required to reach the same identification accuracy after the introduction of distributed meteorological parameters. Increasing the number of trained samples by a factor of five improved the identification accuracy of KNN by 22% and that of SVM by 1.7%; however, ANN and NB classification remained basically unchanged. When identifying the release mass of the pollutant source, multiple linear, ANN and SVM regression models were adopted. Results show that ANN performs best, whereas SVM provides the least optimal performance.

Download Full-text

Invisible experience to real-time assessment in elite tennis athlete training: Sport-specific movement classification based on wearable MEMS sensor data

Proceedings of the Institution of Mechanical Engineers Part P Journal of Sports Engineering and Technology ◽

10.1177/17543371211050312 ◽

2021 ◽

pp. 175433712110503

Author(s):

Mingyue Wu ◽

Ran Wang ◽

Yang Hu ◽

Mengjiao Fan ◽

Yufan Wang ◽

...

Keyword(s):

Machine Learning ◽

Real Time ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Test Accuracy ◽

Z Score ◽

Mems Sensor ◽

Score Normalization

This study examined the reliability of a tennis stroke classification and assessment platform consisting of a single low-cost MEMS sensor in a wrist-worn wearable device, smartphone, and computer. The data that was collected was transmitted via Bluetooth and analyzed by machine learning algorithms. Twelve right-handed male elite tennis athletes participated in the study, and each athlete performed 150 strokes. The results from three machine learning algorithms regarding their recognition and classification of the real-time data stream were compared. Stroke recognition and classification went through pre-processing, segmentation, feature extraction, and classification with Support Vector Machine (SVM), including SVM without normalization, SVM with Min–Max, SVM with Z-score normalization, K-nearest neighbor (K-NN), and Naive Bayes (NB) machine learning algorithms. During the data training process, 10-fold cross-validation was used to avoid overfitting and suitable parameters were found within the SVM classifiers. The best classifier was achieved when C = 1 using the RBF kernel function. Different machine learning algorithms’ classification of unique stroke types yielded highly reliable clusters within each stroke type with the highest test accuracy of 99% achieved by SVM with Min–Max normalization and 98.4% achieved using SVM with a Z-score normalization classifier.

Download Full-text

Tremor Identification Using Machine Learning in Parkinson's Disease

Research Anthology on Diagnosing and Treating Neurocognitive Disorders ◽

10.4018/978-1-7998-3441-0.ch018 ◽

2021 ◽

pp. 341-365

Author(s):

Angana Saikia ◽

Vinayak Majhi ◽

Masaraf Hussain ◽

Sudip Paul ◽

Amitava Datta

Keyword(s):

Machine Learning ◽

Parkinson’S Disease ◽

Support Vector Machine ◽

Parkinson's Disease ◽

Discriminant Analysis ◽

Learning Algorithms ◽

The Body ◽

Machine Learning Algorithms ◽

Support Vector

Download Full-text

A Novel Ensemble Stacking Classification of Genetic Variations Using Machine Learning Algorithms

International Journal of Image and Graphics ◽

10.1142/s0219467823500158 ◽

2021 ◽

Author(s):

Jahnavi Yeturu ◽

Poongothai Elango ◽

S. P. Raja ◽

P. Nagendra Kumar

Keyword(s):

Machine Learning ◽

Heart Diseases ◽

Learning Algorithms ◽

Genetic Mutation ◽

Machine Learning Algorithms ◽

Support Vector ◽

Genetic Mutations ◽

Validation Data ◽

Data Set

Genetics is the clinical review of congenital mutation, where the principal advantage of analyzing genetic mutation of humans is the exploration, analysis, interpretation and description of the genetic transmitted and inherited effect of several diseases such as cancer, diabetes and heart diseases. Cancer is the most troublesome and disordered affliction as the proportion of cancer sufferers is growing massively. Identification and discrimination of the mutations that impart to the enlargement of tumor from the unbiased mutations is difficult, as majority tumors of cancer are able to exercise genetic mutations. The genetic mutations are systematized and categorized to sort the cancer by way of medical observations and considering clinical studies. At the present time, genetic mutations are being annotated and these interpretations are being accomplished either manually or using the existing primary algorithms. Evaluation and classification of each and every individual genetic mutation was basically predicated on evidence from documented content built on medical literature. Consequently, as a means to build genetic mutations, basically, depending on the clinical evidences persists a challenging task. There exist various algorithms such as one hot encoding technique is used to derive features from genes and their variations, TF-IDF is used to extract features from the clinical text data. In order to increase the accuracy of the classification, machine learning algorithms such as support vector machine, logistic regression, Naive Bayes, etc., are experimented. A stacking model classifier has been developed to increase the accuracy. The proposed stacking model classifier has obtained the log loss 0.8436 and 0.8572 for cross-validation data set and test data set, respectively. By the experimentation, it has been proved that the proposed stacking model classifier outperforms the existing algorithms in terms of log loss. Basically, minimum log loss refers to the efficient model. Here the log loss has been reduced to less than 1 by using the proposed stacking model classifier. The performance of these algorithms can be gauged on the basis of the various measures like multi-class log loss.

Download Full-text

Comparison of Machine Learning Algorithms for Cardiovascular Disease Prediction

Computational Methodologies for Electrical and Electronics Engineers - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-7998-3327-7.ch009 ◽

2021 ◽

pp. 111-126

Author(s):

Stuti Pandey ◽

Abhay Kumar Agarwal

Keyword(s):

Machine Learning ◽

Cardiovascular Disease ◽

Learning Algorithm ◽

Learning Algorithms ◽

Research Field ◽

Machine Learning Algorithms ◽

Support Vector ◽

Disease Prediction ◽

K Nearest Neighbors ◽

The University

Cardiovascular disease prediction is a research field of healthcare which depends on a large volume of data for making effective and accurate predictions. These predictions can be more effective and accurate when used with machine learning algorithms because it can disclose all the concealed facts which are helpful in making decisions. The processing capabilities of machine learning algorithms are also very fast which is almost infeasible for human beings. Therefore, the work presented in this research focuses on identifying the best machine learning algorithm by comparing their performances for predicting cardiovascular diseases in a reasonable time. The machine learning algorithms which have been used in the presented work are naïve Bayes, support vector machine, k-nearest neighbors, and random forest. The dataset which has been utilized for this comparison is taken from the University of California, Irvine (UCI) machine learning repository named “Heart Disease Data Set.”

Download Full-text

Prediction of Cardiovascular Disease using Machine Learning Algorithms

Prediction of Prostate Cancer using Machine Learning Algorithms

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset﻿

Tremor Identification Using Machine Learning in Parkinson's Disease

FEASIBILITY OF MACHINE LEARNING METHODS FOR SEPARATING WOOD AND LEAF POINTS FROM TERRESTRIAL LASER SCANNING DATA

Machine learning in the diagnosis of Myocardial Infarction with Non-Obstructive Coronary Arteries

Comparison of common machine learning algorithms trained with multi-zone models for identifying the location and strength of indoor pollutant sources

Invisible experience to real-time assessment in elite tennis athlete training: Sport-specific movement classification based on wearable MEMS sensor data

Tremor Identification Using Machine Learning in Parkinson's Disease

A Novel Ensemble Stacking Classification of Genetic Variations Using Machine Learning Algorithms

Comparison of Machine Learning Algorithms for Cardiovascular Disease Prediction

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset