Predictors of Mortality Using Machine Learning Decision Tree Algorithm in Critically Ill Adult Patients with COVID-19 Admitted to the ICU.

Abstract Background:The Coronavirus Disease-19 (COVID-19) caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) is a major cause of intensive care unit (ICU) admissions globally. Robust data of epidemiology, characteristics, and disease outcomes from different regions and populations showed considerable variations. However, limited number of reports addressed predictors of mortality utilizing machine learning methods. Herein, we aimed to describe the association and relationship of a predefined set of variables found to be predictive of 28–day ICU outcome among adults COVID-19 patients admitted to the ICU using a machine learning decision tree (DT) algorithm.Methods:This was a prospective/retrospective, multicenter cohort study from 14 hospitals in Saudi Arabia. We included critically ill COVID-19 patients admitted to the ICU between March 1, 2020, and October 31, 2020. The primary outcome was 28-day ICU mortality. Secondary outcomes were 90-day mortality and ICU length of stay. The predictors of mortality were identified using two predictive models, the conventional logistic regression and DT analysis.Results:A total of 1468 critically ill COVID-19 patients were included. The mean age was 55.9 (SD±15.1) years, with 74% of the patients were males. The 28-day ICU mortality was 540 (36.8%), while 90-day mortality was 600 (40.9%). The multivariable logistic regression model demonstrated that the PaO2/FiO2 ratio on ICU admission and the need for intubation or vasopressors could strongly predict 28-day ICU mortality. The DT algorithm identified five variables [need for intubation, need for vasopressors, age, gender, and PaO2/FiO2 ratio] provided in an algorithmic fashion to predict 28-day ICU outcome. Conclusion:Five clinical predictors of 28-day ICU outcome were identified using DT algorithmic analysis of COVID-19 patients admitted to ICU. The findings of this DT analysis may be used in ICU for early identification of critically ill COVID-19 patients who are at high risk of 28-day mortality.

Download Full-text

AN EFFICIENT MACHINE LEARNING MODEL FOR PREDICTION OF ACUTE MYOCARDIAL INFARCTION

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666200325104317 ◽

2020 ◽

Vol 13 ◽

Author(s):

Dhilsath Fathima.M ◽

S. Justin Samuel ◽

R. Hari Haran

Keyword(s):

Machine Learning ◽

Myocardial Infarction ◽

Acute Myocardial Infarction ◽

Logistic Regression ◽

Decision Tree ◽

Learning Model ◽

Training Dataset ◽

Data Set ◽

Machine Learning Model ◽

Proposed Model

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.

Download Full-text

Comparison of the Performance of Machine Learning Algorithms in Predicting Heart Disease

Frontiers in Health Informatics ◽

10.30699/fhi.v10i1.349 ◽

2021 ◽

Vol 10 (1) ◽

pp. 99

Author(s):

Sajad Yousefi

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Heart Disease ◽

Decision Tree ◽

Roc Curve ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Learning Models ◽

Algorithm Performance ◽

Machine Learning Models

Introduction: Heart disease is often associated with conditions such as clogged arteries due to the sediment accumulation which causes chest pain and heart attack. Many people die due to the heart disease annually. Most countries have a shortage of cardiovascular specialists and thus, a significant percentage of misdiagnosis occurs. Hence, predicting this disease is a serious issue. Using machine learning models performed on multidimensional dataset, this article aims to find the most efficient and accurate machine learning models for disease prediction.Material and Methods: Several algorithms were utilized to predict heart disease among which Decision Tree, Random Forest and KNN supervised machine learning are highly mentioned. The algorithms are applied to the dataset taken from the UCI repository including 294 samples. The dataset includes heart disease features. To enhance the algorithm performance, these features are analyzed, the feature importance scores and cross validation are considered.Results: The algorithm performance is compared with each other, so that performance based on ROC curve and some criteria such as accuracy, precision, sensitivity and F1 score were evaluated for each model. As a result of evaluation, Accuracy, AUC ROC are 83% and 99% respectively for Decision Tree algorithm. Logistic Regression algorithm with accuracy and AUC ROC are 88% and 91% respectively has better performance than other algorithms. Therefore, these techniques can be useful for physicians to predict heart disease patients and prescribe them correctly.Conclusion: Machine learning technique can be used in medicine for analyzing the related data collections to a disease and its prediction. The area under the ROC curve and evaluating criteria related to a number of classifying algorithms of machine learning to evaluate heart disease and indeed, the prediction of heart disease is compared to determine the most appropriate classification. As a result of evaluation, better performance was observed in both Decision Tree and Logistic Regression models.

Download Full-text

Predicting Metabolic Syndrome With Machine Learning Models Using a Decision Tree Algorithm: Retrospective Cohort Study (Preprint)

10.2196/preprints.17110 ◽

2019 ◽

Author(s):

Cheng-Sheng Yu ◽

Yu-Jiun Lin ◽

Chang-Hsien Lin ◽

Sen-Te Wang ◽

Shiyng-Yu Lin ◽

...

Keyword(s):

Machine Learning ◽

Metabolic Syndrome ◽

Logistic Regression ◽

Decision Tree ◽

Characteristic Curve ◽

Significant Risk ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Health Examination ◽

Multivariate Logistic Regression

BACKGROUND Metabolic syndrome is a cluster of disorders that significantly influence the development and deterioration of numerous diseases. FibroScan is an ultrasound device that was recently shown to predict metabolic syndrome with moderate accuracy. However, previous research regarding prediction of metabolic syndrome in subjects examined with FibroScan has been mainly based on conventional statistical models. Alternatively, machine learning, whereby a computer algorithm learns from prior experience, has better predictive performance over conventional statistical modeling. OBJECTIVE We aimed to evaluate the accuracy of different decision tree machine learning algorithms to predict the state of metabolic syndrome in self-paid health examination subjects who were examined with FibroScan. METHODS Multivariate logistic regression was conducted for every known risk factor of metabolic syndrome. Principal components analysis was used to visualize the distribution of metabolic syndrome patients. We further applied various statistical machine learning techniques to visualize and investigate the pattern and relationship between metabolic syndrome and several risk variables. RESULTS Obesity, serum glutamic-oxalocetic transaminase, serum glutamic pyruvic transaminase, controlled attenuation parameter score, and glycated hemoglobin emerged as significant risk factors in multivariate logistic regression. The area under the receiver operating characteristic curve values for classification and regression trees and for the random forest were 0.831 and 0.904, respectively. CONCLUSIONS Machine learning technology facilitates the identification of metabolic syndrome in self-paid health examination subjects with high accuracy.

Download Full-text

Machine Learning Techniques Applied to Profile Mobile Banking Users in India

International Journal of Information Systems in the Service Sector ◽

10.4018/jisss.2013010105 ◽

2013 ◽

Vol 5 (1) ◽

pp. 82-92 ◽

Cited By ~ 8

Author(s):

M. Carr ◽

V. Ravi ◽

G. Sridharan Reddy ◽

D. Veranna

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Decision Tree ◽

Decision Trees ◽

Multilayer Perceptron ◽

Machine Learning Techniques ◽

Mobile Banking ◽

Classification Rules ◽

Learning Techniques ◽

Potential Customers

This paper profiles mobile banking users using machine learning techniques viz. Decision Tree, Logistic Regression, Multilayer Perceptron, and SVM to test a research model with fourteen independent variables and a dependent variable (adoption). A survey was conducted and the results were analysed using these techniques. Using Decision Trees the profile of the mobile banking adopter’s profile was identified. Comparing different machine learning techniques it was found that Decision Trees outperformed the Logistic Regression and Multilayer Perceptron and SVM. Out of all the techniques, Decision Tree is recommended for profiling studies because apart from obtaining high accurate results, it also yields ‘if–then’ classification rules. The classification rules provided here can be used to target potential customers to adopt mobile banking by offering them appropriate incentives.

Download Full-text

Azure Machine Learning tools efficiency in the electroencephalographic signal P300 standard and target responses classification

Bio-Algorithms and Med-Systems ◽

10.1515/bams-2019-0031 ◽

2019 ◽

Vol 15 (3) ◽

Author(s):

Grzegorz M. Wójcik ◽

Andrzej Kawiak ◽

Lukasz Kwasniewicz ◽

Piotr Schneider ◽

Jolanta Masiak

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Decision Tree ◽

Event Related Potentials ◽

Learning Tools ◽

Mentally Disabled ◽

Boosted Decision Tree ◽

Brodmann Areas ◽

Related Potentials ◽

Somatosensory Responses

AbstractThe Event-Related Potentials were investigated on a group of 70 participants using the dense array electroencephalographic amplifier with photogrammetry geodesic station. The source localisation was computed for each participant. The activity of brodmann areas (BAs) involved in the brain cortical activity of each participant was measured. Then the mean electric charge flowing through particular areas was calculated. The five different machine learning tools (logistic regression, boosted decision tree, Bayes point machine, classic neural network and averaged perceptron classifier) from the Azure ecosystem were trained, and their accuracy was tested in the task of distinguishing standard and target responses in the experiment. The efficiency of each tool was compared, and it was found out that the best tool was logistic regression and the boosted decision tree in our task. Such an approach can be useful in eliminating somatosensory responses in experimental psychology or even in establishing new communication protocols with mildly mentally disabled subjects.

Download Full-text

Comparison of Support Vector Machine, Bayesian Logistic Regression, and Alternating Decision Tree Algorithms for Shallow Landslide Susceptibility Mapping along a Mountainous Road in the West of Iran

Applied Sciences ◽

10.3390/app10155047 ◽

2020 ◽

Vol 10 (15) ◽

pp. 5047 ◽

Cited By ~ 7

Author(s):

Viet-Ha Nhu ◽

Danesh Zandi ◽

Himan Shahabi ◽

Kamran Chapi ◽

Ataollah Shirzadi ◽

...

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Logistic Regression ◽

Decision Tree ◽

Shallow Landslide ◽

Machine Learning Algorithms ◽

Support Vector ◽

Svm Algorithm ◽

Alternating Decision Tree ◽

Bayesian Logistic Regression

This paper aims to apply and compare the performance of the three machine learning algorithms–support vector machine (SVM), bayesian logistic regression (BLR), and alternating decision tree (ADTree)–to map landslide susceptibility along the mountainous road of the Salavat Abad saddle, Kurdistan province, Iran. We identified 66 shallow landslide locations, based on field surveys, by recording the locations of the landslides by a global position System (GPS), Google Earth imagery and black-and-white aerial photographs (scale 1: 20,000) and 19 landslide conditioning factors, then tested these factors using the information gain ratio (IGR) technique. We checked the validity of the models using statistical metrics, including sensitivity, specificity, accuracy, kappa, root mean square error (RMSE), and area under the receiver operating characteristic curve (AUC). We found that, although all three machine learning algorithms yielded excellent performance, the SVM algorithm (AUC = 0.984) slightly outperformed the BLR (AUC = 0.980), and ADTree (AUC = 0.977) algorithms. We observed that not only all three algorithms are useful and effective tools for identifying shallow landslide-prone areas but also the BLR algorithm can be used such as the SVM algorithm as a soft computing benchmark algorithm to check the performance of the models in future.

Download Full-text

Comparison Decision Tree and Logistic Regression Machine Learning Classification Algorithms to determine Covid-19

SinkrOn ◽

10.33395/sinkron.v7i1.11243 ◽

2022 ◽

Vol 7 (1) ◽

pp. 59-65

Author(s):

Artika Arista

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Decision Tree ◽

Cross Validation ◽

Performance Testing ◽

Signs And Symptoms ◽

Classification Algorithms ◽

Machine Learning Classification ◽

Wide Range ◽

Testing Performance

Many people today are unsure whether they have COVID-19. The frequent fever, dry cough, and sore throat are all signs and symptoms of COVID-19. If a person has signs or symptoms of coronavirus disease 2019 (COVID-19), he/she should see the doctor or go to a clinic as soon as possible. As a result, it's vital to learn and comprehend the fundamental differences. COVID-19 can cause a wide range of symptoms. The experiments were carried out using two Machine Learning Classification Algorithms, namely Decision Tree (DT) and Logistic Regression (LR). Both algorithms were written and analyzed using the Python program in Jupyter Notebook 6.4.5. From the results obtained in the experiments of covid symptoms dataset, on average, the DT model has obtained the best cross-validation average and the testing performance average compared to the LR machine learning models. For cross-validation results, the DT model has achieved an accuracy of 98.0%. For performance testing, the DT model has achieved an accuracy of 98.0%. The LR has obtained the second-best result on the average of cross-validation performance and the testing results. For cross-validation results, the LR model has achieved an accuracy of 96.0%. For performance testing, the LR model has achieved an accuracy of 97.0%. Consequently, the DT for the COVID-19 symptoms dataset is outperforming the LR for cross-validation and testing results.

Download Full-text

Predicting Metabolic Syndrome With Machine Learning Models Using a Decision Tree Algorithm: Retrospective Cohort Study

JMIR Medical Informatics ◽

10.2196/17110 ◽

2020 ◽

Vol 8 (3) ◽

pp. e17110 ◽

Cited By ~ 3

Author(s):

Cheng-Sheng Yu ◽

Yu-Jiun Lin ◽

Chang-Hsien Lin ◽

Sen-Te Wang ◽

Shiyng-Yu Lin ◽

...

Keyword(s):

Machine Learning ◽

Metabolic Syndrome ◽

Logistic Regression ◽

Decision Tree ◽

Characteristic Curve ◽

Significant Risk ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Health Examination ◽

Multivariate Logistic Regression

Background Metabolic syndrome is a cluster of disorders that significantly influence the development and deterioration of numerous diseases. FibroScan is an ultrasound device that was recently shown to predict metabolic syndrome with moderate accuracy. However, previous research regarding prediction of metabolic syndrome in subjects examined with FibroScan has been mainly based on conventional statistical models. Alternatively, machine learning, whereby a computer algorithm learns from prior experience, has better predictive performance over conventional statistical modeling. Objective We aimed to evaluate the accuracy of different decision tree machine learning algorithms to predict the state of metabolic syndrome in self-paid health examination subjects who were examined with FibroScan. Methods Multivariate logistic regression was conducted for every known risk factor of metabolic syndrome. Principal components analysis was used to visualize the distribution of metabolic syndrome patients. We further applied various statistical machine learning techniques to visualize and investigate the pattern and relationship between metabolic syndrome and several risk variables. Results Obesity, serum glutamic-oxalocetic transaminase, serum glutamic pyruvic transaminase, controlled attenuation parameter score, and glycated hemoglobin emerged as significant risk factors in multivariate logistic regression. The area under the receiver operating characteristic curve values for classification and regression trees and for the random forest were 0.831 and 0.904, respectively. Conclusions Machine learning technology facilitates the identification of metabolic syndrome in self-paid health examination subjects with high accuracy.

Download Full-text

CT Radiomics Nomogram for the Preoperative Prediction of Severe Post-Hepatectomy Liver Failure in Patients with Huge (≥10 cm) Hepatocellular Carcinoma

10.21203/rs.3.rs-732725/v1 ◽

2021 ◽

Author(s):

Fei Xiang ◽

Xiaoyuan Liang ◽

Lili Yang ◽

Xingyu Liu ◽

Sheng Yan

Keyword(s):

Hepatocellular Carcinoma ◽

Decision Tree ◽

Liver Failure ◽

Extent Of Resection ◽

Meld Score ◽

Training Dataset ◽

Clinical Predictors ◽

Multivariable Logistic Regression Model ◽

Risk Patients ◽

Radiomics Signature

Abstract Background To establish a radiomics-based nomogram for predicting severe (grade B or C) post-hepatectomy liver failure (PHLF) in patients with huge (≥10 cm) hepatocellular carcinoma (HCC).Methods 186 patients with huge HCC (n = 131 for training dataset and n = 55 for test dataset) who underwent curative hepatic resection were included. The least absolute shrinkage and selection operator approach was applied to develop the radiomics signature for grade B or C PHLF prediction in the training dataset. A multivariable logistic regression model was used by incorporating radiomics signature and other clinical predictors to establish a radiomics nomogram. A decision tree was created to stratify the risk for severe PHLF.Results The radiomics signature consisting of nine features predicted severe PHLF with an AUC of 0.766 and 0.745 in the training and test datasets, respectively. The radiomics nomogram was generated by integrating the radiomics signature, the extent of resection and model for end-stage liver disease (MELD) score. The nomogram exhibited satisfactory discrimination and calibration, with an AUC of 0.842 and 0.863 in the training and test datasets, respectively. Decision tree split patients into 3 risk classes: low-risk patients with radiomics score < -0.247 and MELD score < 10 or，radiomics score ≥ -0.247 and underwent partial resections; intermediate-risk patients with radiomics score < -0.247 but MELD score ≥10; high-risk patients with radiomics score ≥ -0.247 and underwent extended resections.Conclusions The radiomics nomogram was able to predict severe PHLF in huge HCC patients. Decision tree may be useful in surgical decision-making for huge HCC hepatectomy.

Download Full-text

Employee's Attrition Prediction Using Machine Learning Approaches

Machine Learning and Deep Learning in Real-Time Applications - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-7998-3095-5.ch005 ◽

2020 ◽

pp. 121-128

Author(s):

Krishna Kumar Mohbey

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Comparative Analysis ◽

Random Forest ◽

Decision Tree ◽

Experimental Results ◽

Learning Approaches ◽

Employee Attrition ◽

Or Organization ◽

Customer Attrition

In any industry, attrition is a big problem, whether it is about employee attrition of an organization or customer attrition of an e-commerce site. If we can accurately predict which customer or employee will leave their current company or organization, then it will save much time, effort, and cost of the employer and help them to hire or acquire substitutes in advance, and it would not create a problem in the ongoing progress of an organization. In this chapter, a comparative analysis between various machine learning approaches such as Naïve Bayes, SVM, decision tree, random forest, and logistic regression is presented. The presented result will help us in identifying the behavior of employees who can be attired over the next time. Experimental results reveal that the logistic regression approach can reach up to 86% accuracy over other machine learning approaches.

Download Full-text