Interpretable Machine Learning Model for Mortality Prediction in ICU: A Multicenter Study

Abstract Background: Researchers have long struggled to improve disease severity scores for mortality prediction in the ICU. The digitalization of medical health records and advances in computational power have promoted the use of machine learning in critical care. This study aimed to develop an interpretable machine learning model using multicenter datasets, and to compare it with APACHE IV, in predicting hospital mortality of patients admitted to the ICU. Methods: The datasets were assembled from the eICU database, comprising 136145 patients across 208 hospitals throughout the U.S., and from 5 ICUs in Hong Kong, comprising 10909 patients. The two datasets were first combined into one large dataset, which was then divided by an 80:20 stratified split into a training set and a test set. The XGBoost algorithm was chosen to predict hospital mortality. The variables in the model were the same as those included in the APACHE IV score. The discrimination and calibration of the model were assessed. The model was interpreted using Shapley Additive Explanations (SHAP) values. Results: Of the 147054 patients in the whole cohort, the hospital mortality was 9.3%. The area under the precision-recall curve was 0.57 for the XGBoost algorithm and 0.49 for APACHE IV. Similarly, XGBoost reached an area under the receiver operating characteristic curve (AUROC) of 0.90, while APACHE IV had an AUROC of 0.87. Additionally, the XGBoost algorithm showed better calibration than APACHE IV. The three most important variables were age, heart rate, and whether the patient was on a ventilator. Conclusions: The severity score developed by a machine learning model using multicenter datasets outperformed APACHE IV in predicting hospital mortality for patients admitted to the ICU.
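
The evaluation steps named in the abstract above (an 80:20 stratified split, then AUROC and the area under the precision-recall curve) can be sketched in plain Python. This is a minimal illustration and not the authors' pipeline: the function names, the synthetic labels, and the step-wise average-precision estimate of the precision-recall area are all assumptions made for the example, and no XGBoost model is trained here.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx), keeping the class ratio in both parts."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


def auroc(labels, scores):
    """Chance that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Step-wise estimate of the area under the precision-recall curve."""
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank  # precision at each recalled positive
    return total / sum(labels)


# Synthetic labels with a 9.3% positive rate, echoing the reported mortality.
labels = [1] * 93 + [0] * 907
train_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(test_idx), sum(labels[i] for i in test_idx))  # 800 200 19
```

With a positive rate near 9%, the precision-recall area is the more demanding of the two metrics, which may explain why the abstract reports it alongside AUROC.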

Machine Learning Model for In-Hospital Mortality Prediction After Transcatheter Aortic Valve Replacement

Journal of the American College of Cardiology ◽

10.1016/s0735-1097(21)04574-5 ◽

2021 ◽

Vol 77 (18) ◽

pp. 3219

Author(s):

Chayakrit Krittanawong ◽

Hafeez Ul Hassan Virk ◽

Joshua Hahn ◽

Fu’ad Al-Azzam ◽

Kevin Greason ◽

...

Keyword(s):

Machine Learning ◽

Aortic Valve ◽

Aortic Valve Replacement ◽

Hospital Mortality ◽

Valve Replacement ◽

Transcatheter Aortic Valve Replacement ◽

Learning Model ◽

Mortality Prediction ◽

Transcatheter Aortic Valve ◽

Machine Learning Model

Interpretable machine learning model for the deformation of multi-walled carbon nanotubes

10.26226/morressier.612f6735bc98103724100779 ◽

2021 ◽

Author(s):

Upendra Yadav ◽

Shashank Shrikant Pathrudkar ◽

Susanta Ghosh

Keyword(s):

Machine Learning ◽

Carbon Nanotubes ◽

Learning Model ◽

Interpretable Machine Learning ◽

Machine Learning Model ◽

Multi Walled Carbon Nanotubes ◽

Walled Carbon Nanotubes

Generating Pseudo-Data to Enhance the Performance of Classification-Based Engineering Design: A Preliminary Investigation

Volume 6: Design, Systems, and Complexity ◽

10.1115/imece2020-24634 ◽

2020 ◽

Author(s):

Xianping Du ◽

Onur Bilgen ◽

Hongyi Xu

Keyword(s):

Machine Learning ◽

Engineering Design ◽

Real World ◽

Surrogate Model ◽

Preliminary Investigation ◽

Learning Model ◽

Classification Model ◽

Design Decision ◽

Large Dataset ◽

Machine Learning Model

Abstract Machine learning for classification has been used widely in engineering design, for example in feasible domain recognition and hidden pattern discovery. Training an accurate machine learning model requires a large dataset; however, high computational or experimental costs are major obstacles to obtaining a large dataset for real-world problems. One possible solution is to generate a large pseudo dataset with surrogate models, which are built from a smaller set of real training data. However, it is not well understood whether the pseudo dataset benefits the classification model by providing more information or degrades machine learning performance through the prediction errors and uncertainties introduced by the surrogate model. This paper presents a preliminary investigation of this research question. A classification-and-regression-tree model is employed to recognize the design subspaces to support design decision-making. It is implemented on the geometric design of a vehicle energy-absorbing structure based on finite element simulations. Based on a small set of real-world data obtained by simulations, a surrogate model based on Gaussian process regression is employed to generate pseudo datasets for training. The results showed that the tree-based method could help recognize feasible design domains efficiently. Furthermore, the additional information provided by the surrogate model enhances the accuracy of classification. One important conclusion is that the accuracy of the surrogate model determines the quality of the pseudo dataset and, hence, the improvement in the machine learning model.
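
The closing point of the abstract above, that surrogate accuracy determines pseudo-dataset quality, can be illustrated with a small sketch. Everything here is an assumption made for illustration: `simulate` is a hypothetical one-dimensional stand-in for an expensive simulation, a piecewise-linear interpolator is swapped in for the Gaussian process regression used in the paper, and the feasibility limit is arbitrary. The sketch only measures how often surrogate-generated feasibility labels agree with the true response; it does not train a classification tree.

```python
import bisect
import random


def simulate(x):
    """Hypothetical stand-in for an expensive simulation response."""
    return (x - 0.5) ** 2


def make_surrogate(xs, ys):
    """Piecewise-linear interpolator through the real samples (xs sorted)."""
    def predict(x):
        # Index of the right-hand node of the segment containing x.
        j = min(max(bisect.bisect_right(xs, x), 1), len(xs) - 1)
        x0, x1, y0, y1 = xs[j - 1], xs[j], ys[j - 1], ys[j]
        return y0 + (y1 - y0) * (x - x0) / (x1 - x0)
    return predict


def pseudo_label_accuracy(n_real, n_pseudo=200, limit=0.05, seed=0):
    """Share of surrogate-generated feasibility labels matching the simulation."""
    xs = [i / (n_real - 1) for i in range(n_real)]
    surrogate = make_surrogate(xs, [simulate(x) for x in xs])
    rng = random.Random(seed)
    pseudo_x = [rng.random() for _ in range(n_pseudo)]
    agree = sum((surrogate(x) <= limit) == (simulate(x) <= limit) for x in pseudo_x)
    return agree / n_pseudo


coarse = pseudo_label_accuracy(n_real=3)   # surrogate built from 3 real samples
fine = pseudo_label_accuracy(n_real=11)    # surrogate built from 11 real samples
print(f"3 real samples: {coarse:.2f}, 11 real samples: {fine:.2f}")
```

In this toy setting the surrogate built from more real samples mislabels far fewer pseudo points, so a classifier trained on its output would inherit fewer label errors.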

Ensemble Machine Learning Model for Mortality Prediction Inside Intensive Care Unit

Studies in Computational Intelligence - Medical Informatics and Bioimaging Using Artificial Intelligence ◽

10.1007/978-3-030-91103-4_14 ◽

2021 ◽

pp. 245-258

Author(s):

Nora El-Rashidy ◽

Shaker El-Sappagh ◽

Samir Abdelrazik ◽

Hazem El-Bakry

Keyword(s):

Machine Learning ◽

Intensive Care Unit ◽

Intensive Care ◽

Learning Model ◽

Mortality Prediction ◽

Ensemble Machine Learning ◽

Machine Learning Model

Incorporating Laboratory Values Into a Machine Learning Model Improves In-Hospital Mortality Predictions After Rapid Response Team Call

Critical Care Explorations ◽

10.1097/cce.0000000000000023 ◽

2019 ◽

Vol 1 (7) ◽

pp. e0023

Author(s):

Peter M. Reardon ◽

Enea Parimbelli ◽

Szymon Wilk ◽

Wojtek Michalowski ◽

Kyle Murphy ◽

...

Keyword(s):

Machine Learning ◽

Hospital Mortality ◽

Rapid Response Team ◽

Learning Model ◽

Rapid Response ◽

Laboratory Values ◽

Machine Learning Model ◽

Response Team

Early prediction of in-hospital death of COVID-19 patients: a machine-learning model based on age, blood analyses, and chest x-ray score

eLife ◽

10.7554/elife.70640 ◽

2021 ◽

Vol 10 ◽

Author(s):

Emirena Garrafa ◽

Marika Vezzoli ◽

Marco Ravanelli ◽

Davide Farina ◽

Andrea Borghesi ◽

...

Keyword(s):

Machine Learning ◽

Hospital Mortality ◽

Learning Model ◽

Hospital Death ◽

Web Based ◽

X Ray ◽

Machine Learning Model ◽

Chest X Ray ◽

Blood Analytes ◽

Second Wave

An early-warning model to predict in-hospital mortality on admission of COVID-19 patients at an emergency department (ED) was developed and validated using a machine-learning model. In total, 2782 patients were enrolled between March 2020 and December 2020, including 2106 patients (first wave) and 676 patients (second wave) in the COVID-19 outbreak in Italy. The first-wave patients were divided into two groups, with 1474 patients used to train the model and 632 to validate it. The 676 patients in the second wave were used to test the model. Age, 17 blood analytes and Brescia chest X-ray score were the variables processed using a Random Forests classification algorithm to build and validate the model. ROC analysis was used to assess model performance. A web-based death-risk calculator was implemented and integrated within the Laboratory Information System of the hospital. The final score was constructed from age (the most powerful predictor), blood analytes (the strongest predictors were lactate dehydrogenase, D-dimer, Neutrophil/Lymphocyte ratio, C-reactive protein, Lymphocyte %, Ferritin std and Monocyte %), and Brescia chest X-ray score. The areas under the receiver operating characteristic curve obtained for the three groups (training, validation and testing) were 0.98, 0.83 and 0.78, respectively. The model predicts in-hospital mortality on the basis of data that can be obtained in a short time, directly at the ED on admission. It functions as a web-based calculator, providing a risk score that is easy to interpret. It can be used in the triage process to support decisions on patient allocation.

Prospective and External Evaluation of a Machine Learning Model to Predict In-Hospital Mortality of Adults at Time of Admission

JAMA Network Open ◽

10.1001/jamanetworkopen.2019.20733 ◽

2020 ◽

Vol 3 (2) ◽

pp. e1920733 ◽

Cited By ~ 6

Author(s):

Nathan Brajer ◽

Brian Cozzi ◽

Michael Gao ◽

Marshall Nichols ◽

Mike Revoir ◽

...

Keyword(s):

Machine Learning ◽

Hospital Mortality ◽

Learning Model ◽

External Evaluation ◽

Machine Learning Model ◽

Time Of Admission

An interpretable machine learning model for diagnosis of Alzheimer's disease

PeerJ ◽

10.7717/peerj.6543 ◽

2019 ◽

Vol 7 ◽

pp. e6543 ◽

Cited By ~ 4

Author(s):

Diptesh Das ◽

Junichi Ito ◽

Tadashi Kadowaki ◽

Koji Tsuda

Keyword(s):

Machine Learning ◽

Alzheimer's Disease ◽

Medical Diagnosis ◽

Cost Effective ◽

Learning Model ◽

Patient Specific ◽

Interpretable Machine Learning ◽

Machine Learning Model ◽

Effective Diagnosis

We present an interpretable machine learning model for medical diagnosis called sparse high-order interaction model with rejection option (SHIMR). A decision tree explains to a patient the diagnosis with a long rule (i.e., conjunction of many intervals), while SHIMR employs a weighted sum of short rules. Using proteomics data of 151 subjects in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset, SHIMR is shown to be as accurate as other non-interpretable methods (Sensitivity, SN = 0.84 ± 0.1, Specificity, SP = 0.69 ± 0.15 and Area Under the Curve, AUC = 0.86 ± 0.09). For clinical usage, SHIMR has a function to abstain from making any diagnosis when it is not confident enough, so that a medical doctor can choose more accurate but invasive and/or more costly pathologies. The incorporation of a rejection option complements SHIMR in designing a multistage cost-effective diagnosis framework. Using a baseline concentration of cerebrospinal fluid (CSF) and plasma proteins from a common cohort of 141 subjects, SHIMR is shown to be effective in designing a patient-specific cost-effective Alzheimer’s disease (AD) pathology. Thus, interpretability, reliability and having the potential to design a patient-specific multistage cost-effective diagnosis framework can make SHIMR serve as an indispensable tool in the era of precision medicine that can cater to the demand of both doctors and patients, and reduce the overwhelming financial burden of medical diagnosis.
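
The idea described above, a weighted sum of short rules combined with an option to abstain, can be sketched as follows. This is not the SHIMR implementation: the rule set, the feature names `marker_a` and `marker_b`, the weights, the bias, the logistic link, and the width of the rejection band are all hypothetical values chosen for illustration.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Rule:
    description: str
    condition: Callable[[Dict[str, float]], bool]
    weight: float


# Hypothetical short rules; each is a conjunction of at most two intervals.
RULES = [
    Rule("1.0 <= marker_a < 2.0", lambda s: 1.0 <= s["marker_a"] < 2.0, 1.5),
    Rule("marker_b >= 0.8", lambda s: s["marker_b"] >= 0.8, 1.0),
    Rule("marker_a < 1.0 and marker_b < 0.3",
         lambda s: s["marker_a"] < 1.0 and s["marker_b"] < 0.3, -2.0),
]
BIAS = -0.5


def predict_with_rejection(rules, bias, sample, band=0.2):
    """Return (label or None, probability, fired rules).

    The score is the bias plus the weights of the rules that fire. The
    prediction is withheld (None) when the probability falls within
    `band` of 0.5, i.e. when the model is not confident enough.
    """
    fired = [r for r in rules if r.condition(sample)]
    score = bias + sum(r.weight for r in fired)
    prob = 1.0 / (1.0 + math.exp(-score))
    if abs(prob - 0.5) < band:
        return None, prob, fired
    return int(prob >= 0.5), prob, fired


for sample in ({"marker_a": 1.5, "marker_b": 0.9},
               {"marker_a": 0.5, "marker_b": 0.9},
               {"marker_a": 0.5, "marker_b": 0.1}):
    label, prob, fired = predict_with_rejection(RULES, BIAS, sample)
    verdict = "abstain" if label is None else f"class {label}"
    print(verdict, round(prob, 3), [r.description for r in fired])
```

The list of fired rules doubles as the explanation for each prediction, and the abstaining case is the point at which a more accurate but more costly test could take over in a multistage workflow.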

Predicting lethal courses in critically ill COVID-19 patients using a machine learning model trained on patients with non-COVID-19 viral pneumonia

Scientific Reports ◽

10.1038/s41598-021-92475-7 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Gregor Lichtner ◽

Felix Balzer ◽

Stefan Haufe ◽

Niklas Giesa ◽

Fridtjof Schiefenhövel ◽

...

Keyword(s):

Machine Learning ◽

Critically Ill ◽

Prediction Models ◽

Predictive Performance ◽

Learning Model ◽

Mortality Prediction ◽

Viral Pneumonia ◽

Machine Learning Model ◽

Mortality Prediction Models ◽

Time Courses

Abstract In a pandemic with a novel disease, disease-specific prognosis models become available only after a delay. To bridge the critical early phase, models built for similar diseases might be applied. To test the accuracy of such a knowledge transfer, we investigated how precisely lethal courses in critically ill COVID-19 patients can be predicted by a model trained on critically ill non-COVID-19 viral pneumonia patients. We trained gradient boosted decision tree models on 718 (245 deceased) non-COVID-19 viral pneumonia patients to predict individual ICU mortality and applied them to 1054 (369 deceased) COVID-19 patients. Our model showed a significantly better predictive performance (AUROC 0.86 [95% CI 0.86–0.87]) than the clinical scores APACHE2 (0.63 [95% CI 0.61–0.65]), SAPS2 (0.72 [95% CI 0.71–0.74]) and SOFA (0.76 [95% CI 0.75–0.77]), the COVID-19-specific mortality prediction models of Zhou (0.76 [95% CI 0.73–0.78]) and Wang (laboratory: 0.62 [95% CI 0.59–0.65]; clinical: 0.56 [95% CI 0.55–0.58]) and the 4C COVID-19 Mortality score (0.71 [95% CI 0.70–0.72]). We conclude that lethal courses in critically ill COVID-19 patients can be predicted by a machine learning model trained on non-COVID-19 patients. Our results suggest that in a pandemic with a novel disease, prognosis models built for similar diseases can be applied, even when the diseases differ in time courses and in rates of critical and lethal courses.
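
The abstract above reports each AUROC with a 95% confidence interval but does not say how the intervals were obtained. One common approach is the percentile bootstrap, sketched below on synthetic data. It is an assumption that this resembles the authors' procedure; the function names, the simulated scores, and the 35% event rate (chosen to echo 369 deaths among 1054 patients) are illustrative only.

```python
import random


def auroc(labels, scores):
    """Chance that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auroc_ci(labels, scores, n_boot=500, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the AUROC."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # AUROC needs both classes in the resample
            stats.append(auroc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Synthetic cohort: about 35% events, with scores shifted upward for events.
rng = random.Random(1)
labels = [1 if rng.random() < 0.35 else 0 for _ in range(120)]
scores = [rng.gauss(1.0 if y else 0.0, 1.0) for y in labels]

point = auroc(labels, scores)
lower, upper = bootstrap_auroc_ci(labels, scores)
print(f"AUROC {point:.2f} [95% CI {lower:.2f}-{upper:.2f}]")
```

Intervals as narrow as those quoted in the abstract would need a much larger sample, or a different resampling scheme, than this 120-row toy cohort.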
