scholarly journals Development and validation of an online model to predict critical COVID-19 with immune-inflammatory parameters

2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Yue Gao ◽  
Lingxi Chen ◽  
Jianhua Chi ◽  
Shaoqing Zeng ◽  
Xikang Feng ◽  
...  

Abstract Background Immune and inflammatory dysfunction was reported to underpin critical COVID-19(coronavirus disease 2019). We aim to develop a machine learning model that enables accurate prediction of critical COVID-19 using immune-inflammatory features at admission. Methods We retrospectively collected 2076 consecutive COVID-19 patients with definite outcomes (discharge or death) between January 27, 2020 and March 30, 2020 from two hospitals in China. Critical illness was defined as admission to intensive care unit, receiving invasive ventilation, or death. Least Absolute Shrinkage and Selection Operator (LASSO) was applied for feature selection. Five machine learning algorithms, including Logistic Regression (LR), Support Vector Machine (SVM), Gradient Boosted Decision Tree (GBDT), K-Nearest Neighbor (KNN), and Neural Network (NN) were built in a training dataset, and assessed in an internal validation dataset and an external validation dataset. Results Six features (procalcitonin, [T + B + NK cell] count, interleukin 6, C reactive protein, interleukin 2 receptor, T-helper lymphocyte/T-suppressor lymphocyte) were finally used for model development. Five models displayed varying but all promising predictive performance. Notably, the ensemble model, SPMCIIP (severity prediction model for COVID-19 by immune-inflammatory parameters), derived from three contributive algorithms (SVM, GBDT, and NN) achieved the best performance with an area under the curve (AUC) of 0.991 (95% confidence interval [CI] 0.979–1.000) in internal validation cohort and 0.999 (95% CI 0.998–1.000) in external validation cohort to identify patients with critical COVID-19. SPMCIIP could accurately and expeditiously predict the occurrence of critical COVID-19 approximately 20 days in advance. Conclusions The developed online prediction model SPMCIIP is hopeful to facilitate intensive monitoring and early intervention of high risk of critical illness in COVID-19 patients. Trial registration This study was retrospectively registered in the Chinese Clinical Trial Registry (ChiCTR2000032161). Graphical abstracthelper lymphocytve vv

2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Yue Gao ◽  
Guang-Yao Cai ◽  
Wei Fang ◽  
Hua-Yi Li ◽  
Si-Yuan Wang ◽  
...  

Abstract Soaring cases of coronavirus disease (COVID-19) are pummeling the global health system. Overwhelmed health facilities have endeavored to mitigate the pandemic, but mortality of COVID-19 continues to increase. Here, we present a mortality risk prediction model for COVID-19 (MRPMC) that uses patients’ clinical data on admission to stratify patients by mortality risk, which enables prediction of physiological deterioration and death up to 20 days in advance. This ensemble model is built using four machine learning methods including Logistic Regression, Support Vector Machine, Gradient Boosted Decision Tree, and Neural Network. We validate MRPMC in an internal validation cohort and two external validation cohorts, where it achieves an AUC of 0.9621 (95% CI: 0.9464–0.9778), 0.9760 (0.9613–0.9906), and 0.9246 (0.8763–0.9729), respectively. This model enables expeditious and accurate mortality risk stratification of patients with COVID-19, and potentially facilitates more responsive health systems that are conducive to high risk COVID-19 patients.


2022 ◽  
Vol 8 ◽  
Author(s):  
Jinzhang Li ◽  
Ming Gong ◽  
Yashutosh Joshi ◽  
Lizhong Sun ◽  
Lianjun Huang ◽  
...  

BackgroundAcute renal failure (ARF) is the most common major complication following cardiac surgery for acute aortic syndrome (AAS) and worsens the postoperative prognosis. Our aim was to establish a machine learning prediction model for ARF occurrence in AAS patients.MethodsWe included AAS patient data from nine medical centers (n = 1,637) and analyzed the incidence of ARF and the risk factors for postoperative ARF. We used data from six medical centers to compare the performance of four machine learning models and performed internal validation to identify AAS patients who developed postoperative ARF. The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was used to compare the performance of the predictive models. We compared the performance of the optimal machine learning prediction model with that of traditional prediction models. Data from three medical centers were used for external validation.ResultsThe eXtreme Gradient Boosting (XGBoost) algorithm performed best in the internal validation process (AUC = 0.82), which was better than both the logistic regression (LR) prediction model (AUC = 0.77, p < 0.001) and the traditional scoring systems. Upon external validation, the XGBoost prediction model (AUC =0.81) also performed better than both the LR prediction model (AUC = 0.75, p = 0.03) and the traditional scoring systems. We created an online application based on the XGBoost prediction model.ConclusionsWe have developed a machine learning model that has better predictive performance than traditional LR prediction models as well as other existing risk scoring systems for postoperative ARF. This model can be utilized to provide early warnings when high-risk patients are found, enabling clinicians to take prompt measures.


2020 ◽  
Vol 35 (Supplement_3) ◽  
Author(s):  
Hea Eun Kim ◽  
Hyeonsik Yang ◽  
Sejoong Kim ◽  
Kipyo Kim

Abstract Background and Aims Rapidly Increasing electronic health record (EHR) data and recent development of machine learning methods offers the possibilities of improvement in quality of care in clinical practice. Machine learning can incorporate huge amount of features into the model, and enable non-linear algorithms with great performance. Previously published AKI prediction models have simple design without real-time assessment. Major risk factors in in-hospital AKI include use of various nephrotoxins, repeatedly measured laboratory findings, and vital signs, which are dynamic variables rather than static. Given that recurrent neural network (RNN) is a powerful tool to handle the sequential data, using RNN method in the prediction model is a promising approach. Therefore, in the present study, we proposed a RNN-based prediction model with external validation for in-hospital AKI and aimed to provide a framework to link the developed model with clinical decision supports. Method Study populations were all patients aged ≥ 18 years and hospitalized more than a week at Seoul National University Bundang Hospital (SNUBH) from 2013 to 2017 (training cohort) and at Seoul National University Hospital (SNUH) in 2017 (validation cohort). All demographics, laboratory values, vital signs, and clinical conditions were obtained from the EHR of each hospital. A total of 102 variables included in the model. Each variable falls into two categories: static and dynamic variable; static variable was time-invariant values during hospitalization, and dynamic variables were daily-updated values. Baseline creatinine was determined by searching the minimum serum Cr level within 2 weeks before admission. We developed two different models (model 1 and model 2) using RNN algorithms. The outcome for model 1 was the occurrence of AKI within 7 days from the present. In model 2, we constructed the prediction model of the trajectory of Cr values after 24 hours, 48 hours, and 72 hours, using available Cr values from 7 days ago to the present. Internal validation was performed by 5-fold cross validation using the training set (SNUBH), and then external validation was done using test set (SNUH). Results A total of 40,552 patients in training cohort and 4,000 patients in external validation cohort (test cohort) were included in the study. The mean age of participants was 62.2 years in training cohort and 58.7 years in test cohort. Baseline eGFR was 93.8 ± 40.4 ml/min/1.73m2 in training cohort and 88.4 ± 23.2 ml/min/1.73m2 in test cohort. In model 1 for the prediction of AKI occurrence within 7 days, the area under the curve was 0.93 (sensitivity 0.90, specificity 0.96) in internal validation, and 0.83 (sensitivity 0.83, specificity 0.82) in external validation. The model 2 predicted the creatinine trajectory within 3 days accurately; root mean square error was 0.1 in training cohort and 0.3 in test cohort. To support the clinical decision for AKI manage, we estimated the predicted trajectories of future creatinine levels after renal insult removal, such as nephrotoxic drugs, based on the established model 2. Conclusion We developed and validated a real-time AKI prediction model using RNN algorithms. This model showed high performance and can accurately visualize future creatinine trajectories. In addition, the model can provide the information about modifiable factors in patients with high risk of AKI.


Diagnostics ◽  
2021 ◽  
Vol 11 (12) ◽  
pp. 2242
Author(s):  
Jingyi Wu ◽  
Yu Lin ◽  
Pengfei Li ◽  
Yonghua Hu ◽  
Luxia Zhang ◽  
...  

This study aimed to construct machine learning (ML) models for predicting prolonged length of stay (pLOS) in intensive care units (ICU) among general ICU patients. A multicenter database called eICU (Collaborative Research Database) was used for model derivation and internal validation, and the Medical Information Mart for Intensive Care (MIMIC) III database was used for external validation. We used four different ML methods (random forest, support vector machine, deep learning, and gradient boosting decision tree (GBDT)) to develop prediction models. The prediction performance of the four models were compared with the customized simplified acute physiology score (SAPS) II. The area under the receiver operation characteristic curve (AUROC), area under the precision-recall curve (AUPRC), estimated calibration index (ECI), and Brier score were used to measure performance. In internal validation, the GBDT model achieved the best overall performance (Brier score, 0.164), discrimination (AUROC, 0.742; AUPRC, 0.537), and calibration (ECI, 8.224). In external validation, the GBDT model also achieved the best overall performance (Brier score, 0.166), discrimination (AUROC, 0.747; AUPRC, 0.536), and calibration (ECI, 8.294). External validation showed that the calibration curve of the GBDT model was an optimal fit, and four ML models outperformed the customized SAPS II model. The GBDT-based pLOS-ICU prediction model had the best prediction performance among the five models on both internal and external datasets. Furthermore, it has the potential to assist ICU physicians to identify patients with pLOS-ICU risk and provide appropriate clinical interventions to improve patient outcomes.


2021 ◽  
Vol 12 ◽  
Author(s):  
Wei Guo ◽  
Ze Yu ◽  
Ya Gao ◽  
Xiaoqian Lan ◽  
Yannan Zang ◽  
...  

Risperidone is an efficacious second-generation antipsychotic (SGA) to treat a wide spectrum of psychiatric diseases, whereas its active moiety (risperidone and 9-hydroxyrisperidone) concentration without a therapeutic reference range may increase the risk of adverse drug reactions. We aimed to establish a prediction model of risperidone active moiety concentration in the next therapeutic drug monitoring (TDM) based on the initial TDM information using machine learning methods. A total of 983 patients treated with risperidone between May 2017 and May 2018 in Beijing Anding Hospital were collected as the data set. Sixteen predictors (the initial TDM value, dosage, age, WBC, PLT, BUN, weight, BMI, prolactin, ALT, MECT, Cr, AST, Ccr, TDM interval, and RBC) were screened from 26 variables through univariate analysis (p < 0.05) and XGBoost (importance score >0). Ten algorithms (XGBoost, LightGBM, CatBoost, AdaBoost, Random Forest, support vector machine, lasso regression, ridge regression, linear regression, and k-nearest neighbor) compared the model performance, and ultimately, XGBoost was chosen to establish the prediction model. A cohort of 210 patients treated with risperidone between March 1, 2019, and May 31, 2019, in Beijing Anding Hospital was used to validate the model. Finally, the prediction model was evaluated, obtaining R2 (0.512 in test cohort; 0.374 in validation cohort), MAE (10.97 in test cohort; 12.07 in validation cohort), MSE (198.55 in test cohort; 324.15 in validation cohort), RMSE (14.09 in test cohort; 18.00 in validation cohort), and accuracy of the predicted TDM within ±30% of the actual TDM (54.82% in test cohort; 60.95% in validation cohort). The prediction model has promising performance to facilitate rational risperidone regimen on an individualized level and provide reference for other antipsychotic drugs' risk prediction.


Gut ◽  
2019 ◽  
Vol 69 (3) ◽  
pp. 540-550 ◽  
Author(s):  
Shulin Yu ◽  
Yuchen Li ◽  
Zhuan Liao ◽  
Zheng Wang ◽  
Zhen Wang ◽  
...  

ObjectivePancreatic ductal adenocarcinoma (PDAC) is difficult to diagnose at resectable stage. Recent studies have suggested that extracellular vesicles (EVs) contain long RNAs. The aim of this study was to develop a diagnostic (d-)signature for the detection of PDAC based on EV long RNA (exLR) profiling.DesignWe conducted a case-control study with 501 participants, including 284 patients with PDAC, 100 patients with chronic pancreatitis (CP) and 117 healthy subjects. The exLR profile of plasma samples was analysed by exLR sequencing. The d-signature was identified using a support vector machine algorithm and a training cohort (n=188) and was validated using an internal validation cohort (n=135) and an external validation cohort (n=178).ResultsWe developed a d-signature that comprised eight exLRs, including FGA, KRT19, HIST1H2BK, ITIH2, MARCH2, CLDN1, MAL2 and TIMP1, for PDAC detection. The d-signature showed high accuracy, with an area under the receiver operating characteristic curve (AUC) of 0.960, 0.950 and 0.936 in the training, internal validation and external validation cohort, respectively. The d-signature was able to identify resectable stage I/II cancer with an AUC of 0.949 in the combined three cohorts. In addition, the d-signature showed superior performance to carbohydrate antigen 19-9 in distinguishing PDAC from CP (AUC 0.931 vs 0.873, p=0.028).ConclusionThis study is the first to characterise the plasma exLR profile in PDAC and to report an exLR signature for the detection of pancreatic cancer. This signature may improve the prognosis of patients who would have otherwise missed the curative treatment window.


2021 ◽  
Vol 8 ◽  
Author(s):  
Xiaoyun Cheng ◽  
Jinzhang Li ◽  
Tianming Xu ◽  
Kemin Li ◽  
Jingnan Li

Background: The number of patients diagnosed with rectal neuroendocrine tumors (R-NETs) is increasing year by year. An integrated survival predictive model is required to predict the prognosis of R-NETs. The present study is aimed at exploring epidemiological characteristics of R-NETs based on a retrospective study from the Surveillance, Epidemiology, and End Results (SEER) database and predicting survival of R-NETs with machine learning.Methods: Data of patients with R-NETs were extracted from the SEER database (2000–2017), and data were also retrospectively collected from a single medical center in China. The main outcome measure was the 5-year survival status. Risk factors affecting survival were analyzed by Cox regression analysis, and six common machine learning algorithms were chosen to build the predictive models. Data from the SEER database were divided into a training set and an internal validation set according to the year 2010 as a time point. Data from China were chosen as an external validation set. The best machine learning predictive model was compared with the American Joint Committee on Cancer (AJCC) seventh staging system to evaluate its predictive performance in the internal validation dataset and external validation dataset.Results: A total of 10,580 patients from the SEER database and 68 patients from a single medical center were included in the analysis. Age, gender, race, histologic type, tumor size, tumor number, summary stage, and surgical treatment were risk factors affecting survival status. After the adjustment of parameters and algorithms comparison, the predictive model using the eXtreme Gradient Boosting (XGBoost) algorithm had the best predictive performance in the training set [area under the curve (AUC) = 0.87, 95%CI: 0.86–0.88]. In the internal validation, the predictive ability of XGBoost was better than that of the AJCC seventh staging system (AUC: 0.90 vs. 0.78). In the external validation, the XGBoost predictive model (AUC = 0.89) performed better than the AJCC seventh staging system (AUC = 0.83).Conclusions: The XGBoost algorithm had better predictive power than the AJCC seventh staging system, which had a potential value of the clinical application.


2020 ◽  
Vol 2 (Supplement_3) ◽  
pp. ii14-ii14
Author(s):  
Toru Umehara ◽  
Manabu Kinoshita ◽  
Takahiro Sasaki ◽  
Hideyuki Arita ◽  
Ema Yoshioka ◽  
...  

Abstract Introduction: Clinical application of survival prediction of primary glioblastoma (pGBM) using preoperative images remains challenging due to a lack of robustness and standardization of the method. This research focused on validating a machine learning-based texture analysis model for this purpose using internal and external cohorts. Method: We included all cases of IDH wild-type pGBM available of preoperative MRI (T1WI, T2WI, and Gd-T1WI) from the databases of Kansai Molecular Diagnosis Network for CNS tumors (KN) and The Cancer Genome Atlas (TCGA). Of 242 cases from KN, we assigned 137 cases as a training dataset (D1), and the remaining 105 cases as an internal validation dataset (D2). Furthermore, we extracted 96 cases from TCGA as an external validation dataset (D3). Preoperative MRI scans were semi-quantitatively analyzed, leading to the acquisition of 489 texture features as explanatory variables. Dichotomous overall survival (OS) with a 16.6 months cutoff was regarded as the response variable (short/long OS). We employed Lasso regression for feature selection, and a survival prediction model constructed for D1 via cross-validation (M1) was applied to D2 and D3 to ensure the model robustness. Results: The population of predicted short OS by M1 significantly showed poorer prognosis in D2 (median OS 11.1 vs. 19.4 months; log-rank test, p=0.03), while there was no significant difference in D3 (median OS 14.2 vs. 11.9 months; p=0.61). In the comparative analysis using t-SNE, there was little variation in the feature distribution among three datasets. Conclusion: We were able to validate the prediction model in the internal but not in the external cohort. The presented result supports the use of machine learning-based texture analysis for survival prediction of pGBM in a localized population or country. However, further consideration is required to achieve a universal prediction model for pGBM, irrespective of regional difference.


2021 ◽  
Vol 8 ◽  
Author(s):  
Wenwen Xu ◽  
Wanlong Wu ◽  
Yu Zheng ◽  
Zhiwei Chen ◽  
Xinwei Tao ◽  
...  

Objectives: Anti-melanoma differentiation-associated gene 5-positive dermatomyositis-associated interstitial lung disease (MDA5+ DM-ILD) is a life-threatening disease. The current study aimed to quantitatively assess the pulmonary high-resolution computed tomography (HRCT) images of MDA5+ DM-ILD by applying the radiomics approach and establish a multidimensional risk prediction model for the 6-month mortality.Methods: This retrospective study was conducted in 228 patients from two centers, namely, a derivation cohort and a longitudinal internal validation cohort in Renji Hospital, as well as an external validation cohort in Guangzhou. The derivation cohort was randomly divided into training and testing sets. The primary outcome was 6-month all-cause mortality since the time of admission. Baseline pulmonary HRCT images were quantitatively analyzed by radiomics approach, and a radiomic score (Rad-score) was generated. Clinical predictors selected by univariable Cox regression were further incorporated with the Rad-score, to enhance the prediction performance of the final model (Rad-score plus model). In parallel, an idiopathic pulmonary fibrosis (IPF)-based visual CT score and ILD-GAP score were calculated as comparators.Results: The Rad-score was significantly associated with the 6-month mortality, outperformed the traditional visual score and ILD-GAP score. The Rad-score plus model was successfully developed to predict the 6-month mortality, with C-index values of 0.88 [95% confidence interval (CI), 0.79–0.96] in the training set (n = 121), 0.88 (95%CI, 0.71–1.0) in the testing set (n = 31), 0.83 (95%CI, 0.68–0.98) in the internal validation cohort (n = 44), and 0.84 (95%CI, 0.64–1.0) in the external validation cohort (n = 32).Conclusions: The radiomic feature was an independent and reliable prognostic predictor for MDA5+ DM-ILD.


2021 ◽  
Vol 10 ◽  
Author(s):  
Zhizhen Li ◽  
Lei Yuan ◽  
Chen Zhang ◽  
Jiaxing Sun ◽  
Zeyuan Wang ◽  
...  

Background and ObjectivesCurrently, the prognostic performance of the staging systems proposed by the 8th edition of the American Joint Committee on Cancer (AJCC 8th) and the Liver Cancer Study Group of Japan (LCSGJ) in resectable intrahepatic cholangiocarcinoma (ICC) remains controversial. The aim of this study was to use machine learning techniques to modify existing ICC staging strategies based on clinical data and to demonstrate the accuracy and discrimination capacity in prognostic prediction.Patients and MethodsThis is a retrospective study based on 1,390 patients who underwent surgical resection for ICC at Eastern Hepatobiliary Surgery Hospital from 2007 to 2015. External validation was performed for patients from 2015 to 2017. The ensemble of three machine learning algorithms was used to select the most important prognostic factors and stepwise Cox regression was employed to derive a modified scoring system. The discriminative ability and predictive accuracy were assessed using the Concordance Index (C-index) and Brier Score (BS). The results were externally validated through a cohort of 42 patients operated on from the same institution.ResultsSix independent prognosis factors were selected and incorporated in the modified scoring system, including carcinoembryonic antigen, carbohydrate antigen 19-9, alpha-fetoprotein, prealbumin, T and N of ICC staging category in 8th edition of AJCC. The proposed scoring system showed a more favorable discriminatory ability and model performance than the AJCC 8th and LCSGJ staging systems, with a higher C-index of 0.693 (95% CI, 0.663–0.723) in the internal validation cohort and 0.671 (95% CI, 0.602–0.740) in the external validation cohort, which was then confirmed with lower BS (0.103 in internal validation cohort and 0.169 in external validation cohort). Meanwhile, machine learning techniques for variable selection together with stepwise Cox regression for survival analysis shows a better prognostic accuracy than using stepwise Cox regression method only.ConclusionsThis study put forward a modified ICC scoring system based on prognosis factors selection incorporated with machine learning, for individualized prognosis evaluation in patients with ICC.


Sign in / Sign up

Export Citation Format

Share Document