Machine Learning Models of Hemorrhage/Ischemia in Moyamoya Disease and Analysis of Its Risk Factors

Abstract Object: Identify the risk factors for hemorrhage/ischemia in patients with moyamoya disease and establish models using Logistic regression (LR), XGboost and Multilayer Perceptron (MLP), evaluating and comparison the effects of those models; providing theoretical basis for moyamoya disease patients to prevent stroke recurrence. Methods: This retrospective study used data from the database of Jiang Xi Province Medical Big Data Engineering & Technology Research Center; the data of patients with moyamoya disease admitted to the second affiliated hospital of Nanchang university from January 1, 2012 to December 31, 2019 were collected. A total of 994 patients with moyamoya disease were screened, including 496 patients with cerebral infarction and 498 patients with cerebral hemorrhage. LR, XGboost and MLP were used to establish models for hemorrhage /ischemia in moyamoya disease, the effects of different models were verified and compared. Result: LR, XGboost and MLP models all had good discrimination (AUC>0.75), and their AUC value are 0.9227（95%CI:0.9215-0.9239）、0.9677(95%CI:0.9657-0.9696)、 0.9672（95%CI：0.9643-0.9701）. Compared with LR model, the prediction ability of XGboost and MLP model in training and test set is improved, which is increased by 18.11% and 14.34% respectively in training set, and there is a significant difference. Conclusion: Compared with the traditional LR model, the machine learning models are more effective in predicting hemorrhage/ischemia in moyamoya disease.

Download Full-text

Influence of social determinants of health and county vaccination rates on machine learning models to predict COVID-19 case growth in Tennessee

BMJ Health & Care Informatics ◽

10.1136/bmjhci-2021-100439 ◽

2021 ◽

Vol 28 (1) ◽

pp. e100439

Author(s):

Lukasz S Wylezinski ◽

Coleman R Harris ◽

Cody N Heiser ◽

Jamieson D Gray ◽

Charles F Spurlock

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Social Determinants Of Health ◽

Social Determinants ◽

Determinants Of Health ◽

Learning Models ◽

Vaccination Rates ◽

Data Framework ◽

The Impact ◽

Machine Learning Models

IntroductionThe SARS-CoV-2 (COVID-19) pandemic has exposed health disparities throughout the USA, particularly among racial and ethnic minorities. As a result, there is a need for data-driven approaches to pinpoint the unique constellation of clinical and social determinants of health (SDOH) risk factors that give rise to poor patient outcomes following infection in US communities.MethodsWe combined county-level COVID-19 testing data, COVID-19 vaccination rates and SDOH information in Tennessee. Between February and May 2021, we trained machine learning models on a semimonthly basis using these datasets to predict COVID-19 incidence in Tennessee counties. We then analyzed SDOH data features at each time point to rank the impact of each feature on model performance.ResultsOur results indicate that COVID-19 vaccination rates play a crucial role in determining future COVID-19 disease risk. Beginning in mid-March 2021, higher vaccination rates significantly correlated with lower COVID-19 case growth predictions. Further, as the relative importance of COVID-19 vaccination data features grew, demographic SDOH features such as age, race and ethnicity decreased while the impact of socioeconomic and environmental factors, including access to healthcare and transportation, increased.ConclusionIncorporating a data framework to track the evolving patterns of community-level SDOH risk factors could provide policy-makers with additional data resources to improve health equity and resilience to future public health emergencies.

Download Full-text

Machine Learning Models with Preoperative Risk Factors and Intraoperative Hypotension Parameters Predict Mortality After Cardiac Surgery

Journal of Cardiothoracic and Vascular Anesthesia ◽

10.1053/j.jvca.2020.07.029 ◽

2020 ◽

Author(s):

Marta Priscila Bento Fernandes ◽

Miguel Armengol de la Hoz ◽

Valluvan Rangasamy ◽

Balachundhar Subramaniam

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Cardiac Surgery ◽

Learning Models ◽

Preoperative Risk ◽

Preoperative Risk Factors ◽

Intraoperative Hypotension ◽

Machine Learning Models

Download Full-text

Machine Learning Algorithm Using Electronic Chart-Derived Data to Predict Delirium After Elderly Hip Fracture Surgeries: A Retrospective Case-Control Study

Frontiers in Surgery ◽

10.3389/fsurg.2021.634629 ◽

2021 ◽

Vol 8 ◽

Author(s):

Hong Zhao ◽

Jiaming You ◽

Yuexing Peng ◽

Yi Feng

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Hip Fracture ◽

Elderly Patients ◽

Postoperative Delirium ◽

Frailty Index ◽

Fracture Repair ◽

Learning Models ◽

Electronic Chart ◽

Machine Learning Models

Background: Elderly patients undergoing hip fracture repair surgery are at increased risk of delirium due to aging, comorbidities, and frailty. But current methods for identifying the high risk of delirium among hospitalized patients have moderate accuracy and require extra questionnaires. Artificial intelligence makes it possible to establish machine learning models that predict incident delirium risk based on electronic health data.Methods: We conducted a retrospective case-control study on elderly patients (≥65 years of age) who received orthopedic repair with hip fracture under spinal or general anesthesia between June 1, 2018, and May 31, 2019. Anesthesia records and medical charts were reviewed to collect demographic, surgical, anesthetic features, and frailty index to explore potential risk factors for postoperative delirium. Delirium was assessed by trained nurses using the Confusion Assessment Method (CAM) every 12 h during the hospital stay. Four machine learning risk models were constructed to predict the incidence of postoperative delirium: random forest, eXtreme Gradient Boosting (XGBoosting), support vector machine (SVM), and multilayer perception (MLP). K-fold cross-validation was deployed to accomplish internal validation and performance evaluation.Results: About 245 patients were included and postoperative delirium affected 12.2% (30/245) of the patients. Multiple logistic regression revealed that dementia/history of stroke [OR 3.063, 95% CI (1.231, 7.624)], blood transfusion [OR 2.631, 95% CI (1.055, 6.559)], and preparation time [OR 1.476, 95% CI (1.170, 1.862)] were associated with postoperative delirium, achieving an area under receiver operating curve (AUC) of 0.779, 95% CI (0.703, 0.856).The accuracy of machine learning models for predicting the occurrence of postoperative delirium ranged from 83.67 to 87.75%. Machine learning methods detected 16 risk factors contributing to the development of delirium. Preparation time, frailty index uses of vasopressors during the surgery, dementia/history of stroke, duration of surgery, and anesthesia were the six most important risk factors of delirium.Conclusion: Electronic chart-derived machine learning models could generate hospital-specific delirium prediction models and calculate the contribution of risk factors to the occurrence of delirium. Further research is needed to evaluate the significance and applicability of electronic chart-derived machine learning models for the detection risk of delirium in elderly patients undergoing hip fracture repair surgeries.

Download Full-text

Influence of social determinants of health and county vaccination rates on machine learning models to predict COVID-19 case growth in Tennessee

10.1101/2021.07.28.21260814 ◽

2021 ◽

Author(s):

Lukasz S Wylezinski ◽

Coleman R Harris ◽

Cody N Heiser ◽

Jamieson D Gray ◽

Charles F Spurlock

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Social Determinants Of Health ◽

Social Determinants ◽

The United States ◽

Determinants Of Health ◽

Learning Models ◽

Vaccination Rates ◽

The Impact ◽

Machine Learning Models

The SARS-CoV-2 (COVID-19) pandemic has exposed health disparities throughout the United States, particularly among racial and ethnic minorities. As a result, there is a need for data-driven approaches to pinpoint the unique constellation of clinical and social determinants of health (SDOH) risk factors that give rise to poor patient outcomes following infection in US communities. We combined county-level COVID-19 testing data, COVID-19 vaccination rates, and SDOH information in Tennessee. Between February-May 2021, we trained machine learning models on a semi-monthly basis using these datasets to predict COVID-19 incidence in Tennessee counties. We then analyzed SDOH data features at each time point to rank the impact of each feature on model performance. Our results indicate that COVID-19 vaccination rates play a crucial role in determining future COVID-19 disease risk. Beginning in mid-March 2021, higher vaccination rates significantly correlated with lower COVID-19 case growth predictions. Further, as the relative importance of COVID-19 vaccination data features grew, demographic SDOH features such as age, race, and ethnicity decreased while the impact of socioeconomic and environmental factors, including access to healthcare and transportation, increased. Incorporating a data framework to track the evolving patterns of community-level SDOH risk factors could provide policymakers with additional data resources to improve health equity and resilience to future public health emergencies.

Download Full-text

Analysis of main risk factors causing stroke in Shanxi Province based on machine learning models

Informatics in Medicine Unlocked ◽

10.1016/j.imu.2021.100712 ◽

2021 ◽

pp. 100712

Author(s):

Junjie Liu ◽

Yiyang Sun ◽

Jing Ma ◽

Jiachen Tu ◽

Yuhui Deng ◽

...

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Shanxi Province ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Machine-Learning Based Early Warning System for Prediction for Disseminated Intravascular Coagulation after Allogeneic Hematopoietic Stem Cell Transplantation: A Nationwide Multicenter Study

Blood ◽

10.1182/blood-2021-150437 ◽

2021 ◽

Vol 138 (Supplement 1) ◽

pp. 2113-2113

Author(s):

Zhuo-Yu An ◽

Ye-Jun Wu ◽

Yun He ◽

Xiao-Lu Zhu ◽

Yan Su ◽

...

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Stem Cell ◽

Validation Cohort ◽

Lasso Regression ◽

Learning Models ◽

Good Discrimination ◽

Peking University ◽

Machine Learning Models ◽

Good Calibration

Abstract Introduction Allogeneic haematopoietic stem cell transplantation (allo-HSCT) has been demonstrated to be the most effective therapy for various malignant as well as nonmalignant haematological diseases. The wide use of allo-HSCT has inevitably led to a variety of complications after transplantation, with bleeding complications such as disseminated intravascular coagulation (DIC). DIC accounts for a significant proportion of life-threatening bleeding cases occurring after allo-HSCT. However, information on markers for early identification remains limited, and no predictive tools for DIC after allo-HSCT are available. This research aimed to identify the risk factors for DIC after allo-HSCT and establish prediction models to predict the occurrence of DIC after allo-HSCT. Methods The definition of DIC was based on the International Society of Thrombosis and Hemostasis (ISTH) scoring system. Overall, 197 patients with DIC after allo-HSCT at Peking University People's Hospital and other 7 centers in China from January 2010 to June 2021 were retrospectively identified. Each patient was randomly matched to 3 controls based on the time of allo-HSCT (±3 months) and length of follow-up (±6 months). A lasso regression model was used for data dimension reduction, feature selection, and risk factor building. Multivariable logistic regression analysis was used to develop the prediction model. We incorporated the clinical risk factors, and this was presented with a nomogram. The performance of the nomogram was assessed with respect to its calibration, discrimination, and clinical usefulness. Internal and external validation was assessed. Various machine learning models were further used to perform machine learning modeling by attempting to complete the data sample classification task, including XGBClassifier, LogisticRegression, MLPClassifier, RandomForestClassifier, and AdaBoostClassifier. Results A total of 7280 patients received allo-HSCT from January 2010 to June 2021, and DIC occurred in 197 of these patients (incidence of 2.7%). The derivation cohort included 120 DIC patients received allo-HSCT and 360 patients received allo-HSCT from Peking University People's Hospital, and the validation cohort included the remaining 77 patients received allo-HSCT and 231 patients received allo-HSCT from the other 7 centers. The median time for DIC events was 99.0 (IQR, 46.8-220) days after allo-HSCT. The overall survival of patients with DIC was significantly reduced (P ＜ 0.0001). By Lasso regression, the 10 variables with the highest importance were found to be prothrombin time activity (PTA), shock, C-reactive protein, internationalization normalized ratio, bacterial infection, oxygenation, fibrinogen, blood creatinine, white blood cell count, and acute respiratory distress syndrome (from highest to lowest). In the multivariate analysis, the independent risk factors for DIC included PTA, bacterial infection and shock (P <0.001), and these predictors were included in the clinical prediction nomogram. The model showed good discrimination, with a C-index of 0.975 (95%CI, 0.939 to 0.987 through internal validation) and good calibration. Application of the nomogram in the validation cohort still gave good discrimination (C-index, 0.778 [95% CI, 0.759 to 0.766]) and good calibration. Decision curve analysis demonstrated that the nomogram was clinically useful. The predictive value ROC curves of different machine learning models show that XGBClassifier is the best performing model for this dataset, with an area under the curve of 0.86. Conclusions Risk factors for DIC after allo-HSCT were identified, and a nomogram model and various machine learning models were established to predict the occurrence of DIC after allo-HSCT. Combined, these can help recognize high-risk patients and provide timely treatment. In the future, we will further refine the prognostic model utilizing nationwide multicenter data and conduct prospective clinical trials to reduce the incidence of DIC after allo-HSCT and improve the prognosis. Disclosures No relevant conflicts of interest to declare.

Download Full-text

A Modified Drake Equation for Assessing Adversarial Risk to Machine Learning Models

10.5121/csit.2021.111001 ◽

2021 ◽

Author(s):

Josh Kalin ◽

David Noever ◽

Matthew Ciolino ◽

Gerry Dozier

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Potential Risk ◽

Learning Models ◽

Contributing Factors ◽

Empirical Measures ◽

Research Fields ◽

Potential Risk Factors ◽

Adversarial Attack ◽

Machine Learning Models

Machine learning models present a risk of adversarial attack when deployed in production. Quantifying the contributing factors and uncertainties using empirical measures could assist the industry with assessing the risk of downloading and deploying common model types. This work proposes modifying the traditional Drake Equation’s formalism to estimate the number of potentially successful adversarial attacks on a deployed model. The Drake Equation is famously used for parameterizing uncertainties and it has been used in many research fields outside of its original intentions to estimate the number of radio-capable extra-terrestrial civilizations. While previous work has outlined methods for discovering vulnerabilities in public model architectures, the proposed equation seeks to provide a semi-quantitative benchmark for evaluating and estimating the potential risk factors for adversarial attacks.

Download Full-text

Comparing Decision Tree-Based Ensemble Machine Learning Models for COVID-19 Death Probability Profiling

10.1101/2020.12.06.20244756 ◽

2020 ◽

Author(s):

Carlos Pedro Gonçalves ◽

José Rouco

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Decision Tree ◽

Age Group ◽

Learning Models ◽

Ensemble Machine Learning ◽

Death Probability ◽

Group Sex ◽

The Impact ◽

Machine Learning Models

AbstractWe compare the performance of major decision tree-based ensemble machine learning models on the task of COVID-19 death probability prediction, conditional on three risk factors: age group, sex and underlying comorbidity or disease, using the US Centers for Disease Control and Prevention (CDC)’s COVID-19 case surveillance dataset. To evaluate the impact of the three risk factors on COVID-19 death probability, we extract and analyze the conditional probability profile produced by the best performer. The results show the presence of an exponential rise in death probability from COVID-19 with the age group, with males exhibiting a higher exponential growth rate than females, an effect that is stronger when an underlying comorbidity or disease is present, which also acts as an accelerator of COVID-19 death probability rise for both male and female subjects. The results are discussed in connection to healthcare and epidemiological concerns and in the degree to which they reinforce findings coming from other studies on COVID-19.

Download Full-text

Predicting mortality in SARS-COV-2 (COVID-19) positive patients in the inpatient setting using a Novel Deep Neural Network

10.1101/2020.12.13.20247254 ◽

2020 ◽

Author(s):

Maleeha Naseem ◽

Hajra Arshad ◽

Syeda Amrah Hashimi ◽

Furqan Irfan ◽

Fahad Shabbir Ahmed

Keyword(s):

Neural Network ◽

Machine Learning ◽

Risk Factors ◽

Deep Neural Network ◽

Multivariate Analyses ◽

High Accuracy ◽

Inpatient Setting ◽

Learning Models ◽

Rt Pcr ◽

Machine Learning Models

ABSTRACTBackgroundThe second wave of COVID-19 pandemic is anticipated to be worse than the initial one and will strain the healthcare systems even more during the winter months. Our aim was to develop a machine learning-based model to predict mortality using the deep learning Neo-V framework. We hypothesized this novel machine learning approach could be applied to COVID-19 patients to predict mortality successfully with high accuracy.MethodsThe current Deep-Neo-V model is built on our previously statistically rigorous machine learning framework [Fahad-Liaqat-Ahmad Intensive Machine (FLAIM) framework] that evaluated statistically significant risk factors, generated new combined variables and then supply these risk factors to deep neural network to predict mortality in RT-PCR positive COVID-19 patients in the inpatient setting. We analyzed adult patients (≥18 years) admitted to the Aga Khan University Hospital, Pakistan with a working diagnosis of COVID-19 infection (n=1228). We excluded patients that were negative on COVID-19 on RT-PCR, had incomplete or missing health records. The first phase selection of risk factor was done using Cox-regression univariate and multivariate analyses. In the second phase, we generated new variables and tested those statistically significant for mortality and in the third and final phase we applied deep neural networks and other traditional machine learning models like Decision Tree Model, k-nearest neighbor models and others.ResultsA total of 1228 cases were diagnosed as COVID-19 infection, we excluded 14 patients after the exclusion criteria and (n=)1214 patients were analyzed. We observed that several clinical and laboratory-based variables were statistically significant for both univariate and multivariate analyses while others were not. With most significant being septic shock (hazard ratio [HR], 4.30; 95% confidence interval [CI], 2.91-6.37), supportive treatment (HR, 3.51; 95% CI, 2.01-6.14), abnormal international normalized ratio (INR) (HR, 3.24; 95% CI, 2.28-4.63), admission to the intensive care unit (ICU) (HR, 3.24; 95% CI, 2.22-4.74), treatment with invasive ventilation (HR, 3.21; 95% CI, 2.15-4.79) and laboratory lymphocytic derangement (HR, 2.79; 95% CI, 1.6-4.86). Machine learning results showed our DNN (Neo-V) model outperformed all conventional machine learning models with test set accuracy of 99.53%, sensitivity of 89.87%, and specificity of 95.63%; positive predictive value, 50.00%; negative predictive value, 91.05%; and area under the curve of the receiver-operator curve of 88.5.ConclusionOur novel Deep-Neo-V model outperformed all other machine learning models. The model is easy to implement, user friendly and with high accuracy.

Download Full-text