Predicting the required pre-surgery blood volume in surgical patients based on machine learning

Mapping Intimacies ◽

10.1101/19008045 ◽

2019 ◽

Author(s):

Ruilin Li ◽

Xinyin Han ◽

Liping Sun ◽

Yannan Feng ◽

Xiaolin Sun ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Blood Transfusion ◽

Blood Volume ◽

Prediction Models ◽

Machine Learning Algorithms ◽

Random Forest Model ◽

Surgical Patients ◽

Forest Model ◽

Formidable Challenge

AbstractPrecisely predicting the required pre-surgery blood volume (PBV) in surgical patients is a formidable challenge in China. Inaccurate estimation is associate with excessive costs, postponed surgeries and adverse outcome after surgery due to in sufficient supply or inventory. This study aimed to predict required PBV based on machine learning techniques. 181,027 medical documents over 6 years were cleaned and finally obtained 92,057 blood transfusion records. The blood transfusion and surgery related factors of perioperative patients, surgeons experience volumes and the actual volumes of transfused RBCs were extracted. 6 machine learning algorithms were used to build prediction models. The surgery patients received allogenic RBCs or without transfusion, had total volume less than 10 units, or had the latest laboratory examinations of pre-surgery within 7 days were included, providing 118,823 data points. 39 predictive factors related to the RBCs transfusion were identified. Random forest model was selected to predict the required PBV of RBCs with 72.9% accuracy and strikingly improved the accuracy by 30.4% compared with surgeons experience, where 90% of data was used for training. We tested and demonstrated that both the data-driven models and the random forest model achieved higher accuracy than surgeons experience. Furthermore, we developed a computational tool, PTRBC, to precisely estimate the required PBV in surgical patients and we believe this tool will find more applications in assisting clinician decisions, not only confined to making accurate pre-surgery blood requirement predicting.

Download Full-text

The Random Forest Model Has the Best Accuracy Among the Four Pressure Ulcer Prediction Models Using Machine Learning Algorithms

Risk Management and Healthcare Policy ◽

10.2147/rmhp.s297838 ◽

2021 ◽

Vol Volume 14 ◽

pp. 1175-1187

Author(s):

Jie Song ◽

Yuan Gao ◽

Pengbin Yin ◽

Yi Li ◽

Yang Li ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Pressure Ulcer ◽

Prediction Models ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Random Forest Model ◽

Forest Model

Download Full-text

Accurate prediction of birth implementing a statistical model through the determination of steroid hormones in saliva

Scientific Reports ◽

10.1038/s41598-021-84924-0 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Silvia Alonso ◽

Sara Cáceres ◽

Daniel Vélez ◽

Luis Sanz ◽

Gema Silvan ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Area Under The Curve ◽

Machine Learning Algorithms ◽

Random Forest Model ◽

Forest Model ◽

Spontaneous Labour ◽

Hormonal Mechanism ◽

First Time ◽

Estrone Sulphate

AbstractSteroidal hormone interaction in pregnancy is crucial for adequate fetal evolution and preparation for childbirth and extrauterine life. Estrone sulphate, estriol, progesterone and cortisol play important roles in the initiation of labour mechanism at the start of contractions and cervical effacement. However, their interaction remains uncertain. Although several studies regarding the hormonal mechanism of labour have been reported, the prediction of date of birth remains a challenge. In this study, we present for the first time machine learning algorithms for the prediction of whether spontaneous labour will occur from week 37 onwards. Estrone sulphate, estriol, progesterone and cortisol were analysed in saliva samples collected from 106 pregnant women since week 34 by enzyme-immunoassay (EIA) techniques. We compared a random forest model with a traditional logistic regression over a dataset constructed with the values observed of these measures. We observed that the results, evaluated in terms of accuracy and area under the curve (AUC) metrics, are sensibly better in the random forest model. For this reason, we consider that machine learning methods contribute in an important way to the obstetric practice.

Download Full-text

Machine Learning Approach Using Routine Immediate Postoperative Laboratory Values for Predicting Postoperative Mortality

Journal of Personalized Medicine ◽

10.3390/jpm11121271 ◽

2021 ◽

Vol 11 (12) ◽

pp. 1271

Author(s):

Jaehyeong Cho ◽

Jimyung Park ◽

Eugene Jeong ◽

Jihye Shin ◽

Sangjeong Ahn ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Prediction Models ◽

External Validation ◽

Model Development ◽

Postoperative Mortality ◽

Random Forest Model ◽

Forest Model ◽

Laboratory Values ◽

Increased Risk

Background: Several prediction models have been proposed for preoperative risk stratification for mortality. However, few studies have investigated postoperative risk factors, which have a significant influence on survival after surgery. This study aimed to develop prediction models using routine immediate postoperative laboratory values for predicting postoperative mortality. Methods: Two tertiary hospital databases were used in this research: one for model development and another for external validation of the resulting models. The following algorithms were utilized for model development: LASSO logistic regression, random forest, deep neural network, and XGBoost. We built the models on the lab values from immediate postoperative blood tests and compared them with the SASA scoring system to demonstrate their efficacy. Results: There were 3817 patients who had immediate postoperative blood test values. All models trained on immediate postoperative lab values outperformed the SASA model. Furthermore, the developed random forest model had the best AUROC of 0.82 and AUPRC of 0.13, and the phosphorus level contributed the most to the random forest model. Conclusions: Machine learning models trained on routine immediate postoperative laboratory values outperformed previously published approaches in predicting 30-day postoperative mortality, indicating that they may be beneficial in identifying patients at increased risk of postoperative death.

Download Full-text

Clinical Prediction of Suicide Attempt in Schizophrenia Using a Machine Learning Approach

European Psychiatry ◽

10.1016/j.eurpsy.2017.01.297 ◽

2017 ◽

Vol 41 (S1) ◽

pp. S95-S96 ◽

Cited By ~ 1

Author(s):

V. De Luc ◽

A. Bani Fatemi ◽

N. Hettige

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Random Forest ◽

Suicide Ideation ◽

Suicide Attempts ◽

Machine Learning Algorithms ◽

Random Forest Model ◽

Predictive Tool ◽

Suicide Attempters ◽

Forest Model

ObjectiveSuicide is a major concern for those afflicted by schizophrenia. Identifying patients at the highest risk for future suicide attempts remains a complex problem for psychiatric intervention. Machine learning models allow for the integration of many risk factors in order to build an algorithm that predicts which patients are likely to attempt suicide. Currently, it is unclear how to integrate previously identified risk factors into a clinically relevant predictive tool to estimate the probability of a patient with schizophrenia for attempting suicide.MethodsWe conducted a cross-sectional assessment on a sample of 345 participants diagnosed with schizophrenia spectrum disorders. Suicide attempters and non-attempters were clearly identified using the Columbia Suicide Severity Rating Scale (C-SSRS) and the Beck Suicide Ideation Scale (BSS). We developed two classification algorithms using a regularized regression and random forest model with sociocultural and clinical variables as features to train the models.ResultsBoth classification models performed similarly in identifying suicide attempters and non-attempters. Our regularized logistic regression model demonstrated an accuracy of 66% and an area under the curve (AUC) of 0.71, while the random forest model demonstrated 65% accuracy and an AUC of 0.67.ConclusionMachine learning algorithms offer a relatively successful method for incorporating many clinical features to predict individuals at risk for future suicide attempts. Increased performance of these models using clinically relevant variables offers the potential to facilitate early treatment and intervention to prevent future suicide attempts.Disclosure of interestThe authors have not supplied their declaration of competing interest.

Download Full-text

Spatio-Temporal Variation Characteristics of Aboveground Biomass in the Headwater of the Yellow River Based on Machine Learning

Remote Sensing ◽

10.3390/rs13173404 ◽

2021 ◽

Vol 13 (17) ◽

pp. 3404

Author(s):

Rong Tang ◽

Yuting Zhao ◽

Huilong Lin

Keyword(s):

Machine Learning ◽

Random Forest ◽

Aboveground Biomass ◽

Yellow River ◽

Machine Learning Algorithms ◽

Random Forest Model ◽

Accurate Estimation ◽

The Yellow River ◽

Research Results ◽

Forest Model

Accurate estimation of the aboveground biomass (AGB) of grassland is a key link in understanding the regional carbon cycle. We used 501 aboveground measurements, 29 environmental variables, and machine learning algorithms to construct and verify a custom model of grassland biomass in the Headwater of the Yellow River (HYR) and selected the random forest model to analyze the temporal and spatial distribution characteristics and dynamic trends of the biomass in the HYR from 2001 to 2020. The research results show that: (1) the random forest model is superior to the other three models (R2val = 0.56, RMSEval = 51.3 g/m2); (2) the aboveground biomass in the HYR decreases spatially from southeast to northwest, and the annual average value and total values are 176.8 g/m2 and 20.73 Tg, respectively; (3) 69.51% of the area has shown an increasing trend and 30.14% of the area showed a downward trend, mainly concentrated in the southeast of Hongyuan County, the northeast of Aba County, and the north of Qumalai County. The research results can provide accurate spatial data and scientific basis for the protection of grassland resources in the HYR.

Download Full-text

Development of Prediction Models Using Machine Learning Algorithms for Girls with Suspected Central Precocious Puberty: Retrospective Study (Preprint)

10.2196/preprints.11728 ◽

2018 ◽

Author(s):

Liyan Pan ◽

Guangjian Liu ◽

Xiaojian Mao ◽

Huixian Li ◽

Jiexin Zhang ◽

...

Keyword(s):

Machine Learning ◽

Retrospective Study ◽

Random Forest ◽

Precocious Puberty ◽

Prediction Models ◽

Central Precocious Puberty ◽

Machine Learning Algorithms ◽

Stimulation Test ◽

Gnrh Analogue ◽

Prediction Probability

BACKGROUND Central precocious puberty (CPP) in girls seriously affects their physical and mental development in childhood. The method of diagnosis—gonadotropin-releasing hormone (GnRH)–stimulation test or GnRH analogue (GnRHa)–stimulation test—is expensive and makes patients uncomfortable due to the need for repeated blood sampling. OBJECTIVE We aimed to combine multiple CPP–related features and construct machine learning models to predict response to the GnRHa-stimulation test. METHODS In this retrospective study, we analyzed clinical and laboratory data of 1757 girls who underwent a GnRHa test in order to develop XGBoost and random forest classifiers for prediction of response to the GnRHa test. The local interpretable model-agnostic explanations (LIME) algorithm was used with the black-box classifiers to increase their interpretability. We measured sensitivity, specificity, and area under receiver operating characteristic (AUC) of the models. RESULTS Both the XGBoost and random forest models achieved good performance in distinguishing between positive and negative responses, with the AUC ranging from 0.88 to 0.90, sensitivity ranging from 77.91% to 77.94%, and specificity ranging from 84.32% to 87.66%. Basal serum luteinizing hormone, follicle-stimulating hormone, and insulin-like growth factor-I levels were found to be the three most important factors. In the interpretable models of LIME, the abovementioned variables made high contributions to the prediction probability. CONCLUSIONS The prediction models we developed can help diagnose CPP and may be used as a prescreening tool before the GnRHa-stimulation test.

Download Full-text

Data efficient Random Forest model for avalanche forecasting

10.5194/nhess-2019-379 ◽

2019 ◽

Author(s):

Manesh Chawla ◽

Amreek Singh

Keyword(s):

Machine Learning ◽

Random Forest ◽

Direct Reduction ◽

Random Forest Model ◽

Machine Learning Techniques ◽

Control Structures ◽

Avalanche Forecasting ◽

Forest Model ◽

Avalanche Flow ◽

Data Efficiency

Abstract. Fast downslope release of snow (avalanche) is a serious hazard to people living in snow bound mountains. Released snow mass can gain sufficient momentum on its down slope path to kill humans, uproot trees and rocks, destroy buildings. Direct reduction of avalanche threat is done by building control structures to add mechanical support to snowpack and reduce or deflect downward avalanche flow. On large terrains it is economically infeasible to use these methods on each high risk site.Therefore predicting and avoiding avalanches is the only feasible method to reduce threat but sufficient snow stability data for accurate forecasting is generally unavailable and difficult to collect. Forecasters infer snow stability from their knowledge of local weather, terrain and sparsely available snowpack observations. This inference process is vulnerable to human bias therefore machine learning models are used to find patterns from past data and generate helpful outputs to minimise and quantify uncertainty in forecasting process. These machine learning techniques require long past records of avalanches which are difficult to obtain. In this paper we propose a data efficient Random Forest model to address this problem. The model can generate a descriptive forecast showing reasoning and patterns which are difficult to observe manually. Our model advances the field by being inexpensive and convenient for operational forecasting due to its data efficiency, ease of automation and ability to describe its decisions.

Download Full-text

Machine Learning Models That Integrate Tumor Texture and Perfusion Characteristics Using Low-Dose Breast Computed Tomography Are Promising for Predicting Histological Biomarkers and Treatment Failure in Breast Cancer Patients

Cancers ◽

10.3390/cancers13236013 ◽

2021 ◽

Vol 13 (23) ◽

pp. 6013

Author(s):

Hyun-Soo Park ◽

Kwang-sig Lee ◽

Bo-Kyoung Seo ◽

Eun-Sil Kim ◽

Kyu-Ran Cho ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Random Forest ◽

Cancer Patients ◽

Low Dose ◽

Random Forest Model ◽

Breast Cancer Patients ◽

Learning Models ◽

Forest Model ◽

Machine Learning Models

This prospective study enrolled 147 women with invasive breast cancer who underwent low-dose breast CT (80 kVp, 25 mAs, 1.01–1.38 mSv) before treatment. From each tumor, we extracted eight perfusion parameters using the maximum slope algorithm and 36 texture parameters using the filtered histogram technique. Relationships between CT parameters and histological factors were analyzed using five machine learning algorithms. Performance was compared using the area under the receiver-operating characteristic curve (AUC) with the DeLong test. The AUCs of the machine learning models increased when using both features instead of the perfusion or texture features alone. The random forest model that integrated texture and perfusion features was the best model for prediction (AUC = 0.76). In the integrated random forest model, the AUCs for predicting human epidermal growth factor receptor 2 positivity, estrogen receptor positivity, progesterone receptor positivity, ki67 positivity, high tumor grade, and molecular subtype were 0.86, 0.76, 0.69, 0.65, 0.75, and 0.79, respectively. Entropy of pre- and postcontrast images and perfusion, time to peak, and peak enhancement intensity of hot spots are the five most important CT parameters for prediction. In conclusion, machine learning using texture and perfusion characteristics of breast cancer with low-dose CT has potential value for predicting prognostic factors and risk stratification in breast cancer patients.

Download Full-text

Identification of the diagnostic signature of sepsis based on bioinformatic analysis of gene expression and machine learning

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207323666201204130031 ◽

2020 ◽

Vol 23 ◽

Author(s):

Qian Zhao ◽

Ning Xu ◽

Hui Guo ◽

Jianguo Li

Keyword(s):

Gene Expression ◽

Machine Learning ◽

Random Forest ◽

Early Diagnosis ◽

Bioinformatic Analysis ◽

Random Forest Model ◽

Diagnostic Methods ◽

Ppi Network ◽

Hub Genes ◽

Forest Model

Background: Sepsis is a life-threatening disease caused by the dysregulated host response to the infection, and being the major cause of death to patients in intensive care unit (ICU). Objective: Early diagnosis of sepsis could significantly reduce in-hospital mortality. Though generated from infection, the development of sepsis follows its own psychological process and disciplines, alters with gender, health status and other factors. Hence, the analysis of mass data by bioinformatic tools and machine learning is a promising method for exploring early diagnosis manners. Method: We collected miRNA and mRNA expression data of sepsis blood samples from Gene Expression Omnibus (GEO) and ArrayExpress databases, screened out differentially expressed genes (DEGs) by R software, predicted miRNA targets on TargetScanHuman and miRTarBase websites, conducted Gene Ontology (GO) term and KEGG pathway enrichment based on overlapping DEGs. The STRING database and Cytoscape were used to build protein-protein interaction (PPI) network and predict hub genes. Then we constructed a Random Forest model by using the hub genes to assess sample type. Results: Bioinformatic analysis of GEO dataset revealed 46 overlapping DEGs in sepsis. The PPI network analysis identified five hub genes, SOCS3, KBTBD6, FBXL5, FEM1C and WSB1. Random Forest model based on these five hub genes was used to assess GSE95233 and GSE95233 datasets, and the area under curve (AUC) of ROC are 0.900 and 0.7988, respectively, which confirmed the efficacy of this model. Conclusion: The integrated analysis of gene expression in sepsis and the effective Random Forest model built in this study may provide promising diagnostic methods for sepsis.

Download Full-text

Machine Learning Approaches to Radiogenomics of Breast Cancer using Low-Dose Perfusion Computed Tomography: Predicting Prognostic Biomarkers and Molecular Subtypes

Scientific Reports ◽

10.1038/s41598-019-54371-z ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 2

Author(s):

Eun Kyung Park ◽

Kwang-sig Lee ◽

Bo Kyoung Seo ◽

Kyu Ran Cho ◽

Ok Hee Woo ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Random Forest ◽

Invasive Breast Cancer ◽

Low Dose ◽

Molecular Subtypes ◽

Random Forest Model ◽

Prognostic Biomarkers ◽

Learning Approaches ◽

Forest Model

AbstractRadiogenomics investigates the relationship between imaging phenotypes and genetic expression. Breast cancer is a heterogeneous disease that manifests complex genetic changes and various prognosis and treatment response. We investigate the value of machine learning approaches to radiogenomics using low-dose perfusion computed tomography (CT) to predict prognostic biomarkers and molecular subtypes of invasive breast cancer. This prospective study enrolled a total of 723 cases involving 241 patients with invasive breast cancer. The 18 CT parameters of cancers were analyzed using 5 machine learning models to predict lymph node status, tumor grade, tumor size, hormone receptors, HER2, Ki67, and the molecular subtypes. The random forest model was the best model in terms of accuracy and the area under the receiver-operating characteristic curve (AUC). On average, the random forest model had 13% higher accuracy and 0.17 higher AUC than the logistic regression. The most important CT parameters in the random forest model for prediction were peak enhancement intensity (Hounsfield units), time to peak (seconds), blood volume permeability (mL/100 g), and perfusion of tumor (mL/min per 100 mL). Machine learning approaches to radiogenomics using low-dose perfusion breast CT is a useful noninvasive tool for predicting prognostic biomarkers and molecular subtypes of invasive breast cancer.

Download Full-text