Benchmarking Cesarean Delivery Rates using Machine Learning‐Derived Optimal Classification Trees

Author(s):  
Alexis C. Gimovsky ◽  
Daisy Zhuo ◽  
Jordan Levine ◽  
Jack Dunn ◽  
Maxime Amarm ◽  
...  
Author(s):  
Dimitris Bertsimas ◽  
Daisy Zhuo ◽  
Jordan Levine ◽  
Jack Dunn ◽  
Zdzislaw Tobota ◽  
...  

Background: We have previously shown that the machine learning methodology of optimal classification trees (OCTs) can accurately predict risk after congenital heart surgery (CHS). We have now applied this methodology to define benchmarking standards after CHS, permitting case-adjusted hospital-specific performance evaluation. Methods: The European Congenital Heart Surgeons Association Congenital Database data subset (31 792 patients) who had undergone any of the 10 “benchmark procedure group” primary procedures were analyzed. OCT models were built predicting hospital mortality (HM), and prolonged postoperative mechanical ventilatory support time (MVST) or length of hospital stay (LOS), thereby establishing case-adjusted benchmarking standards reflecting the overall performance of all participating hospitals, designated as the “virtual hospital.” These models were then used to predict individual hospitals’ expected outcomes (both aggregate and, importantly, for risk-matched patient cohorts) for their own specific cases and case-mix, based on OCT analysis of aggregate data from the “virtual hospital.” Results: The raw average rates were HM = 4.4%, MVST = 15.3%, and LOS = 15.5%. Of 64 participating centers, in comparison with each hospital's specific case-adjusted benchmark, 17.0% statistically (under 90% confidence intervals) overperformed and 26.4% underperformed with respect to the predicted outcomes for their own specific cases and case-mix. For MVST and LOS, overperformers were 34.0% and 26.4%, and underperformers were 28.3% and 43.4%, respectively. OCT analyses reveal hospital-specific patient cohorts of either overperformance or underperformance. Conclusions: OCT benchmarking analysis can assess hospital-specific case-adjusted performance after CHS, both overall and patient cohort-specific, serving as a tool for hospital self-assessment and quality improvement.


Author(s):  
Dimitris Bertsimas ◽  
Daisy Zhuo ◽  
Jack Dunn ◽  
Jordan Levine ◽  
Eugenio Zuccarelli ◽  
...  

Objective: Risk assessment tools typically used in congenital heart surgery (CHS) assume that various possible risk factors interact in a linear and additive fashion, an assumption that may not reflect reality. Using artificial intelligence techniques, we sought to develop nonlinear models for predicting outcomes in CHS. Methods: We built machine learning (ML) models to predict mortality, postoperative mechanical ventilatory support time (MVST), and hospital length of stay (LOS) for patients who underwent CHS, based on data of more than 235,000 patients and 295,000 operations provided by the European Congenital Heart Surgeons Association Congenital Database. We used optimal classification trees (OCTs) methodology for its interpretability and accuracy, and compared to logistic regression and state-of-the-art ML methods (Random Forests, Gradient Boosting), reporting their area under the curve (AUC or c-statistic) for both training and testing data sets. Results: Optimal classification trees achieve outstanding performance across all three models (mortality AUC = 0.86, prolonged MVST AUC = 0.85, prolonged LOS AUC = 0.82), while being intuitively interpretable. The most significant predictors of mortality are procedure, age, and weight, followed by days since previous admission and any general preoperative patient risk factors. Conclusions: The nonlinear ML-based models of OCTs are intuitively interpretable and provide superior predictive power. The associated risk calculator allows easy, accurate, and understandable estimation of individual patient risks, in the theoretical framework of the average performance of all centers represented in the database. This methodology has the potential to facilitate decision-making and resource optimization in CHS, enabling total quality management and precise benchmarking initiatives.


2017 ◽  
Vol 106 (7) ◽  
pp. 1039-1082 ◽  
Author(s):  
Dimitris Bertsimas ◽  
Jack Dunn

2020 ◽  
Vol 15 ◽  
pp. 08-15
Author(s):  
Nisana Siddegowda Prema ◽  
Mullur Puttabuddi Pushpalatha

The study aims to analyze the association between gestational diabetes mellitus (GDM) and other risk factors of cesarean delivery using machine learning (ML). The dataset used for the analysis is from the pregnancy risk assessment survey (PRAMS), considered in two scenarios, i.e., all the data is taken, and all the data of the women who developed GDM. Further, the data is developed in two groups Data-I and Data-II by considering multiparous and primiparous women details, respectively. The correlation analysis and major classification algorithms are applied to the data. It is founded that the top risk factors for the first time cesarean delivery are the age, height, weight, race of the women, presence of hypertension and gestational diabetes mellitus. The major risk factor for repeated cesarean delivery is the previous cesarean delivery. The presence of GDM is also one of the risk factors for cesarean delivery.


COMPSTAT ◽  
2000 ◽  
pp. 427-432
Author(s):  
Petr Savický ◽  
Jan Klaschka ◽  
Jaromír Antoch

2019 ◽  
Vol 15 (S341) ◽  
pp. 109-113
Author(s):  
Agnieszka Pollo ◽  
Aleksandra Solarz ◽  
Małgorzata Siudek ◽  
Katarzyna Małek ◽  
Maciej Bilicki ◽  
...  

AbstractIn this paper we address two questions related to data analysis in large astronomical datasets, and we demonstrate how they can be answered making use of machine learning techniques. The first question is: how to efficiently find previously unknown or rare objects which can be expected to exist in big data samples? Using the largest existing extragalactic all-sky survey, provided by the WISE satellite, we demonstrate that, surprisingly, supervised classification methods can come to aid. The second question is: having a sufficiently large data sample, how can we look for new optimal classification schemes, possibly finding new and previously unknown classes and subclasses of sources? Based on the VIPERS cutting-edge galaxy catalog at redshift z > 0.5, we demonstrate that unsupervised classification methods can give unexpected but physically well-motivated results.


2014 ◽  
pp. 115-123
Author(s):  
Rachid Beghdad

The purpose of this study is to identify some higher-level KDD features, and to train the resulting set with an appropriate machine learning technique, in order to classify and predict attacks. To achieve that, a two-steps approach is proposed. Firstly, the Fisher’s ANOVA technique was used to deduce the important features. Secondly, 4 types of classification trees: ID3, C4.5, classification and regression tree (CART), and random tree (RnDT), were tested to classify and detect attacks. According to our tests, the RndT leads to the better results. That is why we will present here the classification and prediction results of this technique in details. Some of the remaining results will be used later to make comparisons. We used the KDD’99 data sets to evaluate the considered algorithms. For these evaluations, only the four attack categories’ case was considered. Our simulations show the efficiency of our approach, and show also that it is very competitive with some similar previous works.


Sign in / Sign up

Export Citation Format

Share Document