scholarly journals Early Prediction of High Risk Gestational Diabetes Mellitus via Machine Learning Models

2020 ◽  
Author(s):  
Yan-Ting Wu ◽  
Chen-Jie Zhang ◽  
Ben Willem Mol ◽  
Cheng Li ◽  
Lei Chen ◽  
...  
2019 ◽  
Vol 181 (5) ◽  
pp. 565-577 ◽  
Author(s):  
Liron Yoffe ◽  
Avital Polsky ◽  
Avital Gilam ◽  
Chen Raff ◽  
Federico Mecacci ◽  
...  

Design Gestational diabetes mellitus (GDM) is one of the most common pregnancy complications and its prevalence is constantly rising worldwide. Diagnosis is commonly in the late second or early third trimester of pregnancy, though the development of GDM starts early; hence, first-trimester diagnosis is feasible. Objective Our objective was to identify microRNAs that best distinguish GDM samples from those of healthy pregnant women and to evaluate the predictive value of microRNAs for GDM detection in the first trimester. Methods We investigated the abundance of circulating microRNAs in the plasma of pregnant women in their first trimester. Two populations were included in the study to enable population-specific as well as cross-population inspection of expression profiles. Each microRNA was tested for differential expression in GDM vs control samples, and their efficiency for GDM detection was evaluated using machine-learning models. Results Two upregulated microRNAs (miR-223 and miR-23a) were identified in GDM vs the control set, and validated on a new cohort of women. Using both microRNAs in a logistic-regression model, we achieved an AUC value of 0.91. We further demonstrated the overall predictive value of microRNAs using several types of multivariable machine-learning models that included the entire set of expressed microRNAs. All models achieved high accuracy when applied on the dataset (mean AUC = 0.77). The significance of the classification results was established via permutation tests. Conclusions Our findings suggest that circulating microRNAs are potential biomarkers for GDM in the first trimester. This warrants further examination and lays the foundation for producing a novel early non-invasive diagnostic tool for GDM.


2020 ◽  
Author(s):  
Yan-Ting Wu ◽  
Chen-Jie Zhang ◽  
Ben Willem Mol ◽  
Cheng Li ◽  
Lei Chen ◽  
...  

AbstractAimsGestational diabetes mellitus (GDM) is a pregnancy-specific disorder that can usually be diagnosed after 24 gestational weeks. So far, there is no accurate method to predict GDM in early pregnancy.MethodsWe collected data extracted from the hospital’s electronic medical record system included 73 features in the first trimester. We also recorded the occurrence of GDM, diagnosed at 24-28 weeks of pregnancy. We conducted a feature selection method to select a panel of most discriminative features. We then developed advanced machine learning models, using Deep Neural Network (DNN), Support Vector Machine (SVM), K-Nearest Neighboring (KNN), and Logistic Regression (LR), based on these features.ResultsWe studied 16,819 women (2,696 GDM) and 14,992 women (1,837 GDM) for the training and validation group. DNN, SVM, KNN, and LR models based on the 73-feature set demonstrated the best discriminative power with corresponding area under the curve (AUC) values of 0.92 (95%CI 0.91, 0.93), 0.82 (95%CI 0.81, 0.83), 0.63 (95%CI 0.62, 0.64), and 0.85 (95%CI 0.84, 0.85), respectively. The 7-feature (selected from the 73-feature set) DNN, SVM, KNN, and LR models had the best discriminative power with corresponding AUCs of 0.84 (95%CI 0.83, 0.84), 0.69 (95%CI 0.68, 0.70), 0.68 (95%CI 0.67, 0.69), and 0.84 (95% CI 0.83, 0.85), respectively. The 7-feature LR model had the best Hosmer-Lemeshow test outcome. Notably, the AUCs of the existing prediction models did not exceed 0.75.ConclusionsOur feature selection and machine learning models showed superior predictive power in early GDM detection than previous methods; these improved models will better serve clinical practices in preventing GDM.Research in Context sectionEvidence before this studyA hysteretic diagnosis of GDM in the 3rd trimester is too late to prevent exposure of the embryos or fetuses to an intrauterine hyperglycemia environment during early pregnancy.Prediction models for gestational diabetes are not uncommon in previous literature reports, but laboratory indicators are rarely involved in predictive indicators.The penetration of AI into the medical field makes us want to introduce it into GDM predictive models.What is the key question?Whether the GDM prediction model established by machine learning has the ability to surpass the traditional LR model?Added value of this studyUsing machine learning to select features is an effective method.DNN prediction model have effective discrimination power for predicting GDM in early pregnancy, but it cannot completely replace LR. KNN and SVM are even worse than LR in this study.Implications of all the available evidenceThe biggest significance of our research is not only to build a prediction model that surpasses previous ones, but also to demonstrate the advantages and disadvantages of different machine learning methods through a practical case.


2021 ◽  
Vol 39 (28_suppl) ◽  
pp. 330-330
Author(s):  
Teja Ganta ◽  
Stephanie Lehrman ◽  
Rachel Pappalardo ◽  
Madalene Crow ◽  
Meagan Will ◽  
...  

330 Background: Machine learning models are well-positioned to transform cancer care delivery by providing oncologists with more accurate or accessible information to augment clinical decisions. Many machine learning projects, however, focus on model accuracy without considering the impact of using the model in real-world settings and rarely carry forward to clinical implementation. We present a human-centered systems engineering approach to address clinical problems with workflow interventions utilizing machine learning algorithms. Methods: We aimed to develop a mortality predictive tool, using a Random Forest algorithm, to identify oncology patients at high risk of death within 30 days to move advance care planning (ACP) discussions earlier in the illness trajectory. First, a project sponsor defined the clinical need and requirements of an intervention. The data scientists developed the predictive algorithm using data available in the electronic health record (EHR). A multidisciplinary workgroup was assembled including oncology physicians, advanced practice providers, nurses, social workers, chaplain, clinical informaticists, and data scientists. Meeting bi-monthly, the group utilized human-centered design (HCD) methods to understand clinical workflows and identify points of intervention. The workgroup completed a workflow redesign workshop, a 90-minute facilitated group discussion, to integrate the model in a future state workflow. An EHR (Epic) analyst built the user interface to support the intervention per the group’s requirements. The workflow was piloted in thoracic oncology and bone marrow transplant with plans to scale to other cancer clinics. Results: Our predictive model performance on test data was acceptable (sensitivity 75%, specificity 75%, F-1 score 0.71, AUC 0.82). The workgroup identified a “quality of life coordinator” who: reviews an EHR report of patients scheduled in the upcoming 7 days who have a high risk of 30-day mortality; works with the oncology team to determine ACP clinical appropriateness; documents the need for ACP; identifies potential referrals to supportive oncology, social work, or chaplain; and coordinates the oncology appointment. The oncologist receives a reminder on the day of the patient’s scheduled visit. Conclusions: This workgroup is a viable approach that can be replicated at institutions to address clinical needs and realize the full potential of machine learning models in healthcare. The next steps for this project are to address end-user feedback from the pilot, expand the intervention to other cancer disease groups, and track clinical metrics.


2021 ◽  
Vol 12 (02) ◽  
pp. 372-382
Author(s):  
Christine Xia Wu ◽  
Ernest Suresh ◽  
Francis Wei Loong Phng ◽  
Kai Pik Tai ◽  
Janthorn Pakdeethai ◽  
...  

Abstract Objective To develop a risk score for the real-time prediction of readmissions for patients using patient specific information captured in electronic medical records (EMR) in Singapore to enable the prospective identification of high-risk patients for enrolment in timely interventions. Methods Machine-learning models were built to estimate the probability of a patient being readmitted within 30 days of discharge. EMR of 25,472 patients discharged from the medicine department at Ng Teng Fong General Hospital between January 2016 and December 2016 were extracted retrospectively for training and internal validation of the models. We developed and implemented a real-time 30-day readmission risk score generation in the EMR system, which enabled the flagging of high-risk patients to care providers in the hospital. Based on the daily high-risk patient list, the various interfaces and flow sheets in the EMR were configured according to the information needs of the various stakeholders such as the inpatient medical, nursing, case management, emergency department, and postdischarge care teams. Results Overall, the machine-learning models achieved good performance with area under the receiver operating characteristic ranging from 0.77 to 0.81. The models were used to proactively identify and attend to patients who are at risk of readmission before an actual readmission occurs. This approach successfully reduced the 30-day readmission rate for patients admitted to the medicine department from 11.7% in 2017 to 10.1% in 2019 (p < 0.01) after risk adjustment. Conclusion Machine-learning models can be deployed in the EMR system to provide real-time forecasts for a more comprehensive outlook in the aspects of decision-making and care provision.


2018 ◽  
Vol 19 (11) ◽  
pp. 3696 ◽  
Author(s):  
Anna Pleskacova ◽  
Vendula Bartakova ◽  
Katarina Chalasova ◽  
Lukas Pacal ◽  
Katerina Kankova ◽  
...  

Uric acid (UA) levels are associated with many diseases including those related to lifestyle. The aim of this study was to evaluate the influence of clinical and anthropometric parameters on UA and xanthine (X) levels during pregnancy and postpartum in women with physiological pregnancy and pregnancy complicated by gestational diabetes mellitus (GDM), and to evaluate their impact on adverse perinatal outcomes. A total of 143 participants were included. Analyte levels were determined by HPLC with ultraviolet detection (HPLC-UV). Several single-nucleotide polymorphisms (SNPs) in UA transporters were genotyped using commercial assays. UA levels were higher within GDM women with pre-gestational obesity, those in high-risk groups, and those who required insulin during pregnancy. X levels were higher in the GDM group during pregnancy and also postpartum. Positive correlations between UA and X levels with body mass index (BMI) and glycemia levels were found. Gestational age at delivery was negatively correlated with UA and X levels postpartum. Postpartum X levels were significantly higher in women who underwent caesarean sections. Our data support a possible link between increased UA levels and a high-risk GDM subtype. UA levels were higher among women whose glucose tolerance was severely disturbed. Mid-gestational UA and X levels were not linked to adverse perinatal outcomes.


Sign in / Sign up

Export Citation Format

Share Document