scholarly journals Automated Electronic Health Record–Based Tool for Identification of Patients With Metastatic Disease to Facilitate Clinical Trial Patient Ascertainment

2021 ◽  
pp. 719-727
Author(s):  
Jeffrey Kirshner ◽  
Kelly Cohn ◽  
Steven Dunder ◽  
Karri Donahue ◽  
Madeline Richey ◽  
...  

PURPOSE To facilitate identification of clinical trial participation candidates, we developed a machine learning tool that automates the determination of a patient's metastatic status, on the basis of unstructured electronic health record (EHR) data. METHODS This tool scans EHR documents, extracting text snippet features surrounding key words (such as metastatic, progression, and local). A regularized logistic regression model was trained and used to classify patients across five metastatic categories: highly likely and likely positive, highly likely and likely negative, and unknown. Using a real-world oncology database of patients with solid tumors with manually abstracted information as reference, we calculated sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV). We validated the performance in a real-world data set, evaluating accuracy gains upon additional user review of tool's outputs after integration into clinic workflows. RESULTS In the training data set (N = 66,532), the model sensitivity and specificity (% [95% CI]) were 82.4 [81.9 to 83.0] and 95.5 [95.3 to 96.7], respectively; the PPV was 89.3 [88.8 to 90.0], and the NPV was 94.0 [93.8 to 94.2]. In the validation sample (n = 200 from five distinct care sites), after user review of model outputs, values increased to 97.1 [85.1 to 99.9] for sensitivity, 98.2 [94.8 to 99.6] for specificity, 91.9 [78.1 to 98.3] for PPV, and 99.4 [96.6 to 100.0] for NPV. The model assigned 163 of 200 patients to the highly likely categories. The error prevalence was 4% before and 2% after user review. CONCLUSION This tool infers metastatic status from unstructured EHR data with high accuracy and high confidence in more than 75% of cases, without requiring additional manual review. By enabling efficient characterization of metastatic status, this tool could mitigate a key barrier for patient ascertainment and clinical trial participation in community clinics.

2019 ◽  
Vol 37 (15_suppl) ◽  
pp. e17112-e17112
Author(s):  
Debra E. Irwin ◽  
Ellen Thiel

e17112 Background: For endometrial cancer (EC), laparoscopic hysterectomy (LH) is an effective, minimally invasive surgical treatment; however, this approach may not be recommended for obese patients due to increased risk for complications. Methods: This retrospective study utilized insurance claims linked to electronic health record (EHR) data contained in the IBM MarketScan Explorys Claims-EHR Data Set. Newly diagnosed EC patients (1/1/2007 - 6/30/2017) with continuous enrollment during a 12-month baseline and 6-month follow-up period were selected. Patients were stratified into four BMI subgroups based on baseline BMI on the EHR: normal or underweight (BMI < 25), overweight (BMI 25- < 30), obese (BMI 30- < 40), morbidly obese (BMI > 40), and were required to have had a hysterectomy within the follow-up period. Emergency room visits and rehospitalization within 30 days of hysterectomy were measured. Results: A total of 1,090 newly-diagnosed EC patients met the selection criteria, of whom, 16% were normal/underweight, 19% were overweight, 39% were obese, and 26% were morbidly obese. The proportion of patients receiving LH increased as BMI category increased (Table 1). Among those with LH between 6% and 15% had an ER visit or rehospitalization in 30 days, and rates were higher among other hysterectomy modalities. Conclusions: This real-world analysis shows that LH is utilized in a high proportion of morbidly obese EC patients, despite that it is frequently deemed infeasible in this patient population. Although the rate of ER visits and rehospitalization is lower among LH patients than those undergoing traditional hysterectomy across all BMI strata, further research is needed to determine the optimal patient population to receive LH.[Table: see text]


2021 ◽  
Vol 21 ◽  
pp. 100692
Author(s):  
Niina Laaksonen ◽  
Juha-Matti Varjonen ◽  
Minna Blomster ◽  
Antti Palomäki ◽  
Tuija Vasankari ◽  
...  

2020 ◽  
Vol 41 (S1) ◽  
pp. s39-s39
Author(s):  
Pontus Naucler ◽  
Suzanne D. van der Werff ◽  
John Valik ◽  
Logan Ward ◽  
Anders Ternhag ◽  
...  

Background: Healthcare-associated infection (HAI) surveillance is essential for most infection prevention programs and continuous epidemiological data can be used to inform healthcare personal, allocate resources, and evaluate interventions to prevent HAIs. Many HAI surveillance systems today are based on time-consuming and resource-intensive manual reviews of patient records. The objective of HAI-proactive, a Swedish triple-helix innovation project, is to develop and implement a fully automated HAI surveillance system based on electronic health record data. Furthermore, the project aims to develop machine-learning–based screening algorithms for early prediction of HAI at the individual patient level. Methods: The project is performed with support from Sweden’s Innovation Agency in collaboration among academic, health, and industry partners. Development of rule-based and machine-learning algorithms is performed within a research database, which consists of all electronic health record data from patients admitted to the Karolinska University Hospital. Natural language processing is used for processing free-text medical notes. To validate algorithm performance, manual annotation was performed based on international HAI definitions from the European Center for Disease Prevention and Control, Centers for Disease Control and Prevention, and Sepsis-3 criteria. Currently, the project is building a platform for real-time data access to implement the algorithms within Region Stockholm. Results: The project has developed a rule-based surveillance algorithm for sepsis that continuously monitors patients admitted to the hospital, with a sensitivity of 0.89 (95% CI, 0.85–0.93), a specificity of 0.99 (0.98–0.99), a positive predictive value of 0.88 (0.83–0.93), and a negative predictive value of 0.99 (0.98–0.99). The healthcare-associated urinary tract infection surveillance algorithm, which is based on free-text analysis and negations to define symptoms, had a sensitivity of 0.73 (0.66–0.80) and a positive predictive value of 0.68 (0.61–0.75). The sensitivity and positive predictive value of an algorithm based on significant bacterial growth in urine culture only was 0.99 (0.97–1.00) and 0.39 (0.34–0.44), respectively. The surveillance system detected differences in incidences between hospital wards and over time. Development of surveillance algorithms for pneumonia, catheter-related infections and Clostridioides difficile infections, as well as machine-learning–based models for early prediction, is ongoing. We intend to present results from all algorithms. Conclusions: With access to electronic health record data, we have shown that it is feasible to develop a fully automated HAI surveillance system based on algorithms using both structured data and free text for the main healthcare-associated infections.Funding: Sweden’s Innovation Agency and Stockholm County CouncilDisclosures: None


2016 ◽  
Vol 120 ◽  
pp. S118-S119
Author(s):  
Patrick Lefebvre ◽  
Wing Chow ◽  
Dominic Pilon ◽  
Bruno Emond ◽  
Marie-Hélène Lafeuille ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document