scholarly journals Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method

JMIRx Med ◽  
10.2196/27017 ◽  
2021 ◽  
Vol 2 (3) ◽  
pp. e27017 ◽  
Author(s):  
Roselie A Bright ◽  
Summer K Rankin ◽  
Katherine Dowdy ◽  
Sergey V Blok ◽  
Susan J Bright ◽  
...  

Background Big data tools provide opportunities to monitor adverse events (patient harm associated with medical care) (AEs) in the unstructured text of electronic health care records (EHRs). Writers may explicitly state an apparent association between treatment and adverse outcome (“attributed”) or state the simple treatment and outcome without an association (“unattributed”). Many methods for finding AEs in text rely on predefining possible AEs before searching for prespecified words and phrases or manual labeling (standardization) by investigators. We developed a method to identify possible AEs, even if unknown or unattributed, without any prespecifications or standardization of notes. Our method was inspired by word-frequency analysis methods used to uncover the true authorship of disputed works credited to William Shakespeare. We chose two use cases, “transfusion” and “time-based.” Transfusion was chosen because new transfusion AE types were becoming recognized during the study data period; therefore, we anticipated an opportunity to find unattributed potential AEs (PAEs) in the notes. With the time-based case, we wanted to simulate near real-time surveillance. We chose time periods in the hope of detecting PAEs due to contaminated heparin from mid-2007 to mid-2008 that were announced in early 2008. We hypothesized that the prevalence of contaminated heparin may have been widespread enough to manifest in EHRs through symptoms related to heparin AEs, independent of clinicians’ documentation of attributed AEs. Objective We aimed to develop a new method to identify attributed and unattributed PAEs using the unstructured text of EHRs. Methods We used EHRs for adult critical care admissions at a major teaching hospital (2001-2012). For each case, we formed a group of interest and a comparison group. We concatenated the text notes for each admission into one document sorted by date, and deleted replicate sentences and lists. We identified statistically significant words in the group of interest versus the comparison group. Documents in the group of interest were filtered to those words, followed by topic modeling on the filtered documents to produce topics. For each topic, the three documents with the maximum topic scores were manually reviewed to identify PAEs. Results Topics centered around medical conditions that were unique to or more common in the group of interest, including PAEs. In each use case, most PAEs were unattributed in the notes. Among the transfusion PAEs was unattributed evidence of transfusion-associated cardiac overload and transfusion-related acute lung injury. Some of the PAEs from mid-2007 to mid-2008 were increased unattributed events consistent with AEs related to heparin contamination. Conclusions The Shakespeare method could be a useful supplement to AE reporting and surveillance of structured EHR data. Future improvements should include automation of the manual review process.

2021 ◽  
Author(s):  
Roselie A Bright ◽  
Summer K Rankin ◽  
Katherine Dowdy ◽  
Sergey V Blok ◽  
Susan J Bright ◽  
...  

BACKGROUND Big data tools provide opportunities to monitor adverse events (patient harm associated with medical care) (AEs) in the unstructured text of electronic health care records (EHRs). Writers may explicitly state an apparent association between treatment and adverse outcome (“attributed”) or state the simple treatment and outcome without an association (“unattributed”). Many methods for finding AEs in text rely on predefining possible AEs before searching for prespecified words and phrases or manual labeling (standardization) by investigators. We developed a method to identify possible AEs, even if unknown or unattributed, without any prespecifications or standardization of notes. Our method was inspired by word-frequency analysis methods used to uncover the true authorship of disputed works credited to William Shakespeare. We chose two use cases, “transfusion” and “time-based.” Transfusion was chosen because new transfusion AE types were becoming recognized during the study data period; therefore, we anticipated an opportunity to find unattributed potential AEs (PAEs) in the notes. With the time-based case, we wanted to simulate near real-time surveillance. We chose time periods in the hope of detecting PAEs due to contaminated heparin from mid-2007 to mid-2008 that were announced in early 2008. We hypothesized that the prevalence of contaminated heparin may have been widespread enough to manifest in EHRs through symptoms related to heparin AEs, independent of clinicians’ documentation of attributed AEs. OBJECTIVE We aimed to develop a new method to identify attributed and unattributed PAEs using the unstructured text of EHRs. METHODS We used EHRs for adult critical care admissions at a major teaching hospital (2001-2012). For each case, we formed a group of interest and a comparison group. We concatenated the text notes for each admission into one document sorted by date, and deleted replicate sentences and lists. We identified statistically significant words in the group of interest versus the comparison group. Documents in the group of interest were filtered to those words, followed by topic modeling on the filtered documents to produce topics. For each topic, the three documents with the maximum topic scores were manually reviewed to identify PAEs. RESULTS Topics centered around medical conditions that were unique to or more common in the group of interest, including PAEs. In each use case, most PAEs were unattributed in the notes. Among the transfusion PAEs was unattributed evidence of transfusion-associated cardiac overload and transfusion-related acute lung injury. Some of the PAEs from mid-2007 to mid-2008 were increased unattributed events consistent with AEs related to heparin contamination. CONCLUSIONS The Shakespeare method could be a useful supplement to AE reporting and surveillance of structured EHR data. Future improvements should include automation of the manual review process.


Author(s):  
Roselie A. Bright ◽  
Susan J. Bright-Ponte ◽  
Lee Anne Palmer ◽  
Summer K. Rankin ◽  
Sergey Blok

ABSTRACTBackgroundElectronic health records (EHRs) and big data tools offer the opportunity for surveillance of adverse events (patient harm associated with medical care). We chose the case of transfusion adverse events (TAEs) and potential TAEs (PTAEs) because 1.) real dates were obscured in the study data, and 2.) there was emerging recognition of new types during the study data period.ObjectiveWe aimed to use the structured data in electronic health records (EHRs) to find TAEs and PTAEs among adults.MethodsWe used 49,331 adult admissions involving critical care at a major teaching hospital, 2001-2012, in the MIMIC-III EHRs database. We formed a T (defined as packed red blood cells, platelets, or plasma) group of 21,443 admissions vs. 25,468 comparison (C) admissions. The ICD-9-CM diagnosis codes were compared for T vs. C, described, and tested with statistical tools.ResultsTAEs such as transfusion associated circulatory overload (TACO; 12 T cases; rate ratio (RR) 15.61; 95% CI 2.49 to 98) were found. There were also PTAEs similar to TAEs, such as fluid overload disorder (361 T admissions; RR 2.24; 95% CI 1.88 to 2.65), similar to TACO. Some diagnoses could have been sequelae of TAEs, including nontraumatic compartment syndrome of abdomen (52 T cases; RR 6.76; 95% CI 3.40 to 14.9) possibly being a consequence of TACO.ConclusionsSurveillance for diagnosis codes that could be TAE sequelae or unrecognized TAE might be useful supplements to existing medical product adverse event programs.


JMIRx Med ◽  
10.2196/31568 ◽  
2021 ◽  
Vol 2 (3) ◽  
pp. e31568
Author(s):  
Roselie A Bright ◽  
Summer K Rankin ◽  
Katherine Dowdy ◽  
Sergey V Blok ◽  
Susan J Bright ◽  
...  


Sign in / Sign up

Export Citation Format

Share Document