scholarly journals Automated Extraction of VTE Events From Narrative Radiology Reports in Electronic Health Records

Medical Care ◽  
2017 ◽  
Vol 55 (10) ◽  
pp. e73-e80 ◽  
Author(s):  
Zhe Tian ◽  
Simon Sun ◽  
Tewodros Eguale ◽  
Christian M. Rochefort
2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Irene Pérez-Díez ◽  
Raúl Pérez-Moraga ◽  
Adolfo López-Cerdán ◽  
Jose-Maria Salinas-Serrano ◽  
María de la Iglesia-Vayá

Abstract Background Medical texts such as radiology reports or electronic health records are a powerful source of data for researchers. Anonymization methods must be developed to de-identify documents containing personal information from both patients and medical staff. Although currently there are several anonymization strategies for the English language, they are also language-dependent. Here, we introduce a named entity recognition strategy for Spanish medical texts, translatable to other languages. Results We tested 4 neural networks on our radiology reports dataset, achieving a recall of 97.18% of the identifying entities. Alongside, we developed a randomization algorithm to substitute the detected entities with new ones from the same category, making it virtually impossible to differentiate real data from synthetic data. The three best architectures were tested with the MEDDOCAN challenge dataset of electronic health records as an external test, achieving a recall of 69.18%. Conclusions The strategy proposed, combining named entity recognition tasks with randomization of entities, is suitable for Spanish radiology reports. It does not require a big training corpus, thus it could be easily extended to other languages and medical texts, such as electronic health records.


2016 ◽  
Vol 15 (3) ◽  
pp. e353
Author(s):  
S-R. Leyh-Bannurah ◽  
P. Dell'Oglio ◽  
Z. Tian ◽  
M. Graefen ◽  
H. Huland ◽  
...  

Author(s):  
L Malin Overmars ◽  
Bram van Es ◽  
Floor Groepenhoff ◽  
Mark C H De Groot ◽  
Gerard Pasterkamp ◽  
...  

Abstract Introduction With the aging European population, the incidence of coronary artery disease (CAD) is expected to rise. This will likely result in an increased imaging use. Symptom recognition can be complicated, as symptoms caused by CAD can be atypical, particularly in women. Early CAD exclusion may help to optimize use of diagnostic resources and thus improve the sustainability of the healthcare system. Objective To develop sex-stratified algorithms, trained on routinely available electronic health records, raw electrocardiograms, and hematology data to exclude CAD in patients upfront. Methods We trained XGBoost algorithms on data from patients from the Utrecht Patient-Oriented Database, who underwent coronary computed tomography angiography (CCTA), and/or stress cardiac magnetic resonance (CMR) imaging or stress single-photon emission computerized tomography (SPECT) in the UMC Utrecht. Outcomes were extracted from radiology reports. We aimed to maximize negative predictive value (NPV) to minimize the false negative risk with acceptable specificity. Results Of 6,808 CCTA patients (31% female), 1029 females (48%) and 1908 males (45%) had no diagnosis of CAD. Of 3,053 CMR/SPECT patients (45% female), 650 females (47%) and 881 males (48%) had no diagnosis of CAD. On the train and test set, the CCTA models achieved NPVs and specificities of 0.95 and 0.19 (females) and 0.96 and 0.09 (males). The CMR/SPECT models achieved NPVs and specificities of 0.75 and 0.041 (females) and 0.92 and 0.026 (males). Conclusion CAD can be excluded from EHRs with high NPV. Our study demonstrates new possibilities to reduce unnecessary imaging in women and men suspected of CAD.


2018 ◽  
Vol 17 (2) ◽  
pp. e1209
Author(s):  
S.-R. Leyh-Bannurah ◽  
Z. Tian ◽  
P.I. Karakiewicz ◽  
U. Wolffgang ◽  
D. Pehrke ◽  
...  

2018 ◽  
Vol 199 (4S) ◽  
Author(s):  
Sami-Ramzi Leyh-Bannurah ◽  
Tian Zhe ◽  
Pierre Karakiewicz ◽  
Ulrich Wolffgang ◽  
Dirk Pehrke ◽  
...  

2020 ◽  
Author(s):  
Irene Pérez-Díez ◽  
Raúl Pérez-Moraga ◽  
Adolfo López-Cerdán ◽  
Jose-Maria Salinas-Serrano ◽  
María de la Iglesia-Vayá

Medical texts such as radiology reports or electronic health records are a powerful source of data for researchers. Anonymization methods must be developed to de-identify documents containing personal information from both patients and medical staff. Although currently there are several anonymization strategies for the English language, they are also language-dependent. Here, we introduce a named entity recognition strategy for Spanish medical texts, translatable to other languages. We tested 4 neural networks on our radiology reports dataset, achieving a recall of 97.18% of the identifying entities. Along-side, we developed a randomization algorithm to substitute the detected entities with new ones from the same category, making it virtually impossible to differentiate real data from synthetic data. The three best architectures were tested with the MEDDOCAN challenge dataset of electronic health records as an external test, achieving a recall of 69.18%. The strategy proposed, combining named entity recognition tasks with randomization of entities, is suitable for Spanish radiology reports. It does not require a big training corpus, thus it can be easily extended to other languages and medical texts, such as electronic health records.


Sign in / Sign up

Export Citation Format

Share Document