Using Electronic Health Record Data to Rapidly Identify Children with Glomerular Disease for Clinical Research

BackgroundThe rarity of pediatric glomerular disease makes it difficult to identify sufficient numbers of participants for clinical trials. This leaves limited data to guide improvements in care for these patients.MethodsThe authors developed and tested an electronic health record (EHR) algorithm to identify children with glomerular disease. We used EHR data from 231 patients with glomerular disorders at a single center to develop a computerized algorithm comprising diagnosis, kidney biopsy, and transplant procedure codes. The algorithm was tested using PEDSnet, a national network of eight children’s hospitals with data on >6.5 million children. Patients with three or more nephrologist encounters (n=55,560) not meeting the computable phenotype definition of glomerular disease were defined as nonglomerular cases. A reviewer blinded to case status used a standardized form to review random samples of cases (n=800) and nonglomerular cases (n=798).ResultsThe final algorithm consisted of two or more diagnosis codes from a qualifying list or one diagnosis code and a pretransplant biopsy. Performance characteristics among the population with three or more nephrology encounters were sensitivity, 96% (95% CI, 94% to 97%); specificity, 93% (95% CI, 91% to 94%); positive predictive value (PPV), 89% (95% CI, 86% to 91%); negative predictive value, 97% (95% CI, 96% to 98%); and area under the receiver operating characteristics curve, 94% (95% CI, 93% to 95%). Requiring that the sum of nephrotic syndrome diagnosis codes exceed that of glomerulonephritis codes identified children with nephrotic syndrome or biopsy-based minimal change nephropathy, FSGS, or membranous nephropathy, with 94% sensitivity and 92% PPV. The algorithm identified 6657 children with glomerular disease across PEDSnet, ≥50% of whom were seen within 18 months.ConclusionsThe authors developed an EHR-based algorithm and demonstrated that it had excellent classification accuracy across PEDSnet. This tool may enable faster identification of cohorts of pediatric patients with glomerular disease for observational or prospective studies.

Download Full-text

HAI-Proactive: Development of an Automated Surveillance System for Healthcare-Associated Infections in Sweden

Infection Control and Hospital Epidemiology ◽

10.1017/ice.2020.519 ◽

2020 ◽

Vol 41 (S1) ◽

pp. s39-s39

Author(s):

Pontus Naucler ◽

Suzanne D. van der Werff ◽

John Valik ◽

Logan Ward ◽

Anders Ternhag ◽

...

Keyword(s):

Positive Predictive Value ◽

Electronic Health Record ◽

Predictive Value ◽

Surveillance System ◽

Free Text ◽

Health Record ◽

Electronic Health Record Data ◽

Record Data ◽

Electronic Health ◽

Healthcare Associated

Background: Healthcare-associated infection (HAI) surveillance is essential for most infection prevention programs and continuous epidemiological data can be used to inform healthcare personal, allocate resources, and evaluate interventions to prevent HAIs. Many HAI surveillance systems today are based on time-consuming and resource-intensive manual reviews of patient records. The objective of HAI-proactive, a Swedish triple-helix innovation project, is to develop and implement a fully automated HAI surveillance system based on electronic health record data. Furthermore, the project aims to develop machine-learning–based screening algorithms for early prediction of HAI at the individual patient level. Methods: The project is performed with support from Sweden’s Innovation Agency in collaboration among academic, health, and industry partners. Development of rule-based and machine-learning algorithms is performed within a research database, which consists of all electronic health record data from patients admitted to the Karolinska University Hospital. Natural language processing is used for processing free-text medical notes. To validate algorithm performance, manual annotation was performed based on international HAI definitions from the European Center for Disease Prevention and Control, Centers for Disease Control and Prevention, and Sepsis-3 criteria. Currently, the project is building a platform for real-time data access to implement the algorithms within Region Stockholm. Results: The project has developed a rule-based surveillance algorithm for sepsis that continuously monitors patients admitted to the hospital, with a sensitivity of 0.89 (95% CI, 0.85–0.93), a specificity of 0.99 (0.98–0.99), a positive predictive value of 0.88 (0.83–0.93), and a negative predictive value of 0.99 (0.98–0.99). The healthcare-associated urinary tract infection surveillance algorithm, which is based on free-text analysis and negations to define symptoms, had a sensitivity of 0.73 (0.66–0.80) and a positive predictive value of 0.68 (0.61–0.75). The sensitivity and positive predictive value of an algorithm based on significant bacterial growth in urine culture only was 0.99 (0.97–1.00) and 0.39 (0.34–0.44), respectively. The surveillance system detected differences in incidences between hospital wards and over time. Development of surveillance algorithms for pneumonia, catheter-related infections and Clostridioides difficile infections, as well as machine-learning–based models for early prediction, is ongoing. We intend to present results from all algorithms. Conclusions: With access to electronic health record data, we have shown that it is feasible to develop a fully automated HAI surveillance system based on algorithms using both structured data and free text for the main healthcare-associated infections.Funding: Sweden’s Innovation Agency and Stockholm County CouncilDisclosures: None

Download Full-text

Electronic Health Record Data Quality Issues Are Not Remedied by Increasing Granularity of Diagnosis Codes

JAMA Cardiology ◽

10.1001/jamacardio.2019.0830 ◽

2019 ◽

Vol 4 (5) ◽

pp. 465 ◽

Cited By ~ 2

Author(s):

Ann Marie Navar

Keyword(s):

Electronic Health Record ◽

Data Quality ◽

Health Record ◽

Electronic Health Record Data ◽

Diagnosis Codes ◽

Record Data ◽

Quality Issues ◽

Electronic Health

Download Full-text

A Phenotyping Algorithm to Identify People With HIV in Electronic Health Record Data (HIV-Phen): Development and Evaluation Study

JMIR Formative Research ◽

10.2196/28620 ◽

2021 ◽

Vol 5 (11) ◽

pp. e28620

Author(s):

Sarah B May ◽

Thomas P Giordano ◽

Assaf Gottlieb

Keyword(s):

Electronic Health Record ◽

Predictive Value ◽

Laboratory Tests ◽

Significant Proportion ◽

Surveillance Data ◽

Health Record ◽

Data Sets ◽

Local Data ◽

Electronic Health Record Data ◽

Electronic Health

Background Identification of people with HIV from electronic health record (EHR) data is an essential first step in the study of important HIV outcomes, such as risk assessment. This task has been historically performed via manual chart review, but the increased availability of large clinical data sets has led to the emergence of phenotyping algorithms to automate this process. Existing algorithms for identifying people with HIV rely on a combination of International Classification of Disease codes and laboratory tests or closely mimic clinical testing guidelines for HIV diagnosis. However, we found that existing algorithms in the literature missed a significant proportion of people with HIV in our data. Objective The aim of this study is to develop and evaluate HIV-Phen, an updated criteria-based HIV phenotyping algorithm. Methods We developed an algorithm using HIV-specific laboratory tests and medications and compared it with previously published algorithms in national and local data sets to identify cohorts of people with HIV. Cohort demographics were compared with those reported in the national and local surveillance data. Chart reviews were performed on a subsample of patients from the local database to calculate the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of the algorithm. Results Our new algorithm identified substantially more people with HIV in both national (up to an 85.75% increase) and local (up to an 83.20% increase) EHR databases than the previously published algorithms. The demographic characteristics of people with HIV identified using our algorithm were similar to those reported in national and local HIV surveillance data. Our algorithm demonstrated improved sensitivity over existing algorithms (98% vs 56%-92%) while maintaining a similar overall accuracy (96% vs 80%-96%). Conclusions We developed and evaluated an updated criteria-based phenotyping algorithm for identifying people with HIV in EHR data that demonstrates improved sensitivity over existing algorithms.

Download Full-text

A Phenotyping Algorithm to Identify People With HIV in Electronic Health Record Data (HIV-Phen): Development and Evaluation Study (Preprint)

10.2196/preprints.28620 ◽

2021 ◽

Author(s):

Sarah B May ◽

Thomas P Giordano ◽

Assaf Gottlieb

Keyword(s):

Electronic Health Record ◽

Predictive Value ◽

Laboratory Tests ◽

Significant Proportion ◽

Surveillance Data ◽

Health Record ◽

Data Sets ◽

Local Data ◽

Electronic Health Record Data ◽

Electronic Health

BACKGROUND Identification of people with HIV from electronic health record (EHR) data is an essential first step in the study of important HIV outcomes, such as risk assessment. This task has been historically performed via manual chart review, but the increased availability of large clinical data sets has led to the emergence of phenotyping algorithms to automate this process. Existing algorithms for identifying people with HIV rely on a combination of International Classification of Disease codes and laboratory tests or closely mimic clinical testing guidelines for HIV diagnosis. However, we found that existing algorithms in the literature missed a significant proportion of people with HIV in our data. OBJECTIVE The aim of this study is to develop and evaluate HIV-Phen, an updated criteria-based HIV phenotyping algorithm. METHODS We developed an algorithm using HIV-specific laboratory tests and medications and compared it with previously published algorithms in national and local data sets to identify cohorts of people with HIV. Cohort demographics were compared with those reported in the national and local surveillance data. Chart reviews were performed on a subsample of patients from the local database to calculate the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of the algorithm. RESULTS Our new algorithm identified substantially more people with HIV in both national (up to an 85.75% increase) and local (up to an 83.20% increase) EHR databases than the previously published algorithms. The demographic characteristics of people with HIV identified using our algorithm were similar to those reported in national and local HIV surveillance data. Our algorithm demonstrated improved sensitivity over existing algorithms (98% vs 56%-92%) while maintaining a similar overall accuracy (96% vs 80%-96%). CONCLUSIONS We developed and evaluated an updated criteria-based phenotyping algorithm for identifying people with HIV in EHR data that demonstrates improved sensitivity over existing algorithms.

Download Full-text

Automated chronic disease surveillance and visualization using electronic health record data

Emerging Health Threats Journal ◽

10.3402/ehtj.v4i0.11102 ◽

2011 ◽

Vol 4 (0) ◽

Author(s):

Michael Klompas ◽

Chaim Kirby ◽

Jason McVetta ◽

Paul Oppedisano ◽

John Brownstein ◽

...

Keyword(s):

Chronic Disease ◽

Electronic Health Record ◽

Disease Surveillance ◽

Health Record ◽

Electronic Health Record Data ◽

Record Data ◽

Electronic Health

Download Full-text

Faculty Opinions recommendation of Evaluating delivery of low tidal volume ventilation in six icus using electronic health record data.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.734212415.793572533 ◽

2020 ◽

Author(s):

Jeremy Beitler

Keyword(s):

Electronic Health Record ◽

Tidal Volume ◽

Health Record ◽

Electronic Health Record Data ◽

Low Tidal Volume ◽

Low Tidal Volume Ventilation ◽

Volume Ventilation ◽

Record Data ◽

Electronic Health

Download Full-text

Clinical Comparison Between Trial Participants and Potentially Eligible Patients Using Electronic Health Record Data: A Generalizability Assessment Method

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2021.103822 ◽

2021 ◽

pp. 103822

Author(s):

James R. Rogers ◽

George Hripcsak ◽

Ying Kuen Cheung ◽

Chunhua Weng

Keyword(s):

Electronic Health Record ◽

Assessment Method ◽

Health Record ◽

Electronic Health Record Data ◽

Clinical Comparison ◽

Record Data ◽

Electronic Health ◽

Trial Participants

Download Full-text

Leveraging electronic health record data to inform hospital resource management

Health Care Management Science ◽

10.1007/s10729-021-09554-4 ◽

2021 ◽

Author(s):

José Carlos Ferrão ◽

Mónica Duarte Oliveira ◽

Daniel Gartner ◽

Filipe Janela ◽

Henrique M. G. Martins

Keyword(s):

Resource Management ◽

Electronic Health Record ◽

Health Record ◽

Hospital Resource ◽

Electronic Health Record Data ◽

Record Data ◽

Electronic Health

Download Full-text

Predicting baby feeding method from unstructured electronic health record data

Proceedings of the ACM sixth international workshop on Data and text mining in biomedical informatics - DTMBIO '12 ◽

10.1145/2390068.2390075 ◽

2012 ◽

Cited By ~ 1

Author(s):

Ashwani Rao ◽

Kristin Maiden ◽

Ben Carterette ◽

Deb Ehrenthal

Keyword(s):

Electronic Health Record ◽

Health Record ◽

Electronic Health Record Data ◽

Feeding Method ◽

Record Data ◽

Electronic Health

Download Full-text

Validation of an Internationally Derived Patient Severity Phenotype to Support COVID-19 Analytics from Electronic Health Record Data

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocab018 ◽

2021 ◽

Author(s):

Jeffrey G Klann ◽

Griffin M Weber ◽

Hossein Estiri ◽

Bertrand Moal ◽

Paul Avillach ◽

...

Keyword(s):

Machine Learning ◽

Electronic Health Record ◽

Chart Review ◽

Learning Approach ◽

Health Record ◽

Learning Approaches ◽

Electronic Health Record Data ◽

Icu Admission ◽

Machine Learning Approach ◽

Electronic Health

Abstract Introduction The Consortium for Clinical Characterization of COVID-19 by EHR (4CE) is an international collaboration addressing COVID-19 with federated analyses of electronic health record (EHR) data. Objective We sought to develop and validate a computable phenotype for COVID-19 severity. Methods Twelve 4CE sites participated. First we developed an EHR-based severity phenotype consisting of six code classes, and we validated it on patient hospitalization data from the 12 4CE clinical sites against the outcomes of ICU admission and/or death. We also piloted an alternative machine-learning approach and compared selected predictors of severity to the 4CE phenotype at one site. Results The full 4CE severity phenotype had pooled sensitivity of 0.73 and specificity 0.83 for the combined outcome of ICU admission and/or death. The sensitivity of individual code categories for acuity had high variability - up to 0.65 across sites. At one pilot site, the expert-derived phenotype had mean AUC 0.903 (95% CI: 0.886, 0.921), compared to AUC 0.956 (95% CI: 0.952, 0.959) for the machine-learning approach. Billing codes were poor proxies of ICU admission, with as low as 49% precision and recall compared to chart review. Discussion We developed a severity phenotype using 6 code classes that proved resilient to coding variability across international institutions. In contrast, machine-learning approaches may overfit hospital-specific orders. Manual chart review revealed discrepancies even in the gold-standard outcomes, possibly due to heterogeneous pandemic conditions. Conclusion We developed an EHR-based severity phenotype for COVID-19 in hospitalized patients and validated it at 12 international sites.

Download Full-text