scholarly journals Using electronic medical records analysis to investigate the effectiveness of lifestyle programs in real-world primary care is challenging: a case study in diabetes mellitus

2012 ◽  
Vol 65 (7) ◽  
pp. 785-792 ◽  
Author(s):  
Joris J. Linmans ◽  
Wolfgang Viechtbauer ◽  
Tjarco Koppenaal ◽  
Mark Spigt ◽  
J. André Knottnerus
Diabetes ◽  
2020 ◽  
Vol 69 (Supplement 1) ◽  
pp. 908-P
Author(s):  
SOSTENES MISTRO ◽  
THALITA V.O. AGUIAR ◽  
VANESSA V. CERQUEIRA ◽  
KELLE O. SILVA ◽  
JOSÉ A. LOUZADO ◽  
...  

2013 ◽  
Vol 2 (3) ◽  
pp. 198-212 ◽  
Author(s):  
Aviv Shachak ◽  
Catherine Montgomery ◽  
Rustam Dow ◽  
Jan Barnsley ◽  
Karen Tu ◽  
...  

2017 ◽  
Vol 10 (1) ◽  
pp. 16-27 ◽  
Author(s):  
Ebenezer S. Owusu Adjah ◽  
Olga Montvida ◽  
Julius Agbeve ◽  
Sanjoy K. Paul

Background:Identification of diseased patients from primary care based electronic medical records (EMRs) has methodological challenges that may impact epidemiologic inferences.Objective:To compare deterministic clinically guided selection algorithms with probabilistic machine learning (ML) methodologies for their ability to identify patients with type 2 diabetes mellitus (T2DM) from large population based EMRs from nationally representative primary care database.Methods:Four cohorts of patients with T2DM were defined by deterministic approach based on disease codes. The database was mined for a set of best predictors of T2DM and the performance of six ML algorithms were compared based on cross-validated true positive rate, true negative rate, and area under receiver operating characteristic curve.Results:In the database of 11,018,025 research suitable individuals, 379 657 (3.4%) were coded to have T2DM. Logistic Regression classifier was selected as best ML algorithm and resulted in a cohort of 383,330 patients with potential T2DM. Eighty-three percent (83%) of this cohort had a T2DM code, and 16% of the patients with T2DM code were not included in this ML cohort. Of those in the ML cohort without disease code, 52% had at least one measure of elevated glucose level and 22% had received at least one prescription for antidiabetic medication.Conclusion:Deterministic cohort selection based on disease coding potentially introduces significant mis-classification problem. ML techniques allow testing for potential disease predictors, and under meaningful data input, are able to identify diseased cohorts in a holistic way.


Sign in / Sign up

Export Citation Format

Share Document