Prediction of first cardiovascular disease event in 2.9 million individuals using Danish administrative healthcare data: a nationwide, registry-based derivation and validation study
Abstract Aims To derive and validate a risk prediction model with nationwide coverage to predict individual and population-level risk of cardiovascular disease (CVD). Methods and Results All 2.98 million Danish residents aged 30-85 years free of CVD were included on January 1, 2014 and followed through December 31, 2018 using nationwide administrative healthcare registries. Model predictors and outcome were pre-specified. Predictors were: Age, sex, education, use of antithrombotic, blood pressure-lowering, glucose-lowering, or lipid-lowering drugs, and a smoking proxy of smoking-cessation drug use or chronic obstructive pulmonary disease. Outcome was 5-year risk of first CVD event, a combination of ischemic heart disease, heart failure, peripheral artery disease, stroke, or cardiovascular death. Predictions were computed using cause-specific Cox regression models. The final model fitted in the full data was internally-externally validated in each Danish Region. The model was well-calibrated in all Regions. Areas under the curve (AUC) and Brier scores ranged from 76.3% to 79.6% and 3.3 to 4.4. The model was superior to an age-sex benchmark model with differences in AUC and Brier scores ranging from 1.2% to 1.5% and -0.02 to -0.03. Average predicted risks in each Danish municipality ranged from 2.8% to 5.9%. Predicted risks for a 66-year-old ranged from 2.6% to 25.3%. Personalized predicted risks across ages 30-85 were presented in an online calculator (https://hjerteforeningen.shinyapps.io/cvd-risk-manuscript/). Conclusion A CVD risk prediction model based solely on nationwide administrative registry data provided accurate prediction of personal and population-level 5-year first CVD event risk in the Danish population. This may inform clinical and public health primary prevention efforts.