Deep learning for automatic calcium scoring in population based cardiovascular screening
Abstract Background High volumes of standardized coronary artery calcium (CAC) scans are generated in screening that need to be scored accurately and efficiently to risk stratify individuals. Purpose To evaluate the performance of deep learning based software for automatic coronary calcium scoring in a screening setting. Methods Participants from the Robinsca trial that underwent low-dose ECG-triggered cardiac CT for calcium scoring were included. CAC was measured with fully automated deep learning prototype and compared to the original manual assessment of the Robinsca trial. Detection rate, positive Agatston score and risk categorization (0–99, 100–399, ≥400) were compared using McNemar test, ICC, and Cohen's kappa. False negative (FN), false positive (FP) rate and diagnostic accuracy were determined for preventive treatment initiation (cut-off ≥100 AU). Results In total, 997 participants were included between December 2015 and June 2016. Median age was 61.0 y (IQR: 11.0) and 54.4% was male. A high agreement for detection was found between deep learning based and manual scoring, κ=0.87 (95% CI 0.85–0.89). Median Agatston score was 58.4 (IQR: 12.3–200.2) and 61.2 (IQR: 13.9–212.9) for deep learning based and manual assessment respectively, ICC was 0.958 (95% CI 0.951–0.964). Reclassification rate was 2.0%, with a very high agreement with κ=0.960 (95% CI: 0.943–0.997), p<0.001. FN rate was 0.7% and FP rate was 0.1% and diagnostic accuracy was 99.2% for initiation of preventive treatment. Conclusion Deep learning based software for automatic CAC scoring can be used in a cardiovascular CT screening setting with high accuracy for risk categorization and initiation of preventive treatment. FUNDunding Acknowledgement Type of funding sources: Public grant(s) – EU funding. Main funding source(s): Robinsca trial was supported by advanced grant of European Research Council