Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models

Abstract Background Comprehensive breast cancer risk prediction models enable identifying and targeting women at high-risk, while reducing interventions in those at low-risk. Breast cancer risk prediction models used in clinical practice have low discriminatory accuracy (0.53–0.64). Machine learning (ML) offers an alternative approach to standard prediction modeling that may address current limitations and improve accuracy of those tools. The purpose of this study was to compare the discriminatory accuracy of ML-based estimates against a pair of established methods—the Breast Cancer Risk Assessment Tool (BCRAT) and Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) models. Methods We quantified and compared the performance of eight different ML methods to the performance of BCRAT and BOADICEA using eight simulated datasets and two retrospective samples: a random population-based sample of U.S. breast cancer patients and their cancer-free female relatives (N = 1143), and a clinical sample of Swiss breast cancer patients and cancer-free women seeking genetic evaluation and/or testing (N = 2481). Results Predictive accuracy (AU-ROC curve) reached 88.28% using ML-Adaptive Boosting and 88.89% using ML-random forest versus 62.40% with BCRAT for the U.S. population-based sample. Predictive accuracy reached 90.17% using ML-adaptive boosting and 89.32% using ML-Markov chain Monte Carlo generalized linear mixed model versus 59.31% with BOADICEA for the Swiss clinic-based sample. Conclusions There was a striking improvement in the accuracy of classification of women with and without breast cancer achieved with ML algorithms compared to the state-of-the-art model-based approaches. High-accuracy prediction techniques are important in personalized medicine because they facilitate stratification of prevention strategies and individualized clinical management.

Download Full-text

Abstract 268: Absolute breast cancer risk according to three risk prediction models: Inverse associations with risk of death and poor prognostic features

10.1158/1538-7445.am2014-268 ◽

2014 ◽

Author(s):

Mark E. Sherman ◽

Laura Ichikawa ◽

Diana Miglioretti ◽

Pamela Vacek ◽

Jeffrey Tice ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Prediction Models ◽

Risk Prediction Models ◽

Prognostic Features ◽

Risk Of Death

Download Full-text

Abstract 2600: Improving breast cancer risk prediction models: the addition of a genetic risk score, mammographic density, and endogenous hormones

10.1158/1538-7445.am2016-2600 ◽

2016 ◽

Author(s):

Xuehong Zhang ◽

Megan Rice ◽

Shelley S. Tworoger ◽

Bernard A. Rosner ◽

A. Heather Eliassen ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Mammographic Density ◽

Genetic Risk ◽

Prediction Models ◽

Genetic Risk Score ◽

Endogenous Hormones ◽

Risk Prediction Models

Download Full-text

Validating Breast Cancer Risk Prediction Models in the Korean Cancer Prevention Study-II Biobank

Cancer Epidemiology Biomarkers & Prevention ◽

10.1158/1055-9965.epi-19-1478 ◽

2020 ◽

Vol 29 (6) ◽

pp. 1271-1277 ◽

Cited By ~ 1

Author(s):

Yon Ho Jee ◽

Chi Gao ◽

Jihye Kim ◽

Seho Park ◽

Sun Ha Jee ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Cancer Prevention ◽

Risk Prediction ◽

Prediction Models ◽

Prevention Study ◽

Risk Prediction Models

Download Full-text

Abstract 4169: Population-based breast cancer risk estimates associated with cancer predisposition gene mutations from 32,298 breast cancer patients and 31,869 matched unaffected controls from the CARRIERS study

10.1158/1538-7445.am2019-4169 ◽

2019 ◽

Author(s):

Fergus J. Couch ◽

Chunling Hu ◽

Steven N. Hart ◽

Rohan Gnanaolivu ◽

Jenna Lilyquist ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Cancer Patients ◽

Gene Mutations ◽

Population Based ◽

Cancer Predisposition ◽

Breast Cancer Patients ◽

Risk Estimates ◽

Cancer Predisposition Gene

Download Full-text

Abstract GS2-01: Age-related breast cancer risk estimates for the general population based on sequencing of cancer predisposition genes in 19,228 breast cancer patients and 20,211 matched unaffected controls from US based cohorts in the CARRIERS study

10.1158/1538-7445.sabcs18-gs2-01 ◽

2019 ◽

Author(s):

FJ Couch ◽

C Hu ◽

SN Hart ◽

RD Gnanaolivu ◽

J Lilyquist ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

General Population ◽

Cancer Risk ◽

Cancer Patients ◽

Population Based ◽

Cancer Predisposition ◽

Breast Cancer Patients ◽

Age Related ◽

Predisposition Genes

Download Full-text

Breast Cancer Risk Prediction Models: Challenges in Clinical Application

10.1188/19.cjon.256-259 ◽

2019 ◽

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Clinical Application ◽

Prediction Models ◽

Risk Prediction Models

Download Full-text

Utilization of breast cancer risk prediction models by cancer genetic counselors in clinical practice predominantly in the United States

Journal of Genetic Counseling ◽

10.1002/jgc4.1442 ◽

2021 ◽

Author(s):

Min Seon Park ◽

Scott M. Weissman ◽

Kristen J. Vogel Postula ◽

Carmen S. Williams ◽

Caitlin B. Mauer ◽

...

Keyword(s):

Breast Cancer ◽

United States ◽

Clinical Practice ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Prediction Models ◽

The United States ◽

Cancer Genetic ◽

Risk Prediction Models

Download Full-text

Development of breast cancer risk prediction models using the UK biobank dataset

Epidemiology Open Access ◽

10.4172/2161-1165-c1-020 ◽

2018 ◽

Vol 08 ◽

Author(s):

Kawthar Al-ajmi

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Prediction Models ◽

Uk Biobank ◽

Risk Prediction Models ◽

The Uk

Download Full-text

Uncertainty quantification in breast cancer risk prediction models using self-reported family health history

Journal of Clinical and Translational Science ◽

10.1017/cts.2016.9 ◽

2017 ◽

Vol 1 (1) ◽

pp. 53-59 ◽

Cited By ~ 2

Author(s):

Lance T. Pflieger ◽

Clinton C. Mason ◽

Julio C. Facelli

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Clinical Setting ◽

Prediction Models ◽

Family Health ◽

Clinical Settings ◽

Risk Prediction Models ◽

Health History

Introduction. Family health history (FHx) is an important factor in breast and ovarian cancer risk assessment. As such, multiple risk prediction models rely strongly on FHx data when identifying a patient’s risk. These models were developed using verified information and when translated into a clinical setting assume that a patient’s FHx is accurate and complete. However, FHx information collected in a typical clinical setting is known to be imprecise and it is not well understood how this uncertainty may affect predictions in clinical settings. Methods. Using Monte Carlo simulations and existing measurements of uncertainty of self-reported FHx, we show how uncertainty in FHx information can alter risk classification when used in typical clinical settings. Results. We found that various models ranged from 52% to 64% for correct tier-level classification of pedigrees under a set of contrived uncertain conditions, but that significant misclassification are not negligible. Conclusions. Our work implies that (i) uncertainty quantification needs to be considered when transferring tools from a controlled research environment to a more uncertain environment (i.e, a health clinic) and (ii) better FHx collection methods are needed to reduce uncertainty in breast cancer risk prediction in clinical settings.

Download Full-text