Abstract
Background: Benign breast disease (BBD) is a risk factor for breast cancer (BC); however, little is known about the genetic alterations present at the time of BBD diagnosis, and how these relate to risk of incident BC. Methods: A subset of a long-term BBD cohort was selected to examine DNA variation across three BBD groups (42 future estrogen receptor-positive (ER+) BC, 36 future estrogen receptor-negative (ER-) BC and 42 controls cancer-free for at least 16 years post-BBD). DNA extracted from archival formalin fixed, paraffin-embedded (FFPE) tissue blocks was tested and analyzed for presence of DNA alterations using a targeted panel of 93 BC-associated genes. To address artifacts frequently observed in FFPE tissues (e.g., C>T changes), we applied three filtering strategies based on alternative allele frequencies and nucleotide substitution context. Gene-level associations were performed using two burden tests, SKAT-O and logistic regression, and adjusted for clinical and technical covariates including epithelial percentage, histologic impression, patient age and year of BBD biopsy. Results: After filtering, the variant frequency of SNPs in our samples was highly consistent with population allele frequencies reported in 1KG/ExAC (0.986, p <1e-16). Ten genes were found to be associated with later cancer status by four of 12 association methods, with nominal p-values <0.05: MED12, MSH2, BRIP1, PMS1, GATA3, MUC16, FAM175A, EXT2, MLH1 and TGFB1. Additionally, analysis revealed 11 and 19 gene-level associations with p-values <0.05 and OR<1 for all cancer cases and ER- cases, respectively. Comparing between the three case groups, BBD ER+ cases were closer to controls in mutation profile, while BBD ER- cases were distinct. Notably, the variant burden was significantly higher in controls than in either ER+ or ER- cases. CD45 expression was associated with mutational burden (p=0.003).Conclusions: Somatic mutations were more frequent in benign breast tissue from women who did not develop cancer, opening questions of clonal diversity or immune-mediated restraint on future cancer development. CD45 expression was positively associated with mutational burden, most strongly in controls. Further studies in both normal and premalignant tissues are needed to better understand the role of somatic gene mutations and their contribution to future cancer development.