Stage-differentiated modelling of DNA methylation landscapes uncovers salient biomarkers and prognostic signatures in colorectal cancer progression
Abstract Background: Aberrant DNA methylation acts epigenetically to skew the gene transcription rate up or down, with causative roles in the etiology of cancers. However research on the role of DNA methylation in driving the progression of cancers is limited. In this study, we have developed a comprehensive computational framework for the stage-differentiated modelling of DNA methylation landscapes in colorectal cancer (CRC), and unravelled significant stagewise signposts of CRC progression. Methods: The methylation β - matrix was derived from the public-domain TCGA data, converted into M-value matrix, annotated with AJCC stages, and analysed for stage-salient genes using multiple approaches involving stage-differentiated linear modelling of methylation patterns and/or expression patterns. Differentially methylated genes (DMGs) were identified using a contrast against controls (adjusted p-value <0.001 and |log fold-change of M-value| >2). These results were filtered using a series of all possible pairwise stage contrasts (p-value <0.05) to obtain stage-salient DMGs. These were then subjected to a consensus analysis, followed by Kaplan–Meier survival analysis to evaluate the impact of methylation patterns of consensus stage-salient biomarkers on disease prognosis.Results: We found significant genome-wide changes in methylation patterns in cancer cases relative to controls agnostic of stage. Our stage-differentiated analysis yielded the following stage-salient genes: one stage-I gene (FBN1), one stage-II gene (FOXG1), one stage-III gene (HCN1) and four stage-IV genes (NELL1, ZNF135, FAM123A, LAMA1). All the biomarkers were hypermethylated, indicating down-regulation and signifying a CpG island Methylator Phenotype (CIMP) manifestation. A significant prognostic signature consisting of FBN1 and FOXG1 survived all the steps of our analysis pipeline, and represents a novel early-stage biomarker. Conclusions: We have designed a workflow for stage-differentiated consensus analysis, and identified stage-salient diagnostic biomarkers and an early-stage prognostic biomarker panel. Our studies further yield a novel CIMP-like signature of potential clinical import underlying CRC progression.