The methylation of SDC2 and TFPI2 defined three methylator phenotypes of colorectal cancer
Methylation-based noninvasive molecular diagnostics are easy and feasible tools for the early detection of colorectal cancer (CRC). However, many of them have the limitation of low sensitivity with some CRCs detection failed in clinical practice. In this study, the clinical and pathological characteristics, as well as molecular features of three methylator-groups, defined by the promoter methylation status of SDC2 and TFPI2, were investigated in order to improve the performance of CRC detection. The Illumina Infinium 450k Human DNA methylation data and clinical information of CRCs were collected from The Cancer Genome Atlas (TCGA) project and Gene Expression Omnibus (GEO) database. CRC samples were divided into three groups, HH (dual-positive), HL (single positive) and LL (dual-negative) according to the methylation status of SDC2 and TFPI2 promoters. Differences in age, tumor location, microsatellite instable status and differentially expressed genes (DEGs) were evaluated among the three groups and these findings were then confirmed in our inner CRC dataset. The combination of methylated SDC2 and TFPI2 showed a superior performance of distinguishing CRCs from normal controls than each alone. Samples of HL group were more often originated from left-side CRCs whereas very few of them were from right-side (P < 0.05). HH grouped CRCs showed a higher level of microsatellite instability and mutation load than other two groups (mean nonsynonymous mutations for HH/HL/LL: 10.55/3.91/7.02, P = 0.0055). All mutations of BRAF, one of the five typical CpG island methylator phenotype (CIMP) related genes, were found in HH group (HH/HL/LL: 51/0/0, P = 0.018). Also there was a significantly older patient age at the diagnosis in HH group. Gene expression analysis identified 37, 84 and 22 group-specific DEGs for HH, HL and LL, respectively. Functional enrichment analysis suggested that HH specific DEGs were mainly related to the regulation of transcription and other processes, while LL specific DEGs were enriched in the biological processes of extracellular matrix interaction and cell migration. The three defined mathylator groups showed great difference in tumor location, patient age, MSI and ECM biological process, which could facilitate the development of more effective biomarkers for CRC detection.