scholarly journals Normal Cell-Type Epigenetics and Breast Cancer Classification: A Case Study of Cell Mixture–Adjusted Analysis of DNA Methylation Data from Tumors

2014 ◽  
Vol 13s4 ◽  
pp. CIN.S13980 ◽  
Author(s):  
Eugene Andrέs Houseman ◽  
Tan A. Ince

Historically, breast cancer classification has relied on prognostic subtypes. Thus, unlike hematopoietic cancers, breast tumor classification lacks phylogenetic rationale. The feasibility of phylogenetic classification of breast tumors has recently been demonstrated based on estrogen receptor (ER), androgen receptor (AR), vitamin D receptor (VDR) and Keratin 5 expression. Four hormonal states (HR0–3) comprising 11 cellular subtypes of breast cells have been proposed. This classification scheme has been shown to have relevance to clinical prognosis. We examine the implications of such phylogenetic classification on DNA methylation of both breast tumors and normal breast tissues by applying recently developed deconvolution algorithms to three DNA methylation data sets archived on Gene Expression Omnibus. We propose that breast tumors arising from a particular cell-of-origin essentially magnify the epigenetic state of their original cell type. We demonstrate that DNA methylation of tumors manifests patterns consistent with cell-specific epigenetic states, that these states correspond roughly to previously posited normal breast cell types, and that estimates of proportions of the underlying cell types are predictive of tumor phenotypes. Taken together, these findings suggest that the epigenetics of breast tumors is ultimately based on the underlying phylogeny of normal breast tissue.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Hanyu Zhang ◽  
Ruoyi Cai ◽  
James Dai ◽  
Wei Sun

AbstractWe introduce a new computational method named EMeth to estimate cell type proportions using DNA methylation data. EMeth is a reference-based method that requires cell type-specific DNA methylation data from relevant cell types. EMeth improves on the existing reference-based methods by detecting the CpGs whose DNA methylation are inconsistent with the deconvolution model and reducing their contributions to cell type decomposition. Another novel feature of EMeth is that it allows a cell type with known proportions but unknown reference and estimates its methylation. This is motivated by the case of studying methylation in tumor cells while bulk tumor samples include tumor cells as well as other cell types such as infiltrating immune cells, and tumor cell proportion can be estimated by copy number data. We demonstrate that EMeth delivers more accurate estimates of cell type proportions than several other methods using simulated data and in silico mixtures. Applications in cancer studies show that the proportions of T regulatory cells estimated by DNA methylation have expected associations with mutation load and survival time, while the estimates from gene expression miss such associations.


2021 ◽  
Author(s):  
Wei Zhang ◽  
Hanwen Xu ◽  
Rong Qiao ◽  
Bixi Zhong ◽  
Xianglin Zhang ◽  
...  

Quantifying the cell proportions, especially for rare cell types in some scenarios, is of great value to track signals related to certain phenotypes or diseases. Although some methods have been pro-posed to infer cell proportions from multi-component bulk data, they are substantially less effective for estimating rare cell type proportions since they are highly sensitive against feature outliers and collinearity. Here we proposed a new deconvolution algorithm named ARIC to estimate cell type proportions from bulk gene expression or DNA methylation data. ARIC utilizes a novel two-step marker selection strategy, including component-wise condition number-based feature collinearity elimination and adaptive outlier markers removal. This strategy can systematically obtain effective markers that ensure a robust and precise weighted υ-support vector regression-based proportion prediction. We showed that ARIC can estimate fractions accurately in both DNA methylation and gene expression data from different experiments. Taken together, ARIC is a promising tool to solve the deconvolution problem of bulk data where rare components are of vital importance.


Author(s):  
Weiwei Zhang ◽  
Hao Wu ◽  
Ziyi Li

Abstract Motivation It is a common practice in epigenetics research to profile DNA methylation on tissue samples, which is usually a mixture of different cell types. To properly account for the mixture, estimating cell compositions has been recognized as an important first step. Many methods were developed for quantifying cell compositions from DNA methylation data, but they mostly have limited applications due to lack of reference or prior information. Results We develop Tsisal, a novel complete deconvolution method which accurately estimate cell compositions from DNA methylation data without any prior knowledge of cell types or their proportions. Tsisal is a full pipeline to estimate number of cell types, cell compositions, and identify cell-type-specific CpG sites. It can also assign cell type labels when (full or part of) reference panel is available. Extensive simulation studies and analyses of seven real data sets demonstrate the favorable performance of our proposed method compared with existing deconvolution methods serving similar purpose. Availability The proposed method Tsisal is implemented as part of the R/Bioconductor package TOAST at https://bioconductor.org/packages/TOAST. Contact [email protected] and [email protected]. Supplementary information Supplementary data are available at Bioinformatics online.


Cancers ◽  
2020 ◽  
Vol 12 (12) ◽  
pp. 3639
Author(s):  
Sen Peng ◽  
Lora L. Hebert ◽  
Jennifer M. Eschbacher ◽  
Suwon Kim

The human breast is composed of diverse cell types. Studies have delineated mammary epithelial cells, but the other cell types in the breast have scarcely been characterized. In order to gain insight into the cellular composition of the tissue, we performed droplet-mediated RNA sequencing of 3193 single cells isolated from a postmenopausal breast tissue without enriching for epithelial cells. Unbiased clustering analysis identified 10 distinct cell clusters, seven of which were nonepithelial devoid of cytokeratin expression. The remaining three cell clusters expressed cytokeratins (CKs), representing breast epithelial cells; Cluster 2 and Cluster 7 cells expressed luminal and basal CKs, respectively, whereas Cluster 9 cells expressed both luminal and basal CKs, as well as other CKs of unknown specificity. To assess which cell type(s) potentially contributes to breast cancer, we used the differential gene expression signature of each cell cluster to derive gene set variation analysis (GSVA) scores and classified breast tumors in The Cancer Gene Atlas (TGGA) dataset (n = 1100) by assigning the highest GSVA scoring cell cluster number for each tumor. The results showed that five clusters (Clusters 2, 3, 7, 8, and 9) could categorize >85% of breast tumors collectively. Notably, Cluster 2 (luminal epithelial) and Cluster 3 (fibroblast) tumors were equally prevalent in the luminal breast cancer subtypes, whereas Cluster 7 (basal epithelial) and Cluster 9 (other epithelial) tumors were present primarily in the triple-negative breast cancer (TNBC) subtype. Cluster 8 (immune) tumors were present in all subtypes, indicating that immune cells may contribute to breast cancer regardless of the subtypes. Cluster 9 tumors were significantly associated with poor patient survival in TNBC, suggesting that this epithelial cell type may give rise to an aggressive TNBC subset.


2009 ◽  
Vol 10 (1) ◽  
Author(s):  
Herman MJ Sontrop ◽  
Perry D Moerland ◽  
René van den Ham ◽  
Marcel JT Reinders ◽  
Wim FJ Verhaegh

2021 ◽  
Vol 23 (1) ◽  
Author(s):  
Shoghag Panjarian ◽  
Jozef Madzo ◽  
Kelsey Keith ◽  
Carolyn M. Slater ◽  
Carmen Sapienza ◽  
...  

Abstract Background DNA methylation alterations have similar patterns in normal aging tissue and in cancer. In this study, we investigated breast tissue-specific age-related DNA methylation alterations and used those methylation sites to identify individuals with outlier phenotypes. Outlier phenotype is identified by unsupervised anomaly detection algorithms and is defined by individuals who have normal tissue age-dependent DNA methylation levels that vary dramatically from the population mean. Methods We generated whole-genome DNA methylation profiles (GSE160233) on purified epithelial cells and used publicly available Infinium HumanMethylation 450K array datasets (TCGA, GSE88883, GSE69914, GSE101961, and GSE74214) for discovery and validation. Results We found that hypermethylation in normal breast tissue is the best predictor of hypermethylation in cancer. Using unsupervised anomaly detection approaches, we found that about 10% of the individuals (39/427) were outliers for DNA methylation from 6 DNA methylation datasets. We also found that there were significantly more outlier samples in normal-adjacent to cancer (24/139, 17.3%) than in normal samples (15/228, 5.2%). Additionally, we found significant differences between the predicted ages based on DNA methylation and the chronological ages among outliers and not-outliers. Additionally, we found that accelerated outliers (older predicted age) were more frequent in normal-adjacent to cancer (14/17, 82%) compared to normal samples from individuals without cancer (3/17, 18%). Furthermore, in matched samples, we found that the epigenome of the outliers in the pre-malignant tissue was as severely altered as in cancer. Conclusions A subset of patients with breast cancer has severely altered epigenomes which are characterized by accelerated aging in their normal-appearing tissue. In the future, these DNA methylation sites should be studied further such as in cell-free DNA to determine their potential use as biomarkers for early detection of malignant transformation and preventive intervention in breast cancer.


Sign in / Sign up

Export Citation Format

Share Document