scholarly journals Empirical Bayes Model Comparisons for Differential Methylation Analysis

2012 ◽  
Vol 2012 ◽  
pp. 1-10 ◽  
Author(s):  
Mingxiang Teng ◽  
Yadong Wang ◽  
Seongho Kim ◽  
Lang Li ◽  
Changyu Shen ◽  
...  

A number of empirical Bayes models (each with different statistical distribution assumptions) have now been developed to analyze differential DNA methylation using high-density oligonucleotide tiling arrays. However, it remains unclear which model performs best. For example, for analysis of differentially methylated regions for conservative and functional sequence characteristics (e.g., enrichment of transcription factor-binding sites (TFBSs)), the sensitivity of such analyses, using various empirical Bayes models, remains unclear. In this paper, five empirical Bayes models were constructed, based on either a gamma distribution or a log-normal distribution, for the identification of differential methylated loci and their cell division—(1, 3, and 5) and drug-treatment-(cisplatin) dependent methylation patterns. While differential methylation patterns generated by log-normal models were enriched with numerous TFBSs, we observed almost no TFBS-enriched sequences using gamma assumption models. Statistical and biological results suggest log-normal, rather than gamma, empirical Bayes model distribution to be a highly accurate and precise method for differential methylation microarray analysis. In addition, we presented one of the log-normal models for differential methylation analysis and tested its reproducibility by simulation study. We believe this research to be the first extensive comparison of statistical modeling for the analysis of differential DNA methylation, an important biological phenomenon that precisely regulates gene transcription.

Blood ◽  
2012 ◽  
Vol 120 (21) ◽  
pp. 653-653 ◽  
Author(s):  
Ying Qu ◽  
Andreas Lennartsson ◽  
Verena I. Gaidzik ◽  
Stefan Deneberg ◽  
Sofia Bengtzén ◽  
...  

Abstract Abstract 653 DNA methylation is involved in multiple biologic processes including normal cell differentiation and tumorigenesis. In AML, methylation patterns have been shown to differ significantly from normal hematopoietic cells. Most studies of DNA methylation in AML have previously focused on CpG islands within the promoter of genes, representing only a very small proportion of the DNA methylome. In this study, we performed genome-wide methylation analysis of 62 AML patients with CN-AML and CD34 positive cells from healthy controls by Illumina HumanMethylation450K Array covering 450.000 CpG sites in CpG islands as well as genomic regions far from CpG islands. Differentially methylated CpG sites (DMS) between CN-AML and normal hematopoietic cells were calculated and the most significant enrichment of DMS was found in regions more than 4kb from CpG Islands, in the so called open sea where hypomethylation was the dominant form of aberrant methylation. In contrast, CpG islands were not enriched for DMS and DMS in CpG islands were dominated by hypermethylation. DMS successively further away from CpG islands in CpG island shores (up to 2kb from CpG Island) and shelves (from 2kb to 4kb from Island) showed increasing degree of hypomethylation in AML cells. Among regions defined by their relation to gene structures, CpG dinucleotide located in theoretic enhancers were found to be the most enriched for DMS (Chi χ2<0.0001) with the majority of DMS showing decreased methylation compared to CD34 normal controls. To address the relation to gene expression, GEP (gene expression profiling) by microarray was carried out on 32 of the CN-AML patients. Totally, 339723 CpG sites covering 18879 genes were addressed on both platforms. CpG methylation in CpG islands showed the most pronounced anti-correlation (spearman ρ =-0.4145) with gene expression level, followed by CpG island shores (mean spearman rho for both sides' shore ρ=-0.2350). As transcription factors (TFs) have shown to be crucial for AML development, we especially studied differential methylation of an unbiased selection of 1638 TFs. The most enriched differential methylation between CN-AML and normal CD34 positive cells were found in TFs known to be involved in hematopoiesis and with Wilms tumor protein-1 (WT1), activator protein 1 (AP-1) and runt-related transcription factor 1 (RUNX1) being the most differentially methylated TFs. The differential methylation in WT 1 and RUNX1 was located in intragenic regions which were confirmed by pyro-sequencing. AML cases were characterized with respect to mutations in FLT3, NPM1, IDH1, IDH2 and DNMT3A. Correlation analysis between genome wide methylation patterns and mutational status showed statistically significant hypomethylation of CpG Island (p<0.0001) and to a lesser extent CpG island shores (p<0.001) and the presence of DNMT3A mutations. This links DNMT3A mutations for the first time to a hypomethylated phenotype. Further analyses correlating methylation patterns to other clinical data such as clinical outcome are ongoing. In conclusion, our study revealed that non-CpG island regions and in particular enhancers are the most aberrantly methylated genomic regions in AML and that WT 1 and RUNX1 are the most differentially methylated TFs. Furthermore, our data suggests a hypomethylated phenotype in DNMT3A mutated AML. Disclosures: No relevant conflicts of interest to declare.


BMC Cancer ◽  
2019 ◽  
Vol 19 (1) ◽  
Author(s):  
Yang Liu ◽  
Yue Gu ◽  
Mu Su ◽  
Hui Liu ◽  
Shumei Zhang ◽  
...  

Abstract Background It is generally believed that DNA methylation, as one of the most important epigenetic modifications, participates in the regulation of gene expression and plays an important role in the development of cancer, and there exits epigenetic heterogeneity among cancers. Therefore, this study tried to screen for reliable prognostic markers for different cancers, providing further explanation for the heterogeneity of cancers, and more targets for clinical transformation studies of cancer from epigenetic perspective. Methods This article discusses the epigenetic heterogeneity of cancer in detail. Firstly, DNA methylation data of seven cancer types were obtained from Illumina Infinium HumanMethylation 450 K platform of TCGA database. Then, differential methylation analysis was performed in the promotor region. Secondly, pivotal gene markers were obtained by constructing the DNA methylation correlation network and the gene interaction network in the KEGG pathway, and 317 marker genes obtained from two networks were integrated as candidate markers for the prognosis model. Finally, we used the univariate and multivariate COX regression models to select specific independent prognostic markers for each cancer, and studied the risk factor of these genes by doing survival analysis. Results First, the cancer type-specific gene markers were obtained by differential methylation analysis and they were found to be involved in different biological functions by enrichment analysis. Moreover, specific and common diagnostic markers for each type of cancer was sorted out and Kaplan-Meier survival analysis showed that there was significant difference in survival between the two risk groups. Conclusions This study screened out reliable prognostic markers for different cancers, providing a further explanation for the heterogeneity of cancer at the DNA methylation level and more targets for clinical conversion studies of cancer.


2020 ◽  
Vol 21 (S6) ◽  
Author(s):  
Xinyu Hu ◽  
Li Tang ◽  
Linconghua Wang ◽  
Fang-Xiang Wu ◽  
Min Li

Abstract Background DNA methylation in the human genome is acknowledged to be widely associated with biological processes and complex diseases. The Illumina Infinium methylation arrays have been approved as one of the most efficient and universal technologies to investigate the whole genome changes of methylation patterns. As methylation arrays may still be the dominant method for detecting methylation in the anticipated future, it is crucial to develop a reliable workflow to analysis methylation array data. Results In this study, we develop a web service MADA for the whole process of methylation arrays data analysis, which includes the steps of a comprehensive differential methylation analysis pipeline: pre-processing (data loading, quality control, data filtering, and normalization), batch effect correction, differential methylation analysis, and downstream analysis. In addition, we provide the visualization of pre-processing, differentially methylated probes or regions, gene ontology, pathway and cluster analysis results. Moreover, a customization function for users to define their own workflow is also provided in MADA. Conclusions With the analysis of two case studies, we have shown that MADA can complete the whole procedure of methylation array data analysis. MADA provides a graphical user interface and enables users with no computational skills and limited bioinformatics background to carry on complicated methylation array data analysis. The web server is available at: http://120.24.94.89:8080/MADA


F1000Research ◽  
2018 ◽  
Vol 6 ◽  
pp. 2055 ◽  
Author(s):  
Yunshun Chen ◽  
Bhupinder Pal ◽  
Jane E. Visvader ◽  
Gordon K. Smyth

Cytosine methylation is an important DNA epigenetic modification. In vertebrates, methylation occurs at CpG sites, which are dinucleotides where a cytosine is immediately followed by a guanine in the DNA sequence from 5' to 3'. When located in the promoter region of a gene, DNA methylation is often associated with transcriptional silencing of the gene. Aberrant DNA methylation is associated with the development of various diseases such as cancer. Bisulfite sequencing (BS-seq) is the current "gold-standard" technology for high-resolution profiling of DNA methylation. Reduced representation bisulfite sequencing (RRBS) is an efficient form of BS-seq that targets CpG-rich DNA regions in order to save sequencing costs. A typical bioinformatics aim is to identify CpGs that are differentially methylated (DM) between experimental conditions. This workflow demonstrates that differential methylation analysis of RRBS data can be conducted using software and methodology originally developed for RNA-seq data. The RNA-seq pipeline is adapted to methylation by adding extra columns to the design matrix to account for read coverage at each CpG, after which the RRBS and RNA-seq pipelines are almost identical. This approach is statistically natural and gives analysts access to a rich collection of analysis tools including generalized linear models, gene set testing and pathway analysis. The article presents a complete start to finish case study analysis of RRBS profiles of different cell populations from the mouse mammary gland using the Bioconductor package edgeR. We show that lineage-committed cells are typically hyper-methylated compared to progenitor cells and this is true on all the autosomes but not the sex chromosomes. We demonstrate a strong negative correlation between methylation of promoter regions and gene expression as measured by RNA-seq for the same cell types, showing that methylation is a regulatory mechanism involved in epithelial linear commitment.


eLife ◽  
2017 ◽  
Vol 6 ◽  
Author(s):  
Sebastian Canovas ◽  
Elena Ivanova ◽  
Raquel Romar ◽  
Soledad García-Martínez ◽  
Cristina Soriano-Úbeda ◽  
...  

The number of children born since the origin of Assisted Reproductive Technologies (ART) exceeds 5 million. The majority seem healthy, but a higher frequency of defects has been reported among ART-conceived infants, suggesting an epigenetic cost. We report the first whole-genome DNA methylation datasets from single pig blastocysts showing differences between in vivo and in vitro produced embryos. Blastocysts were produced in vitro either without (C-IVF) or in the presence of natural reproductive fluids (Natur-IVF). Natur-IVF embryos were of higher quality than C-IVF in terms of cell number and hatching ability. RNA-Seq and DNA methylation analyses showed that Natur-IVF embryos have expression and methylation patterns closer to in vivo blastocysts. Genes involved in reprogramming, imprinting and development were affected by culture, with fewer aberrations in Natur-IVF embryos. Methylation analysis detected methylated changes in C-IVF, but not in Natur-IVF, at genes whose methylation could be critical, such as IGF2R and NNAT.


2021 ◽  
Author(s):  
Jennifer Lu ◽  
Darren Korbie ◽  
Matt Trau

DNA methylation is one of the most commonly studied epigenetic biomarkers, due to its role in disease and development. The Illumina Infinium methylation arrays still remains the most common method to interrogate methylation across the human genome, due to its capabilities of screening over 480, 000 loci simultaneously. As such, initiatives such as The Cancer Genome Atlas (TCGA) have utilized this technology to examine the methylation profile of over 20,000 cancer samples. There is a growing body of methods for pre-processing, normalisation and analysis of array-based DNA methylation data. However, the shape and sampling distribution of probe-wise methylation that could influence the way data should be examined was rarely discussed. Therefore, this article introduces a pipeline that predicts the shape and distribution of normalised methylation patterns prior to selection of the most optimal inferential statistics screen for differential methylation. Additionally, we put forward an alternative pipeline, which employed feature selection, and demonstrate its ability to select for biomarkers with outstanding differences in methylation, which does not require the predetermination of the shape or distribution of the data of interest. Availability: The Distribution test and the feature selection pipelines are available for download at: https://github.com/uqjlu8/DistributionTest Keywords: DNA methylation, Biomarkers, Cancers, Data Distribution, TCGA, 450K


2020 ◽  
Vol 135 (1) ◽  
pp. 167-173 ◽  
Author(s):  
Barbara Elisabeth Koop ◽  
Felix Mayer ◽  
Tanju Gündüz ◽  
Jacqueline Blum ◽  
Julia Becker ◽  
...  

AbstractAge estimation based on the analysis of DNA methylation patterns has become a focus of forensic research within the past few years. However, there is little data available regarding postmortem DNA methylation analysis yet, and literature mainly encompasses analysis of blood from corpses without any signs of decomposition. It is not entirely clear yet which other types of specimen are suitable for postmortem epigenetic age estimation, and if advanced decomposition may affect methylation patterns of CpG sites. In living persons, buccal swabs are an easily accessible source of DNA for epigenetic age estimation. In this work, the applicability of this approach (buccal swabs as source of DNA) under different postmortem conditions was tested. Methylation levels of PDE4C were investigated in buccal swab samples collected from 73 corpses (0–90 years old; mean: 51.2) in different stages of decomposition. Moreover, buccal swab samples from 142 living individuals (0–89 years old; mean 41.2) were analysed. As expected, methylation levels exhibited a high correlation with age in living individuals (training set: r2 = 0.87, validation set: r2 = 0.85). This was also the case in postmortem samples (r2 = 0.90), independent of the state of decomposition. Only in advanced putrified cases with extremely low DNA amounts, epigenetic age estimation was not possible. In conclusion, buccal swabs are a suitable and easy to collect source for DNA methylation analysis as long as sufficient amounts of DNA are present.


Sign in / Sign up

Export Citation Format

Share Document