scholarly journals A data-driven approach to preprocessing Illumina 450K methylation array data

BMC Genomics ◽  
2013 ◽  
Vol 14 (1) ◽  
pp. 293 ◽  
Author(s):  
Ruth Pidsley ◽  
Chloe C Y Wong ◽  
Manuela Volta ◽  
Katie Lunnon ◽  
Jonathan Mill ◽  
...  
2016 ◽  
Vol 8 (1) ◽  
Author(s):  
Emma Cazaly ◽  
Russell Thomson ◽  
James R. Marthick ◽  
Adele F. Holloway ◽  
Jac Charlesworth ◽  
...  

2014 ◽  
Vol 15 (11) ◽  
Author(s):  
Jean-Philippe Fortin ◽  
Aurélie Labbe ◽  
Mathieu Lemire ◽  
Brent W Zanke ◽  
Thomas J Hudson ◽  
...  

2014 ◽  
Author(s):  
Jean-Philippe Fortin ◽  
Aurélie Labbe ◽  
Mathieu Lemire ◽  
Brent W. Zanke ◽  
Thomas J. Hudson ◽  
...  

AbstractWe propose an extension to quantile normalization which removes unwanted technical variation using control probes. We adapt our algorithm, functional normalization, to the Illumina 450k methylation array and address the open problem of normalizing methylation data with global epigenetic changes, such as human cancers. Using datasets from The Cancer Genome Atlas and a large case-control study, we show that our algorithm outperforms all existing normalization methods with respect to replication of results between experiments, and yields robust results even in the presence of batch effects. Functional normalization can be applied to any microarray platform, provided suitable control probes are available.


2016 ◽  
Author(s):  
Shan V. Andrews ◽  
Christine Ladd-Acosta ◽  
Andrew P. Feinberg ◽  
Kasper D. Hansen ◽  
M. Daniele Fallin

AbstractBackgroundThe Illumina 450K array has been widely used in epigenetic association studies. Current quality-control (QC) pipelines typically remove certain sets of probes, such as those containing a SNP or with multiple mapping locations. An additional set of potentially problematic probes are those with DNA methylation (DNAm) distributions characterized by two or more distinct clusters separated by gaps. Data-driven identification of such probes may offer additional insights for downstream analyses.ResultsWe developed a procedure, termed “gap hunting”, to identify probes showing clustered distributions. Among 590 peripheral blood samples from the Study to Explore Early Development, we identified 11,007 “gap probes”. The vast majority (9,199) are likely attributed to an underlying SNP(s) or other variant in the probe, although SNP-affected probes exist that do not produce a gap signals. Specific factors predict which SNPs lead to gap signals, including type of nucleotide change, probe type, DNA strand, and overall methylation state. These expected effects are demonstrated in paired genotype and 450k data on the same samples. Gap probes can also serve as a surrogate for the local genetic sequence on a haplotype scale and can be used to adjust for population stratification.ConclusionsThe characteristics of gap probes reflect potentially informative biology. QC pipelines may benefit from an efficient data-driven approach that “flags” gap probes, rather than filtering such probes, followed by careful interpretation of downstream association analyses. Our results should translate directly to the recently released Illumina 850K EPIC array given the similar chemistry and content design.


2018 ◽  
Vol 24 (7) ◽  
pp. 1503-1509 ◽  
Author(s):  
Andrew D Beggs ◽  
Jonathan James ◽  
Germaine Caldwell ◽  
Toby Prout ◽  
Mark P Dilworth ◽  
...  

Abstract Background and aims Ulcerative colitis (UC) is associated with a higher background risk of dysplasia and/or neoplasia due to chronic inflammation. There exist few biomarkers for identification of patients with dysplasia, and targeted biopsies in this group of patients are inaccurate in reliably identifying dysplasia. We aimed to examine the epigenome of UC dysplasia and to identify and validate potential biomarkers Methods Colonic samples from patients with UC-associated dysplasia or neoplasia underwent epigenome-wide analysis on the Illumina 450K methylation array. Markers were validated by bisulphite pyrosequencing on a secondary validation cohort and accuracy calculated using logistic regression and receiver-operator curves. Results Twelve samples from 4 patients underwent methylation array analysis and 6 markers (GNG7, VAV3, KIF5C, PIK3R5, TUBB6, and ZNF583) were taken forward for secondary validation on a cohort of 71 colonic biopsy samples consisting of normal uninflamed mucosa from control patients, acute and chronic colitis, “field” mucosa in patients with dysplasia/neoplasia, dysplasia, and neoplasia. Methylation in the beta-tubulin TUBB6 correlated with the presence of dysplasia (P < 0.0001) and accurately discriminated between dysplasia and nondysplastic tissue, even in the apparently normal field mucosa downstream from dysplastic lesions (AUC 0.84, 95% CI 0.81–0.87). Conclusions Methylation in TUBB6 is a potential biomarker for UC- associated dysplasia. Further validation is needed and is ongoing as part of the ENDCAP-C study.


2012 ◽  
Author(s):  
Michael Ghil ◽  
Mickael D. Chekroun ◽  
Dmitri Kondrashov ◽  
Michael K. Tippett ◽  
Andrew Robertson ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document