scholarly journals Minimum error calibration and normalization for genomic copy number analysis

Genomics ◽  
2020 ◽  
Vol 112 (5) ◽  
pp. 3331-3341 ◽  
Author(s):  
Bo Gao ◽  
Michael Baudis
2019 ◽  
Author(s):  
Bo Gao ◽  
Michael Baudis

AbstractCopy number variations (CNV) are regional deviations from the normal autosomal bi-allelic DNA content. While germline CNVs are a major contributor to genomic syndromes and inherited diseases, the majority of cancers accumulate extensive “somatic” CNV (sCNV or CNA) during the process of oncogenetic transformation and progression. While specific sCNV have closely been associated with tumorigenesis, intriguingly many neoplasias exhibit recurrent sCNV patterns beyond the involvement of a few cancer driver genes.Currently, CNV profiles of tumor samples are generated using genomic micro-arrays or high-throughput DNA sequencing. Regardless of the underlying technology, genomic copy number data is derived from the relative assessment and integration of multiple signals, with the data generation process being prone to contamination from several sources. Estimated copy number values have no absolute and linear correlation to their corresponding DNA levels, and the extent of deviation differs between sample profiles which poses a great challenge for data integration and comparison in large scale genome analysis.In this study, we present a novel method named Minimum Error Calibration and Normalization of Copy Numbers Analysis (Mecan4CNA). For each sCNV profile,Mecan4CNAreduces the noise level, calculates values representing the normal DNA copies (baseline) and the change of one copy (level distance), and finally normalizes all values. Experiments ofMecan4CNAon simulated data showed an overall accuracy of 93% and 91% in determining the baseline and level distance, respectively. Comparison of baseline and level distance estimation with existing methods and karyotyping data on the NCI-60 tumor cell line produced coherent results. To estimate the method’s impact on downstream analyses we performed GISTIC analyses on original andMecan4CNAdata from the Cancer Genome Atlas (TCGA) where the normalized data showed prominent improvements of both sensitivity and specificity in detecting focal regions.In general,Mecan4CNAprovides an advanced method for CNA data normalization especially in research involving data of high volume and heterogeneous quality. but with its informative output and visualization can also facilitate analysis of individual CNA profiles.Mecan4CNAis freely available as a Python package and through Github.


2016 ◽  
Vol 146 (4) ◽  
pp. 439-447 ◽  
Author(s):  
Katherine B. Geiersbach ◽  
Carlynn Willmore-Payne ◽  
Alexandra V. Pasi ◽  
Christian N. Paxton ◽  
Theresa L. Werner ◽  
...  

2015 ◽  
Vol 36 (11) ◽  
pp. 1381-1387 ◽  
Author(s):  
Martin Selmansberger ◽  
Herbert Braselmann ◽  
Julia Hess ◽  
Tetiana Bogdanova ◽  
Michael Abend ◽  
...  

2016 ◽  
Vol 29 (3) ◽  
pp. 227-239 ◽  
Author(s):  
May P Chan ◽  
Aleodor A Andea ◽  
Paul W Harms ◽  
Alison B Durham ◽  
Rajiv M Patel ◽  
...  

2016 ◽  
Vol 26 (6) ◽  
pp. 844-851 ◽  
Author(s):  
Zihua Wang ◽  
Peter Andrews ◽  
Jude Kendall ◽  
Beicong Ma ◽  
Inessa Hakker ◽  
...  

2021 ◽  
Vol 89 (9) ◽  
pp. S106-S107
Author(s):  
Marieke Klein ◽  
Omar Shanta ◽  
Oanh Hong ◽  
Jeffrey MacDonald ◽  
Bhooma Thiruvahindrapuram ◽  
...  

2016 ◽  
Vol 18 (10) ◽  
pp. 1052-1055 ◽  
Author(s):  
Ye Cao ◽  
Zhihua Li ◽  
Jill A. Rosenfeld ◽  
Amber N. Pursley ◽  
Ankita Patel ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document