scholarly journals Copy number variation detection and genotyping from exome sequence data

2012 ◽  
Vol 22 (8) ◽  
pp. 1525-1532 ◽  
Author(s):  
N. Krumm ◽  
P. H. Sudmant ◽  
A. Ko ◽  
B. J. O'Roak ◽  
M. Malig ◽  
...  
2020 ◽  
Vol 36 (12) ◽  
pp. 3890-3891
Author(s):  
Linjie Wu ◽  
Han Wang ◽  
Yuchao Xia ◽  
Ruibin Xi

Abstract Motivation Whole-genome sequencing (WGS) is widely used for copy number variation (CNV) detection. However, for most bacteria, their circular genome structure and high replication rate make reads more enriched near the replication origin. CNV detection based on read depth could be seriously influenced by such replication bias. Results We show that the replication bias is widespread using ∼200 bacterial WGS data. We develop CNV-BAC (CNV-Bacteria) that can properly normalize the replication bias and other known biases in bacterial WGS data and can accurately detect CNVs. Simulation and real data analysis show that CNV-BAC achieves the best performance in CNV detection compared with available algorithms. Availability and implementation CNV-BAC is available at https://github.com/XiDsLab/CNV-BAC. Supplementary information Supplementary data are available at Bioinformatics online.


2018 ◽  
Vol 5 (3) ◽  
pp. 307-314 ◽  
Author(s):  
Lydia Sagath ◽  
Vilma-Lotta Lehtokari ◽  
Salla Välipakka ◽  
Bjarne Udd ◽  
Carina Wallgren-Pettersson ◽  
...  

2015 ◽  
Vol 143 (suppl_1) ◽  
pp. A013-A013
Author(s):  
Linda B. Baughn ◽  
Getiria Onsongo ◽  
Matthew Bower ◽  
Christine Henzler ◽  
Kevin A.T. Silverstein ◽  
...  

2011 ◽  
Vol 12 (8) ◽  
pp. R80 ◽  
Author(s):  
Jiqiu Cheng ◽  
Evelyne Vanneste ◽  
Peter Konings ◽  
Thierry Voet ◽  
Joris R Vermeesch ◽  
...  

Genes ◽  
2020 ◽  
Vol 11 (2) ◽  
pp. 141 ◽  
Author(s):  
Feichen Shen ◽  
Jeffrey M. Kidd

Gene duplication is a major mechanism for the evolution of gene novelty, and copy-number variation makes a major contribution to inter-individual genetic diversity. However, most approaches for studying copy-number variation rely upon uniquely mapping reads to a genome reference and are unable to distinguish among duplicated sequences. Specialized approaches to interrogate specific paralogs are comparatively slow and have a high degree of computational complexity, limiting their effective application to emerging population-scale data sets. We present QuicK-mer2, a self-contained, mapping-free approach that enables the rapid construction of paralog-specific copy-number maps from short-read sequence data. This approach is based on the tabulation of unique k-mer sequences from short-read data sets, and is able to analyze a 20X coverage human genome in approximately 20 min. We applied our approach to newly released sequence data from the 1000 Genomes Project, constructed paralog-specific copy-number maps from 2457 unrelated individuals, and uncovered copy-number variation of paralogous genes. We identify nine genes where none of the analyzed samples have a copy number of two, 92 genes where the majority of samples have a copy number other than two, and describe rare copy number variation effecting multiple genes at the APOBEC3 locus.


Sign in / Sign up

Export Citation Format

Share Document