scholarly journals Copy number variant detection in inbred strains from short read sequence data

2009 ◽  
Vol 26 (4) ◽  
pp. 565-567 ◽  
Author(s):  
J. T. Simpson ◽  
R. E. McIntyre ◽  
D. J. Adams ◽  
R. Durbin
PLoS ONE ◽  
2017 ◽  
Vol 12 (7) ◽  
pp. e0180647 ◽  
Author(s):  
Bhuvan Molparia ◽  
Eshaan Nichani ◽  
Ali Torkamani

2015 ◽  
Vol 42 (7) ◽  
pp. 1556-1571 ◽  
Author(s):  
Ayten Yiğiter ◽  
Jie Chen ◽  
Lingling An ◽  
Nazan Danacioğlu

PLoS ONE ◽  
2010 ◽  
Vol 5 (12) ◽  
pp. e14456 ◽  
Author(s):  
Debby W. Tsuang ◽  
Steven P. Millard ◽  
Benjamin Ely ◽  
Peter Chi ◽  
Kenneth Wang ◽  
...  

2017 ◽  
Author(s):  
Vijay Kumar ◽  
Julie Rosenbaum ◽  
Zihua Wang ◽  
Talitha Forcier ◽  
Michael Ronemus ◽  
...  

ABSTRACTWe introduce a new protocol, mutational sequencing or muSeq, which randomly deaminates unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling a 135,000 fragment PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone.


Genes ◽  
2020 ◽  
Vol 11 (2) ◽  
pp. 141 ◽  
Author(s):  
Feichen Shen ◽  
Jeffrey M. Kidd

Gene duplication is a major mechanism for the evolution of gene novelty, and copy-number variation makes a major contribution to inter-individual genetic diversity. However, most approaches for studying copy-number variation rely upon uniquely mapping reads to a genome reference and are unable to distinguish among duplicated sequences. Specialized approaches to interrogate specific paralogs are comparatively slow and have a high degree of computational complexity, limiting their effective application to emerging population-scale data sets. We present QuicK-mer2, a self-contained, mapping-free approach that enables the rapid construction of paralog-specific copy-number maps from short-read sequence data. This approach is based on the tabulation of unique k-mer sequences from short-read data sets, and is able to analyze a 20X coverage human genome in approximately 20 min. We applied our approach to newly released sequence data from the 1000 Genomes Project, constructed paralog-specific copy-number maps from 2457 unrelated individuals, and uncovered copy-number variation of paralogous genes. We identify nine genes where none of the analyzed samples have a copy number of two, 92 genes where the majority of samples have a copy number other than two, and describe rare copy number variation effecting multiple genes at the APOBEC3 locus.


2020 ◽  
Vol 22 (1) ◽  
pp. 40-49 ◽  
Author(s):  
Salla Välipakka ◽  
Marco Savarese ◽  
Lydia Sagath ◽  
Meharji Arumilli ◽  
Teresa Giugliano ◽  
...  

2016 ◽  
Vol 27 ◽  
pp. vi402
Author(s):  
J.D. Haimes ◽  
J. Covino ◽  
N. Manoj ◽  
E. Baravik ◽  
L. Johnson ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document