Haplotype phasing of whole human genomes using bead-based barcode partitioning in a single tube

2017 ◽  
Vol 35 (9) ◽  
pp. 852-857 ◽  
Author(s):  
Fan Zhang ◽  
Lena Christiansen ◽  
Jerushah Thomas ◽  
Dmitry Pokholok ◽  
Ros Jackson ◽  
...  
2019 ◽  
Author(s):  
Zhoutao Chen ◽  
Long Pham ◽  
Tsai-Chin Wu ◽  
Guoya Mo ◽  
Yu Xia ◽  
...  

Abstract: Long-range sequencing information is required for haplotype phasing, de novo assembly and structural variation detection. Current long-read sequencing technologies can provide valuable long-range information, but at a high cost, with low accuracy and a high DNA input requirement. We have developed a single-tube Transposase Enzyme Linked Long-read Sequencing (TELL-Seq™) technology, which enables a low-cost, high-accuracy and high-throughput short-read next generation sequencer to routinely generate over 100 Kb long-range sequencing information with as little as 0.1 ng input material. In a PCR tube, millions of clonally barcoded beads are used to uniquely barcode long DNA molecules in an open bulk reaction without dilution and compartmentation. The barcode linked reads are used to successfully assemble genomes ranging from microbes to human. These linked-reads also generate mega-base-long phased blocks and provide a cost-effective tool for detecting structural variants in a genome, which are important for identifying compound heterozygosity in recessive Mendelian diseases and for discovering genetic drivers and diagnostic biomarkers in cancers.
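The idea of barcode linked reads can be illustrated with a minimal sketch. It is not the TELL-Seq analysis pipeline: the function name `group_linked_reads`, the `(barcode, chrom, pos)` input tuples and the `max_gap` threshold are all assumptions made for illustration. It groups aligned short reads that share a barcode and splits them into inferred source molecules wherever neighbouring reads lie far apart.

```python
from collections import defaultdict


def group_linked_reads(reads, max_gap=50_000):
    """Group aligned reads by barcode and infer source-molecule spans.

    reads   -- iterable of (barcode, chrom, pos) tuples (hypothetical input format)
    max_gap -- assumed distance in bp beyond which two reads with the same
               barcode are treated as coming from different molecules

    Returns {barcode: [(chrom, start, end, n_reads), ...]}.
    """
    positions_by_key = defaultdict(list)
    for barcode, chrom, pos in reads:
        positions_by_key[(barcode, chrom)].append(pos)

    molecules = defaultdict(list)
    for (barcode, chrom), positions in positions_by_key.items():
        positions.sort()
        start = prev = positions[0]
        count = 1
        for pos in positions[1:]:
            if pos - prev > max_gap:
                # Gap too large: close the current molecule and start a new one.
                molecules[barcode].append((chrom, start, prev, count))
                start, count = pos, 0
            prev = pos
            count += 1
        molecules[barcode].append((chrom, start, prev, count))
    return dict(molecules)


example_reads = [
    ("A", "chr1", 1_000),
    ("A", "chr1", 20_000),
    ("A", "chr1", 500_000),
    ("B", "chr1", 1_500),
]
example_molecules = group_linked_reads(example_reads)
```

In this toy input, barcode "A" yields two inferred molecules because its third read lies far beyond the assumed gap threshold. Real pipelines also have to handle barcode errors and mapping quality, which this sketch ignores.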


2015 ◽  
Author(s):  
James A Stapleton ◽  
Jeongwoon Kim ◽  
John P Hamilton ◽  
Ming Wu ◽  
Luiz C Irber ◽  
...  

Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual nucleic acid molecules up to 11.6 kilobases in length from short (150-bp) reads. We show that our method can construct 99.97%-accurate synthetic reads from bacterial, plant, and animal genomic samples, full-length mRNA sequences from human cancer cell lines, and individual HIV env gene variants from a mixture. The preparation of multiple samples can be multiplexed into a single tube, further reducing effort and cost relative to competing approaches. Our approach generates sequencing libraries in three days from less than one microgram of DNA in a single-tube format without custom equipment or specialized expertise.


2001 ◽  
Vol 6 (2) ◽  
pp. 131-136 ◽  
Author(s):  
MARIA SOMODEVILLA-TORRES ◽  
PETER TIMMS ◽  
RAY HARRIS ◽  
C. PHILLIP MORRIS ◽  
ANGELA VAN DAAL

Author(s):  
Seyoung Mun ◽  
Songmi Kim ◽  
Wooseok Lee ◽  
Keunsoo Kang ◽  
Thomas J. Meyer ◽  
...  

Abstract: Advances in next-generation sequencing (NGS) technology have made personal genome sequencing possible, and indeed, many individual human genomes have now been sequenced. Comparisons of these individual genomes have revealed substantial genomic differences between human populations as well as between individuals from closely related ethnic groups. Transposable elements (TEs) are known to be one of the major sources of these variations and act through various mechanisms, including de novo insertion, insertion-mediated deletion, and TE–TE recombination-mediated deletion. In this study, we carried out de novo whole-genome sequencing of one Korean individual (KPGP9) via multiple insert-size libraries. The de novo whole-genome assembly resulted in 31,305 scaffolds with a scaffold N50 size of 13.23 Mb. Furthermore, through computational data analysis and experimental verification, we revealed that 182 TE-associated structural variation (TASV) insertions and 89 TASV deletions contributed 64,232 bp in sequence gain and 82,772 bp in sequence loss, respectively, in the KPGP9 genome relative to the hg19 reference genome. We also verified structural differences associated with TASVs by comparative analysis with TASVs in recent genomes (AK1 and TCGA genomes) and reported their details. Here, we constructed a new Korean de novo whole-genome assembly and provide the first study, to our knowledge, focused on the identification of TASVs in an individual Korean genome. Our findings again highlight the role of TEs as a major driver of structural variations in human individual genomes.
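For readers unfamiliar with the scaffold N50 statistic quoted above, the following minimal sketch shows the standard definition: the length of the scaffold at which the cumulative length, counted from the longest scaffold down, first reaches half of the total assembly length. The function name `n50` and the toy lengths are illustrative and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first scaffold that pushes the running sum to half the total
        # defines the N50.
        if running >= half_total:
            return length


toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]  # made-up values, total = 54
toy_n50 = n50(toy_lengths)  # 10 + 9 + 8 = 27 reaches half of 54, so N50 is 8
```

A larger N50 means that more of the assembly sits in long scaffolds, which is why it is commonly reported alongside the scaffold count.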


2020 ◽  
Author(s):  
Fangyan Yu ◽  
Ka Wai Leong ◽  
Alexander Makrigiorgos ◽  
Viktor A Adalsteinsson ◽  
Ioannis Ladas ◽  
...  

Abstract: Sensitive detection of microsatellite instability (MSI) in tissue or liquid biopsies using next generation sequencing (NGS) has growing prognostic and predictive applications in cancer. However, the complexities of NGS make it cumbersome compared with established multiplex-PCR detection of MSI. We present a new approach to detect MSI using inter-Alu-PCR followed by targeted NGS that combines the practical advantages of multiplexed-PCR with the breadth of information provided by NGS. Inter-Alu-PCR employs poly-adenine repeats of variable length present in every Alu element and provides a massively-parallel, rapid approach to capture poly-A-rich genomic fractions within short 80–150 bp amplicons generated from adjacent Alu-sequences. A custom-made software analysis tool, MSI-tracer, enables Alu-associated MSI detection from tissue biopsies or MSI-tracing at low levels in circulating-DNA. MSI-associated indels at somatic-indel frequencies of 0.05–1.5% can be detected depending on the availability of matching normal tissue and the extent of instability. Due to the high Alu copy-number in human genomes, a single inter-Alu-PCR retrieves enough information for identification of MSI-associated-indels from ∼100 pg circulating-DNA, reducing current limits by ∼2 orders of magnitude and equivalent to circulating-DNA obtained from finger-sticks. The combined practical and informational advantages of inter-Alu-PCR make it a powerful tool for identifying tissue-MSI-status or tracing MSI-associated-indels in liquid biopsies.

