ReALLEN: structural variation discovery in cancer genome by sensitive analysis of single-end reads

Mapping Intimacies ◽

10.1101/506329 ◽

2018 ◽

Author(s):

Ryo Kanno ◽

Daisuke Tanaka ◽

Hideaki Nanamiya ◽

Takao Isogai

Keyword(s):

Next Generation Sequencing ◽

Cancer Genomics ◽

Sequence Data ◽

Simulated Dataset ◽

Ion Torrent ◽

Next Generation ◽

Structural Variations ◽

Large Rearrangement ◽

Real Dataset ◽

Generation Sequencing

AbstractBackgroundThe structural abnormalities in chromosomes are important issues in cancer genomics. Next generation sequencing technologies have big potentials to detect the structural variations precisely and comprehensively. Nevertheless, it is still difficult problem to detect large structural variations from short read sequence data. Major efforts have been achieved with paired-end reads, since discordant pairs directly reflect the existence of large rearrangement. Furthermore, approaches to detect structural variations from single-end reads are still worthwhile challenge because they allow wide choices of sequencing platforms.ResultWe present ReALLEN, a series of tools to detect genomic rearrangement with base-pair resolution from single-end reads provided by next generation sequencing. We examined the performance of ReALLEN using simulated dataset and real dataset sequenced by Ion Torrent systems. In most cases on the simulated dataset, ReALLEN showed nearly 100% precision and better sensitivity than other major tools. Notably, ReALLEN showed stable scores even if it was on some unfavorable conditions, for example, low coverage or small variant size. On the real dataset sequenced by Ion Torrent systems, ReALLEN accurately found an insertional translocation that was crucial for the diagnosis of chronic myeloid leukemia.ConclusionReALLEN is useful to researchers in finding genomic rearrangements. It will contribute to discovery of cancer-specific fusion proteins, precise diagnosis of known types of cancers, and understanding of genetic diseases caused by abnormal chromosomes.

Evaluation of Ion Torrent next-generation sequencing for thalassemia diagnosis

Journal of International Medical Research ◽

10.1177/0300060520967778 ◽

2020 ◽

Vol 48 (12) ◽

pp. 030006052096777

Author(s):

Peisong Chen ◽

Xuegao Yu ◽

Hao Huang ◽

Wentao Zeng ◽

Xiaohong He ◽

...

Keyword(s):

Next Generation Sequencing ◽

Ion Torrent ◽

Next Generation ◽

Large Deletions ◽

Different Types ◽

Variant Detection ◽

Polymerase Chain ◽

Next Generation Sequencing Ngs ◽

Accuracy And Repeatability ◽

Generation Sequencing

Introduction To evaluate a next-generation sequencing (NGS) workflow in the screening and diagnosis of thalassemia. Methods In this prospective study, blood samples were obtained from people undergoing genetic screening for thalassemia at our centre in Guangzhou, China. Genomic DNA was polymerase chain reaction (PCR)-amplified and sequenced using the Ion Torrent system and results compared with traditional genetic analyses. Results Of the 359 subjects, 148 (41%) were confirmed to have thalassemia. Variant detection identified 35 different types including the most common. Identification of the mutational sites by NGS were consistent with those identified by Sanger sequencing and Gap-PCR. The sensitivity and specificities of the Ion Torrent NGS were 100%. In a separate test of 16 samples, results were consistent when repeated ten times. Conclusion Our NGS workflow based on the Ion Torrent sequencer was successful in the detection of large deletions and non-deletional defects in thalassemia with high accuracy and repeatability.

Next-Generation Sequencing for Cancer Genomics

Next Generation Sequencing in Cancer Research ◽

10.1007/978-1-4614-7645-0_3 ◽

2013 ◽

pp. 55-74

Author(s):

Aarti N. Desai ◽

Abhay Jere

Keyword(s):

Next Generation Sequencing ◽

Cancer Genomics ◽

Next Generation ◽

Generation Sequencing

Validation of variants using cost effective highresolution melting (HRM) analysis predicted from target re-sequencing in Eucalyptus

Acta Botanica Croatica ◽

10.37427/botcro-2020-019 ◽

2020 ◽

Vol 79 (2) ◽

pp. 105-113

Author(s):

Abdul Bari Muneera Parveen ◽

Divya Lakshmanan ◽

Modhumita Ghosh Dasgupta

Keyword(s):

Next Generation Sequencing ◽

Large Scale ◽

Sequence Data ◽

Cost Effective ◽

Nucleotide Polymorphisms ◽

Next Generation ◽

Time Saving ◽

Hrm Analysis ◽

The Cost ◽

Generation Sequencing

The advent of next-generation sequencing has facilitated large-scale discovery and mapping of genomic variants for high-throughput genotyping. Several research groups working in tree species are presently employing next generation sequencing (NGS) platforms for marker discovery, since it is a cost effective and time saving strategy. However, most trees lack a chromosome level genome map and validation of variants for downstream application becomes obligatory. The cost associated with identifying potential variants from the enormous amount of sequence data is a major limitation. In the present study, high resolution melting (HRM) analysis was optimized for rapid validation of single nucleotide polymorphisms (SNPs), insertions or deletions (InDels) and simple sequence repeats (SSRs) predicted from exome sequencing of parents and hybrids of Eucalyptus tereticornis Sm. ? Eucalyptus grandis Hill ex Maiden generated from controlled hybridization. The cost per data point was less than 0.5 USD, providing great flexibility in terms of cost and sensitivity, when compared to other validation methods. The sensitivity of this technology in variant detection can be extended to other applications including Bar-HRM for species authentication and TILLING for detection of mutants.

Next-generation sequencing of BRCA1 and BRCA2 genes for rapid detection of germline mutations in hereditary breast/ovarian cancer

PeerJ ◽

10.7717/peerj.6661 ◽

2019 ◽

Vol 7 ◽

pp. e6661 ◽

Cited By ~ 7

Author(s):

Arianna Nicolussi ◽

Francesca Belardinilli ◽

Yasaman Mahdavian ◽

Valeria Colicchia ◽

Sonia D’Inzeo ◽

...

Keyword(s):

Next Generation Sequencing ◽

Sanger Sequencing ◽

Germline Mutations ◽

Ion Torrent ◽

Next Generation ◽

Training Set ◽

Brca1 And Brca2 ◽

Ion Torrent Pgm ◽

Validation Set ◽

Generation Sequencing

Background Conventional methods used to identify BRCA1 and BRCA2 germline mutations in hereditary cancers, such as Sanger sequencing/multiplex ligation-dependent probe amplification (MLPA), are time-consuming and expensive, due to the large size of the genes. The recent introduction of next-generation sequencing (NGS) benchtop platforms offered a powerful alternative for mutation detection, dramatically improving the speed and the efficiency of DNA testing. Here we tested the performance of the Ion Torrent PGM platform with the Ion AmpliSeq BRCA1 and BRCA2 Panel in our clinical routine of breast/ovarian hereditary cancer syndrome assessment. Methods We first tested the NGS approach in a cohort of 11 patients (training set) who had previously undergone genetic diagnosis in our laboratory by conventional methods. Then, we applied the optimized pipeline to the consecutive cohort of 136 uncharacterized probands (validation set). Results By minimal adjustments in the analytical pipeline of Torrent Suite Software we obtained a 100% concordance with Sanger results regarding the identification of single nucleotide alterations, insertions, and deletions with the exception of three large genomic rearrangements (LGRs) contained in the training set. The optimized pipeline applied to the validation set (VS), identified pathogenic and polymorphic variants, including a novel BRCA2 pathogenic variant at exon 3, 100% of which were confirmed by Sanger in their correct zygosity status. To identify LGRs, all negative samples of the VS were subjected to MLPA analysis. Discussion Our experience strongly supports that the Ion Torrent PGM technology in BRCA1 and BRCA2 germline variant identification, combined with MLPA analysis, is highly sensitive, easy to use, faster, and cheaper than traditional (Sanger sequencing/MLPA) approaches.

Metagenomic analysis using next-generation sequencing of pathogens in bronchoalveolar lavage fluid from pediatric patients with respiratory failure

Scientific Reports ◽

10.1038/s41598-019-49372-x ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 7

Author(s):

Suguru Takeuchi ◽

Jun-ichi Kawada ◽

Kazuhiro Horiba ◽

Yusuke Okuno ◽

Toshihiko Okumura ◽

...

Keyword(s):

Respiratory Failure ◽

Next Generation Sequencing ◽

Bronchoalveolar Lavage ◽

Bronchoalveolar Lavage Fluid ◽

Pediatric Patients ◽

Respiratory Infections ◽

Sequence Data ◽

Lavage Fluid ◽

Next Generation ◽

Generation Sequencing

Abstract Next-generation sequencing (NGS) has been applied in the field of infectious diseases. Bronchoalveolar lavage fluid (BALF) is considered a sterile type of specimen that is suitable for detecting pathogens of respiratory infections. The aim of this study was to comprehensively identify causative pathogens using NGS in BALF samples from immunocompetent pediatric patients with respiratory failure. Ten patients hospitalized with respiratory failure were included. BALF samples obtained in the acute phase were used to prepare DNA- and RNA-sequencing libraries. The libraries were sequenced on MiSeq, and the sequence data were analyzed using metagenome analysis tools. A mean of 2,041,216 total reads were sequenced for each library. Significant bacterial or viral sequencing reads were detected in eight of the 10 patients. Furthermore, candidate pathogens were detected in three patients in whom etiologic agents were not identified by conventional methods. The complete genome of enterovirus D68 was identified in two patients, and phylogenetic analysis suggested that both strains belong to subclade B3, which is an epidemic strain that has spread worldwide in recent years. Our results suggest that NGS can be applied for comprehensive molecular diagnostics as well as surveillance of pathogens in BALF from patients with respiratory infection.

A Model Study of In Silico Proficiency Testing for Clinical Next-Generation Sequencing

Archives of Pathology & Laboratory Medicine ◽

10.5858/arpa.2016-0194-cp ◽

2016 ◽

Vol 140 (10) ◽

pp. 1085-1091 ◽

Cited By ~ 21

Author(s):

Eric J. Duncavage ◽

Haley J. Abel ◽

Jason D. Merker ◽

John B. Bodner ◽

Qin Zhao ◽

...

Keyword(s):

Next Generation Sequencing ◽

Proficiency Testing ◽

In Silico ◽

Absolute Difference ◽

Ion Torrent ◽

Next Generation ◽

Single Nucleotide Variants ◽

Single Nucleotide ◽

Clinical Laboratories ◽

Generation Sequencing

Context.—Most current proficiency testing challenges for next-generation sequencing assays are methods-based proficiency testing surveys that use DNA from characterized reference samples to test both the wet-bench and bioinformatics/dry-bench aspects of the tests. Methods-based proficiency testing surveys are limited by the number and types of mutations that either are naturally present or can be introduced into a single DNA sample. Objective.—To address these limitations by exploring a model of in silico proficiency testing in which sequence data from a single well-characterized specimen are manipulated electronically. Design.—DNA from the College of American Pathologists reference genome was enriched using the Illumina TruSeq and Life Technologies AmpliSeq panels and sequenced on the MiSeq and Ion Torrent platforms, respectively. The resulting data were mutagenized in silico and 26 variants, including single-nucleotide variants, deletions, and dinucleotide substitutions, were added at variant allele fractions (VAFs) from 10% to 50%. Participating clinical laboratories downloaded these files and analyzed them using their clinical bioinformatics pipelines. Results.—Laboratories using the AmpliSeq/Ion Torrent and/or the TruSeq/MiSeq participated in the 2 surveys. On average, laboratories identified 24.6 of 26 variants (95%) overall and 21.4 of 22 variants (97%) with VAFs greater than 15%. No false-positive calls were reported. The most frequently missed variants were single-nucleotide variants with VAFs less than 15%. Across both challenges, reported VAF concordance was excellent, with less than 1% median absolute difference between the simulated VAF and mean reported VAF. Conclusions.—The results indicate that in silico proficiency testing is a feasible approach for methods-based proficiency testing, and demonstrate that the sensitivity and specificity of current next-generation sequencing bioinformatics across clinical laboratories are high.

Comparison of Normalization Methods for Construction of Large, Multiplex Amplicon Pools for Next-Generation Sequencing

Applied and Environmental Microbiology ◽

10.1128/aem.02585-09 ◽

2010 ◽

Vol 76 (12) ◽

pp. 3863-3868 ◽

Cited By ~ 48

Author(s):

J. Kirk Harris ◽

Jason W. Sahl ◽

Todd A. Castoe ◽

Brandie D. Wagner ◽

David D. Pollock ◽

...

Keyword(s):

Next Generation Sequencing ◽

Massively Parallel Sequencing ◽

Sequence Data ◽

Cost Savings ◽

Massively Parallel ◽

Next Generation ◽

Normalization Methods ◽

The Cost ◽

Generation Sequencing

ABSTRACT Constructing mixtures of tagged or bar-coded DNAs for sequencing is an important requirement for the efficient use of next-generation sequencers in applications where limited sequence data are required per sample. There are many applications in which next-generation sequencing can be used effectively to sequence large mixed samples; an example is the characterization of microbial communities where ≤1,000 sequences per samples are adequate to address research questions. Thus, it is possible to examine hundreds to thousands of samples per run on massively parallel next-generation sequencers. However, the cost savings for efficient utilization of sequence capacity is realized only if the production and management costs associated with construction of multiplex pools are also scalable. One critical step in multiplex pool construction is the normalization process, whereby equimolar amounts of each amplicon are mixed. Here we compare three approaches (spectroscopy, size-restricted spectroscopy, and quantitative binding) for normalization of large, multiplex amplicon pools for performance and efficiency. We found that the quantitative binding approach was superior and represents an efficient scalable process for construction of very large, multiplex pools with hundreds and perhaps thousands of individual amplicons included. We demonstrate the increased sequence diversity identified with higher throughput. Massively parallel sequencing can dramatically accelerate microbial ecology studies by allowing appropriate replication of sequence acquisition to account for temporal and spatial variations. Further, population studies to examine genetic variation, which require even lower levels of sequencing, should be possible where thousands of individual bar-coded amplicons are examined in parallel.

Next-generation sequencing and systematics: What can a billion base pairs of DNA sequence data do for you?

Taxon ◽

10.1002/tax.606002 ◽

2011 ◽

Vol 60 (6) ◽

pp. 1552-1566 ◽

Cited By ~ 49

Author(s):

Nicola Harrison ◽

Catherine Anne Kidner

Keyword(s):

Next Generation Sequencing ◽

Dna Sequence ◽

Sequence Data ◽

Next Generation ◽

Base Pairs ◽

Dna Sequence Data ◽

Generation Sequencing

Enabling Precision Oncology Through Precision Diagnostics

Annual Review of Pathology Mechanisms of Disease ◽

10.1146/annurev-pathmechdis-012418-012735 ◽

2020 ◽

Vol 15 (1) ◽

pp. 97-121 ◽

Cited By ~ 4

Author(s):

Noah A. Brown ◽

Kojo S.J. Elenitoba-Johnson

Keyword(s):

Next Generation Sequencing ◽

Cancer Patients ◽

Cancer Genomics ◽

Cost Effective ◽

Routine Clinical Practice ◽

Precision Oncology ◽

Next Generation ◽

Sequencing Data ◽

Molecular Alterations ◽

Generation Sequencing

Genomic testing enables clinical management to be tailored to individual cancer patients based on the molecular alterations present within cancer cells. Genomic sequencing results can be applied to detect and classify cancer, predict prognosis, and target therapies. Next-generation sequencing has revolutionized the field of cancer genomics by enabling rapid and cost-effective sequencing of large portions of the genome. With this technology, precision oncology is quickly becoming a realized paradigm for managing the treatment of cancer patients. However, many challenges must be overcome to efficiently implement the transition of next-generation sequencing from research applications to routine clinical practice, including using specimens commonly available in the clinical setting; determining how to process, store, and manage large amounts of sequencing data; determining how to interpret and prioritize molecular findings; and coordinating health professionals from multiple disciplines.

Next-generation sequencing in metastatic lung cancer patients: Translation from sequence data into clinical practice.

Journal of Clinical Oncology ◽

10.1200/jco.2015.33.15_suppl.e12521 ◽

2015 ◽

Vol 33 (15_suppl) ◽

pp. e12521-e12521

Author(s):

Jessica Ribeiro Gomes ◽

Raphael Brandao Moreira ◽

Renata D'Alpino D'Alpino ◽

Marcelo Rocha S Cruz ◽

Tercia Tarciane Soares de Sousa ◽

...

Keyword(s):

Lung Cancer ◽

Clinical Practice ◽

Next Generation Sequencing ◽

Cancer Patients ◽

Sequence Data ◽

Metastatic Lung Cancer ◽

Next Generation ◽

Lung Cancer Patients ◽

Metastatic Lung ◽

Generation Sequencing