Insights on early mutational events in SARS-CoV-2 virus reveal founder effects across geographical regions

Here we aim to describe early mutational events across samples from publicly available SARS-CoV-2 sequences from the sequence read archive and GenBank repositories. Up until 27 March 2020, we downloaded 50 illumina datasets, mostly from China, USA (WA State) and Australia (VIC). A total of 30 datasets (60%) contain at least a single founder mutation and most of the variants are missense (over 63%). Five-point mutations with clonal (founder) effect were found in USA next-generation sequencing samples. Sequencing samples from North America in GenBank (22 April 2020) present this signature with up to 39% allele frequencies among samples (n = 1,359). Australian variant signatures were more diverse than USA samples, but still, clonal events were found in these samples. Mutations in the helicase, encoded by the ORF1ab gene in SARS-CoV-2 were predominant, among others, suggesting that these regions are actively evolving. Finally, we firmly urge that primer sets for diagnosis be carefully designed, since rapidly occurring variants would affect the performance of the reverse transcribed quantitative PCR (RT-qPCR) based viral testing.

Download Full-text

Insights on early mutational events in SARS-CoV-2 virus reveal founder effects across geographical regions

10.1101/2020.04.09.034462 ◽

2020 ◽

Cited By ~ 1

Author(s):

Carlos Farkas ◽

Francisco Fuentes-Villalobos ◽

José Luis Garrido ◽

Jody J Haigh ◽

María Inés Barría

Keyword(s):

Founder Mutation ◽

Point Mutations ◽

Washington Dc ◽

Founder Effects ◽

High Quality ◽

Coding Regions ◽

Sequence Read Archive ◽

Geographical Regions ◽

Viral Testing ◽

Primer Sets

AbstractHere we aim to describe early mutational events across samples from publicly available SARS-CoV-2 sequences from the sequence read archive repository. Up until March 27, 2020, we downloaded 53 illumina datasets, mostly from China, USA (Washington DC) and Australia (Victoria). Of 30 high quality datasets, 27 datasets (90%) contain at least a single founder mutation and most of the variants are missense (over 63%). Five-point mutations with clonal (founder) effect were found in USA sequencing samples. Sequencing samples from USA in GenBank present this signature with 50% allele frequencies among samples. Australian mutation signatures were more diverse than USA samples, but still, clonal events were found in those samples. Mutations in the helicase and orf1a coding regions from SARS-CoV-2 were predominant, among others, suggesting that these proteins are prone to evolve by natural selection. Finally, we firmly urge that primer sets for diagnosis be carefully designed, since rapidly occurring variants would affect the performance of the reverse transcribed quantitative PCR (RT-qPCR) based viral testing.

Download Full-text

Next-Generation Sequencing Technology Reveals a Characteristic Pattern of Molecular Mutations in 75% of Chronic Myelomonocytic Leukemia (CMML) by Detecting Frequent Alterations in TET2, RUNX1, CBL, and RAS.

Blood ◽

10.1182/blood.v114.22.417.417 ◽

2009 ◽

Vol 114 (22) ◽

pp. 417-417 ◽

Cited By ~ 1

Author(s):

Alexander Kohlmann ◽

Vera Grossmann ◽

Claudia Haferlach ◽

Beray Kazak ◽

Sonja Schindela ◽

...

Keyword(s):

Next Generation Sequencing ◽

Mutation Screening ◽

Point Mutations ◽

Chronic Myelomonocytic Leukemia ◽

Melting Curve Analysis ◽

Next Generation ◽

Monosomy 7 ◽

Equity Ownership ◽

Myelomonocytic Leukemia ◽

Generation Sequencing

Abstract Abstract 417 Chronic myelomonocytic leukemia (CMML) is a clonal hematopoietic malignancy that is characterized by features of both a myeloproliferative neoplasm and a myelodysplastic syndrome. Here, we analyzed 81 CMML cases (45 CMML-1, 36 CMML-2). In chromosome banding analysis 59/76 (77.6%) patients showed a normal karyotype (data not availabel in 5 cases). Recurrent chromosome aberrations were trisomy 8 (n=6; 7.9%), monosomy 7 (n=3; 3.9%), and loss of the Y-chromosome (n=5; 6.6%). Fluorescence in situ hybridization (FISH) detected the deletion of one allele of the TET2 gene in 4/71 cases (5.6%). Thus, the majority of cases can not be genetically characterized by these techniques. Therefore, we applied next-generation sequencing (NGS) technology to investigate 7 candidate genes, represented by 43 PCR-products, at known mutational hotspot regions, i.e. CBL (exons 8 and 9), JAK2 (exons 12 and 14), MPL (exon 10), NRAS (exons 2 and 3), and KRAS (exons 2 and 3). In addition, complete coding regions were analyzed for RUNX1 (beta isoform) and TET2. NGS was performed using 454 FLX amplicon chemistry (Roche Diagnostics Corporation, Branford, CT). The median number of base pairs sequenced per patient was 9.24 Mb. For each target gene a median of 911 reads was generated (coverage range: 736-fold to 1606-fold). This approach allowed a high-sensitive detection of molecular mutations, e.g. detecting the JAK2 V617F mutation down to 1.16% of reads. In total, 146 variances were detected by this comprehensive molecular mutation screening (GS Amplicon Variant Analyzer software version 2.0.01). In 80.4% of variances consistent results were obtained after confirming NGS mutations with melting curve analysis and conventional sequencing. In the remaining discrepant variances (19.6%) NGS deep-sequencing outperformed conventional methods due to the higher sensitivity of the platform. After excluding 19 polymorphisms or silent mutations 127 distinct mutations in 61/81 patients (75.3%) were detected: CBL: n=21 point mutations and one deletion (18 bp) found in 20 cases (24%); JAK2: n=8 mutations (V617F) found in 8 cases (9.8%); MPL: no mutations found; NRAS: n=23 mutations found in 18 cases (22.2%); KRAS: n=12 mutations found in 10 cases (12.3%); RUNX1: n=6 point mutations and one deletion (14 bp) found in 7 cases (8.6%); and TET2: n=49 point mutations and 6 deletions (2-19 bp; 5/6 out-of-frame) found in 41 cases (50.6%). Furthermore, in 21 TET2-mutated cases 11 mutations previously described in the literature were detectable, whereas 28 cases carried novel mutations (n=28). In the cohort of TET2-mutated cases 17/41 (41.3%) patients harbored TET2 abnormalities as sole aberration. Interestingly, CBL mutations were found to be significantly associated with TET2 mutations (Fisher's exact test, p=0.008). In 17 of 20 (85.0%) CBL-mutated cases TET2 abnormalities were concomitantly observed. In contrast, no significant associations were found between any of the point mutations or deletions and the karyotype. There were also no associations observed between molecular aberrations and the diagnostic categories CMML-1 and CMML-2. With respect to clinical data a trend for better outcome was seen for patients that carried either or both TET2 and CBL mutations (median OS 130.4 vs. 17.3 months, alive at 2 yrs: 72.0% vs. 43.9%; p=0.13). In conclusion, 75.3% of CMMLs harbored at least one molecular aberration. In median 2 mutations per case were observed. Compared to limited data from the literature we detected not only a higher frequency of CBL mutations, but also add data on novel TET2 mutations. In particular, comprehensive NGS screening here for the first time has demonstrated its strength to further genetically characterize and delineate prognostic groups within this type of hematological malignancy. Disclosures: Kohlmann: MLL Munich Leukemia Laboratory: Employment. Grossmann:MLL Munich Leukemia Laboratory: Employment. Haferlach:MLL Munich Leukemia Laboratory: Equity Ownership. Kazak:MLL Munich Leukemia Laboratory: Employment. Schindela:MLL Munich Leukemia Laboratory: Employment. Weiss:MLL Munich Leukemia Laboratory: Employment. Dicker:MLL Munich Leukemia Laboratory: Employment. Schnittger:MLL Munich Leukemia Laboratory: Equity Ownership. Kern:MLL Munich Leukemia Laboratory: Equity Ownership. Haferlach:MLL Munich Leukemia Laboratory: Equity Ownership.

Download Full-text

Maturity onset diabetes of youth (MODY) in Turkish children: sequence analysis of 11 causative genes by next generation sequencing

Journal of Pediatric Endocrinology and Metabolism ◽

10.1515/jpem-2015-0039 ◽

2016 ◽

Vol 29 (4) ◽

Cited By ~ 8

Author(s):

Sebahat Yılmaz Ağladıoğlu ◽

Zehra Aycan ◽

Semra Çetinkaya ◽

Veysel Nijat Baş ◽

Aşan Önder ◽

...

Keyword(s):

Type 2 Diabetes ◽

Next Generation Sequencing ◽

Point Mutations ◽

Maturity Onset Diabetes ◽

Turkish Children ◽

Onset Diabetes ◽

Molecular Studies ◽

Generation Sequencing

AbstractMaturity-onset diabetes of the youth (MODY), is a genetically and clinically heterogeneous group of diseasesand is often misdiagnosed as type 1 or type 2 diabetes. The aim of this study is to investigate both novel and proven mutations of 11A panel of 11We identified 28 (65%) point mutations among 43 patients. Eighteen patients haveThis is the first study including molecular studies of 11

Download Full-text

pysradb: A Python package to query next-generation sequencing metadata and data from NCBI Sequence Read Archive

F1000Research ◽

10.12688/f1000research.18676.1 ◽

2019 ◽

Vol 8 ◽

pp. 532 ◽

Cited By ~ 2

Author(s):

Saket Choudhary

Keyword(s):

Next Generation Sequencing ◽

Research Community ◽

Command Line ◽

Next Generation ◽

Multiple Use ◽

Sequencing Data ◽

Sequence Read Archive ◽

Python Package ◽

Generation Sequencing ◽

Ncbi Sequence Read Archive

The NCBI Sequence Read Archive (SRA) is the primary archive of next-generation sequencing datasets. SRA makes metadata and raw sequencing data available to the research community to encourage reproducibility and to provide avenues for testing novel hypotheses on publicly available data. However, methods to programmatically access this data are limited. We introduce the Python package, pysradb, which provides a collection of command line methods to query and download metadata and data from SRA, utilizing the curated metadata database available through the SRAdb project. We demonstrate the utility of pysradb on multiple use cases for searching and downloading SRA datasets. It is available freely at https://github.com/saketkc/pysradb.

Download Full-text

Next Generation Sequencing Based Forward Genetic Approaches for Identification and Mapping of Causal Mutations in Crop Plants: A Comprehensive Review

Plants ◽

10.3390/plants9101355 ◽

2020 ◽

Vol 9 (10) ◽

pp. 1355

Author(s):

Parmeshwar K. Sahu ◽

Richa Sao ◽

Suvendu Mondal ◽

Gautam Vishwakarma ◽

Sudhir Kumar Gupta ◽

...

Keyword(s):

Next Generation Sequencing ◽

Crop Improvement ◽

Point Mutations ◽

Crop Plants ◽

Forward Genetics ◽

Next Generation ◽

Bioinformatics Tools ◽

Short Period ◽

Ngs Data ◽

Generation Sequencing

The recent advancements in forward genetics have expanded the applications of mutation techniques in advanced genetics and genomics, ahead of direct use in breeding programs. The advent of next-generation sequencing (NGS) has enabled easy identification and mapping of causal mutations within a short period and at relatively low cost. Identifying the genetic mutations and genes that underlie phenotypic changes is essential for understanding a wide variety of biological functions. To accelerate the mutation mapping for crop improvement, several high-throughput and novel NGS based forward genetic approaches have been developed and applied in various crops. These techniques are highly efficient in crop plants, as it is relatively easy to grow and screen thousands of individuals. These approaches have improved the resolution in quantitative trait loci (QTL) position/point mutations and assisted in determining the functional causative variations in genes. To be successful in the interpretation of NGS data, bioinformatics computational methods are critical elements in delivering accurate assembly, alignment, and variant detection. Numerous bioinformatics tools/pipelines have been developed for such analysis. This article intends to review the recent advances in NGS based forward genetic approaches to identify and map the causal mutations in the crop genomes. The article also highlights the available bioinformatics tools/pipelines for reducing the complexity of NGS data and delivering the concluding outcomes.

Download Full-text

Comparison of genetic alterations from the bladder cancer genome atlas (TCGA) and a prospective set of high-grade urothelial carcinoma tumors using a CLIA laboratory next generation sequencing assay.

Journal of Clinical Oncology ◽

10.1200/jco.2015.33.7_suppl.298 ◽

2015 ◽

Vol 33 (7_suppl) ◽

pp. 298-298 ◽

Cited By ~ 1

Author(s):

Gopa Iyer ◽

Aditya Bagrodia ◽

Eugene K. Cha ◽

Mariel Elena Boyd ◽

Ahmet Zehir ◽

...

Keyword(s):

Next Generation Sequencing ◽

Point Mutations ◽

Genetic Alterations ◽

Next Generation ◽

Invasive Treatment ◽

Tert Promoter ◽

Divergent Differentiation ◽

Treatment Naïve ◽

Muscle Invasive ◽

Generation Sequencing

298 Background: The type and frequency of recurrent, targetable alterations in muscle-invasive, treatment-naïve urothelial cancer (UC) have been well-catalogued through the bladder TCGA. We sought to validate these findings within a prospective cohort of patients (pts) with progressive UC treated at our institution. This cohort typifies the UC patient population managed by medical oncologists. Methods: Pts with a diagnosis of UC were enrolled onto an IRB-approved protocol between 1/14/14 and 7/29/14, which allowed for sequencing of all exons in 341 oncogenes and tumor suppressor genes using an exon capture next generation sequencing assay (NGS) platform (MSK-IMPACT) in a CLIA lab from tumor and matched germline DNA. Somatic point mutations, truncations, copy number alterations, and insertions/deletions were detected. Results: 49 UC pts were sequenced, 5 (10%) having upper tract disease. 33 (67%) tumors were pure UC histology with the remainder containing divergent differentiation. 17 samples (24%) were metastatic in origin, and 40 samples were analyzed from pts who had received prior chemotherapy. The most common alteration identified was point mutations within the TERT promoter (71%), a region not sequenced by TCGA. Additional alteration frequencies were similar between both sets. FGFR3 mutations overlapped with both PIK3CA and TSC1 alterations and co-alterations were observed between FGFR3 and CDKN2A as well as PIK3CA and CCND1. Conclusions: TERT promoter mutations were found at high frequency in the MSK-IMPACT tumor cohort. The frequency and type of alterations identified in the bladder TCGA are similar to the MSK-IMPACT cohort, suggesting that TCGA data can be used to guide clinical trials in the metastatic, pre-treated population. Overlap of alterations within and across core signaling pathways underscores the need for functional validation of alterations and for rational combinations of targeted therapies to effectively treat advanced UC. [Table: see text]

Download Full-text

Next-Generation Sequencing (NGS) in CMML, MDS and AML Detects Molecular Mutations in Oncogenes and Allows the Identification of Balanced Chromosomal Abnormalities with Extraordinary Sensitivity and Specificity.

Blood ◽

10.1182/blood.v114.22.144.144 ◽

2009 ◽

Vol 114 (22) ◽

pp. 144-144

Author(s):

Vera Grossmann ◽

Alexander Kohlmann ◽

Claudia Haferlach ◽

Hans-Ulrich Klein ◽

Martin Dugas ◽

...

Keyword(s):

Bone Marrow ◽

Next Generation Sequencing ◽

Candidate Genes ◽

Point Mutations ◽

Diagnostic Methods ◽

Next Generation ◽

Coding Regions ◽

Equity Ownership ◽

Next Generation Sequencing Ngs ◽

Generation Sequencing

Abstract Abstract 144 PicoTiterPlate (PTP) pyrosequencing allows the detection of low-abundance oncogene aberrations in complex samples even with low tumor content. Here, we compared deep sequencing data of two Next-Generation Sequencing (NGS) assays to detect molecular mutations using a PCR-based strategy and, in addition, to uncover inversions, translocations, and insertions in a targeted sequence enrichment workflow (454 Life Sciences, Roche Diagnostics Corporation, Branford, CT). First, we studied 95 patients (CMML, n=81; AML, n=6; MDS, n=3; MPS, n=3; ET, n=2) using the amplicon approach and investigated seven candidate genes with relevance in oncogenesis of myeloid malignancies: TET2, RUNX1, JAK2, MPL, KRAS, NRAS, and CBL. 43 primer pairs were designed to cover the complete coding regions of TET2, RUNX1 (beta isoform), and hotspot regions of the latter genes. In total, 4128 individual PCR reactions were performed with DNA isolated from bone marrow mononuclear cells, followed by product purification, fluorometric quantitation, and equimolar pooling of the corresponding 43 amplicon products to generate one single sequence library per patient. For sequencing, a 454 8-lane PTP was used applying standard FLX chemistry and representing one patient per lane. The median number of base pairs sequenced per patient was 9.23 Mb. For each amplicon a median of 840 reads was generated (coverage range: 485–1929 reads). As initial proof-of-concept analysis 27 of the 95 patients with known mutations (n=32) as detected by conventional sequencing or melting curve analyses were investigated (range of cells carrying the respective mutation: 1.1% for JAK2 V617F to 98.14% for TET2 C1464X). In all cases, 454 NGS confirmed results from routine diagnostic methods (GS Amplicon Variant Analyzer software version 2.0.01). We then investigated the remaining 69 CMML patients: In median, 2 variances (range 1–8 variances), i.e. differences in comparison to the reference sequence, per patient were detected. These variances included both point mutations in all candidate genes and large deletions (12-19 bp) in CBL, RUNX1, and TET2. Only 20/81 patients of the CMML-cohort (24.69%) were without any detectable mutation. Secondly, in a cohort of six AML bone marrow specimens a custom NimbleGen array (385K format; Madison, WI) was used to perform a targeted DNA sequence enrichment procedure. In total, capture probes spanning 1.91 Mb were designed to represent all coding regions of 92 target genes (1559 exons) with relevance in hematological malignancies (e.g. KIT, NF1, TP53, BCR, ABL1, NPM1, or FLT3). In addition, the complete genomic regions were targeted for RUNX1, CBFB, and MLL. For sequencing, 454 Titanium chemistry was applied, loading three patients per lane on a 2-lane PTP including three molecular identifiers (MIDs) each. Data analysis was performed using the GS Reference Mapper software version 2.0.01. For the enrichment assay, the median enrichment of the targeted genomic loci was 207-fold, as assessed by ligation-mediated LM-PCR. Overall, 1,098,132 reads were generated in the two lanes, yielding a total sequence length of 386,097,740 bases. In median, 96.52% of the sequenced bases mapped against the human genome, and 66.0% were derived from the customized NimbleGen array capture probes, resulting in a median coverage of 18.7-fold . With this method it was possible to detect and confirm point mutations (KIT, FLT3-TKD, and KRAS) and insertions (FLT3-ITD). Moreover, by capturing chimeric DNA fragments and generating reads mapping to both fusion partners this approach detected balanced aberrations, i.e. inv(16)(p13q22) and the translocations t(8;21)(q22;q22) or t(9;11)(p22;q23). In conclusion, both assays to specifically sequence targeted regions with oncogenic relevance on a NGS platform demonstrated promising results and are feasible. The amplicon approach is more suitable for detection of mutations in a routine setting and is ideally suited for large genes such as TET2, ATM, and NF1, which are labor-intensive to sequence conventionally. The array-based capturing assay is characterized by a complex and time-consuming workflow with low-throughput. However, the ability to detect balanced genomic aberrations which are detectable thus far only by cytogenetics and FISH has the potential to become an important diagnostic assay, especially in tumors in which cytogenetics can not be applied successfully. Disclosures: Grossmann: MLL Munich Leukemia Laboratory: Employment. Kohlmann:MLL Munich Leukemia Laboratory: Employment. Haferlach:MLL Munich Leukemia Laboratory: Equity Ownership. Dicker:MLL Munich Leukemia Laboratory: Employment. Kazak:MLL Munich Leukemia Laboratory: Employment. Schindela:MLL Munich Leukemia Laboratory: Employment. Schnittger:MLL Munich Leukemia Laboratory: Equity Ownership. Kern:MLL Munich Leukemia Laboratory: Equity Ownership. Haferlach:MLL Munich Leukemia Laboratory: Equity Ownership.

Download Full-text

A Single Next-Generation Sequencing (NGS) Assay for the Detection of Point Mutations and Large Chromosomal Abnormalities in MDS Patients

Leukemia Research ◽

10.1016/s0145-2126(17)30143-1 ◽

2017 ◽

Vol 55 ◽

pp. S18

Author(s):

A. Liquori ◽

E. Such ◽

L. Palomo ◽

S. Moreau ◽

L. Pedrola ◽

...

Keyword(s):

Next Generation Sequencing ◽

Chromosomal Abnormalities ◽

Point Mutations ◽

Next Generation ◽

Next Generation Sequencing Ngs ◽

Generation Sequencing

Download Full-text

Novel Primer Sets for Next Generation Sequencing-Based Analyses of Water Quality

PLoS ONE ◽

10.1371/journal.pone.0170008 ◽

2017 ◽

Vol 12 (1) ◽

pp. e0170008 ◽

Cited By ~ 4

Author(s):

Elvina Lee ◽

Maninder S. Khurana ◽

Andrew S. Whiteley ◽

Paul T. Monis ◽

Andrew Bath ◽

...

Keyword(s):

Water Quality ◽

Next Generation Sequencing ◽

Next Generation ◽

Primer Sets ◽

Generation Sequencing

Download Full-text

Analytical and Clinical Validation of an Amplicon-based Next Generation Sequencing Assay for Ultrasensitive Detection of Circulating Tumor DNA

10.1101/2021.08.03.21261575 ◽

2021 ◽

Author(s):

Jonathan Poh ◽

Kao Chin Ngeow ◽

Michelle Pek ◽

Kian-Hin Tan ◽

Jing Shan Lim ◽

...

Keyword(s):

Next Generation Sequencing ◽

Copy Number ◽

Point Mutations ◽

Circulating Tumor Dna ◽

Clinical Validation ◽

Gene Fusions ◽

Copy Number Alterations ◽

Next Generation ◽

Tumor Dna ◽

Generation Sequencing

Next-generation sequencing of circulating tumor DNA presents a promising approach to cancer diagnostics, complementing conventional tissue-based diagnostic testing by enabling minimally invasive serial testing and broad genomic coverage through a simple blood draw to maximize therapeutic benefit to patients. LiquidHALLMARK® is an amplicon-based next-generation sequencing assay developed for the genomic profiling of plasma-derived cell-free DNA. The comprehensive 80-gene panel profiles point mutations, insertions/deletions, copy number alterations, and gene fusions, and further detects oncogenic viruses (EBV and HBV) and microsatellite instability. Here, the analytical and clinical validation of the assay is reported. Analytical validation using reference genetic materials demonstrated a sensitivity of 99.38% for point mutations and 95.83% for insertions/deletions at 0.1% variant allele frequency (VAF), and a sensitivity of 91.67% for gene fusions at 0.5% VAF, with high specificity even at 0.1% VAF (99.11% per-base). The limit of detection for copy number alterations, EBV, HBV, and microsatellite instability were also empirically determined. Orthogonal comparison of EGFR variant calls made by LiquidHALLMARK and a reference allele-specific PCR method for 355 lung cancer specimens revealed an overall concordance of 93.80%, while external validation with cobas® EGFR Mutation Test v2 for 50 lung cancer specimens demonstrated an overall concordance of 84.00%, with a 100% concordance rate for EGFR variants above 0.4% VAF. Clinical application of LiquidHALLMARK in 1,592 consecutive patients demonstrated a high detection rate (74.8% alteration-positive in cancer samples) and broad actionability (50.0% of cancer samples harboring alterations with biological evidence for actionability). Among ctDNA-positive lung cancers, 72.5% harbored at least one biomarker with a guideline-approved drug indication. These results establish the high sensitivity, specificity, accuracy, and precision of the LiquidHALLMARK assay and supports its clinical application for blood-based genomic testing.

Download Full-text