scholarly journals Whole genome sequencing, variant analysis, phylogenetics, and deep sequencing of Zika virus strains

2018 ◽  
Vol 8 (1) ◽  
Author(s):  
Susmita Shrivastava ◽  
Vinita Puri ◽  
Kari A. Dilley ◽  
Erica Ngouajio ◽  
Jessica Shifflett ◽  
...  
2010 ◽  
Vol 42 (11) ◽  
pp. 931-936 ◽  
Author(s):  
Akihiro Fujimoto ◽  
Hidewaki Nakagawa ◽  
Naoya Hosono ◽  
Kaoru Nakano ◽  
Tetsuo Abe ◽  
...  

PLoS ONE ◽  
2019 ◽  
Vol 14 (8) ◽  
pp. e0220512 ◽  
Author(s):  
Zagaa Odgerel ◽  
Shilpa Sonti ◽  
Nora Hernandez ◽  
Jemin Park ◽  
Ruth Ottman ◽  
...  

2021 ◽  
Author(s):  
Maqsud Hossain ◽  
Tahrima Saiha Huq ◽  
Aura Rahman ◽  
Md. Aminul Islam ◽  
Syeda Naushin Tabassum ◽  
...  

Abstract Whole-genome sequencing is increasingly being used to investigate the spatial and temporal distribution of viral pathogens including the Severe Acute Respiratory Syndrome Coronavirus Variant 2 (SARS-CoV-2) which is responsible for the ongoing COVID-19 pandemic. In this study, we determined 55 complete genome sequences of SARS-CoV-2 strains isolated from patients from Noakhali, a South-Eastern district in Bangladesh. Variant analysis of our sequenced genomes identified sixteen rare variations in S, six in N, two in M, one in E protein and the S protein variation, Y204F, identified in two of our sequenced strains, has not been reported from any other countries in the GISAID database. Comparison of the prevalence pattern across the country showed GH clade lineages B.1.36 and B.1.36.16 to be abundant in Noakhali and the South-Eastern region of Chittagong when compared to the rest of the country. Phylodynamic analysis of our sequenced genomes revealed that the virus was estimated to be evolving at the rate of 1.065 X 10− 4 subs/site/year. The study results demonstrated the necessity of initiating a concerted, country-wide genomics surveillance effort to determine any novel mutation of functional significance, understanding virus evolution, transmission, and spread in Bangladesh. Short running title: Genome sequencing of Noakhali isolates SARS-Cov-2 in Bangladesh


2021 ◽  
Vol 144 ◽  
pp. 104993
Author(s):  
Jordy P.M. Coolen ◽  
Femke Wolters ◽  
Alma Tostmann ◽  
Lenneke F.J. van Groningen ◽  
Chantal P. Bleeker-Rovers ◽  
...  

2021 ◽  
Vol 156 (Supplement_1) ◽  
pp. S138-S138
Author(s):  
J A SoRelle ◽  
A Clark ◽  
Z Wang ◽  
J Park

Abstract Introduction/Objective The majority of tracking methods have employed whole genome sequencing, which can be very expensive and time consuming. An alternative method has been to use genotyping of specific mutations to identify variants. However, tracking SARS-CoV-2 variants by targeted methods has been a moving target. Most methods only multiplex four targets per reaction, but we have multiplexed 8 targets in a single tube using fragment analysis. Methods/Case Report Fluorescently labeled primers targeted a combination of insertion/ deletion mutations and single nucleotide mutations. The PCR amplified products, amplicons, were separated by capillary electrophoresis. Primers were designed to detect changes in size indicative of insertion or deletion mutations including: ORF1A:Del3675_3677, S:Del69_70, S:Del144, S:Del157_158, S:Del242_244, ORF8:Del119_120, and ORF8:ins28269-28273. Allele-specific primers were designed to detect both the wild-type and mutated versions of S:N501Y, S:E484K, and S:L452R. Residual nasopharyngeal and nasal specimens testing positive for SARS-CoV-2 by RT-PCR or isothermal amplification (IDnow) methods were selected from May 1- June 24, 2021. Variant analysis was performed by multiplex targeted PCR and whole genome sequencing in parallel on the same specimens to determine positive percent agreement. Results (if a Case Study enter NA) Variant analysis was performed on 250 specimens detecting each of the major variants of concern Alpha (B.1.1.7, U.K. origin, n= 108), Beta (B.1.351, South Africa origin, n=3), Gamma (P.1, Brazil origin, n=12), Delta (B.1.617.2, Indian origin, n=17), and Iota (B.1.526, New York, n=5). Some specimens with low viral load were detected by only PCR (n=18), only WGS (n=41), or neither (n=20). Overall positive percent agreement was 95% (163/171). Conclusion This adjustable method robustly and accurately identifies COVID-19 VOCs utilizing a platform amenable to multiple targets (20-40 targets ranging from 100-500b.p. across four fluorescent channels) using equipment commonly found in routine molecular pathology laboratories. Future directions include adjusting targets to detect new variants.


BMC Genomics ◽  
2012 ◽  
Vol 13 (1) ◽  
pp. 78 ◽  
Author(s):  
Ryan Doan ◽  
Noah D Cohen ◽  
Jason Sawyer ◽  
Noushin Ghaffari ◽  
Charlie D Johnson ◽  
...  

Blood ◽  
2014 ◽  
Vol 124 (21) ◽  
pp. 1958-1958
Author(s):  
Adam Burns ◽  
Susanne Weller ◽  
Sean Humphray ◽  
Jennifer Becq ◽  
Reem Alsolami ◽  
...  

Abstract Background Chronic lymphocytic leukaemia (CLL) is characterised by clinical and biological heterogeneity. Despite significant advances in therapeutic management, CLL remains largely incurable. Current risk stratification is based on cytogenetic features (del(17p), del(11q), del(13q), +12). So far, sequencing studies in CLL have focussed predominantly on the exome. These have identified a number of genes that are recurrently mutated at low frequency such as TP53, SF3B1, ATM, NOTCH1, MYD88, and BIRC3. Apart from TP53 abnormalities, none of these are currently used to guide clinical decisions and it is unclear how they are implicated in disease pathogenesis. Methods In this study, we sought to further refine the molecular landscape of CLL using whole genome sequencing (WGS) of paired tumour and germline DNA samples from a cohort of clinically annotated patients with CLL. We sequenced a heterogeneous cohort of 41 samples (25 males, 16 females, median age 69 (range 49-94)) with a range of clinical features (49% fludarabine refractory, 61% unmutated IgVH). Whole genome sequencing libraries were generated using the Illumina TruSeq PCR-free sample preparation kit, with a median insert size of 400bp, and subjected to 100bp paired-end sequencing on an Illumina HiSeq 2500 platform. Both tumour and germline libraries were sequenced to an average depth of 38x. Sequencing reads were aligned using the Isaac algorithm and the Starling and Strelka algorithms were used for SNV and Indel calling in germline and tumour samples respectively. All variants with a read depth <10x or a quality score <Q30 were excluded using Illumina VariantStudio software. For validation, selected mutations were verified using a combination of a targeted deep sequencing panel on the Illumina MiSeq platform and conventional Sanger sequencing. Copy number alterations were identified from the whole genome sequencing data using Nexus 7.5 (Biodiscovery), with findings validated on Illumina OmniExpress24 arrays. Results Whole genome sequencing revealed a total of 95,305 somatic indels and base substitutions, averaging 30.8 per patient (range 7-57) or 0.3 mutations per megabase. Of these mutations, 1266 occur in protein coding regions across 1108 genes, including 556 in 3’ and 5’ untranslated regions. Of these 1108 genes, we identified 93 as recurrently mutated (mutations present in more than one sample), including the previously described SF3B1 (12/41, 29.3%), TP53 (9/41, 22%), ATM (6/41, 14.6%), NOTCH1 (6/41, 14.6%), FAT1 (4/41, 9.8%) and BIRC3 (2/41, 4.9%). In addition to FAT1, we also identified two missense mutations in another cadherin superfamily member, FAT4(2/41, 4.9%), both occurring within the extracellular cadherin domains. Missense mutations were the most frequent (42.7%) followed by those in 3’ UTRs (36.1%), 5’ UTRs (7.7%), splice sites (6.1%), small indels (4.3%) and nonsense mutations (3.1%). In addition, 61.5% of missense mutations were identified as either deleterious or damaging by the SIFT or PolyPhen-2 algorithms. We used a modified version of the MutSigCV algorithm to identify genes with significantly higher mutation rates in the coding sequence. A similar statistical approach was used to identify significant mutations in untranslated regions. Importantly, a number of interesting candidate genes carried mutations in non-coding regions, including NFKBIZ (3/41, 7.3%), IGLL5 (3/41, 7.3%) and BCL2(2/41, 4.9%). Conclusion To our knowledge, this is the largest whole genome sequencing study in CLL so far. We present a comprehensive catalogue of genomic alteration in CLL and associate genome-wide patterns, including the presence of subclones, with clinical outcome. In addition to demonstrating the heterogeneous nature of the CLL genome, our data highlights the variety of mutations present in the regulatory regions of genes as well as structural variations, thus providing new insights for hypothesis-driven biomarker and therapeutic discovery. Disclosures Humphray: Illumina Cambridge Ltd: Employment. Becq:Illumina Cambridge Ltd: Employment. Bentley:Illumina Cambridge Ltd: Employment.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Kimia Kamelian ◽  
Vincent Montoya ◽  
Andrea Olmstead ◽  
Winnie Dong ◽  
Richard Harrigan ◽  
...  

Abstract In 2018, the World Health Organization identified the Zika virus (ZIKV) as a pathogen that should be prioritized for public health research due to its epidemic potential. In this study, whole-genome sequencing (WGS) of travel-acquired ZIKV infections was used to examine the limitations of phylogenetic analysis. WGS and phylogenetic analysis were performed to investigate geographic clustering of samples from five Canadians with travel-acquired ZIKV infections and to assess the limitations of phylogenetic analysis of ZIKV sequences using a phylogenetic cluster approach. Genomic variability of ZIKV samples was assessed and for context, compared with hepatitis C virus (HCV) samples. Phylogenetic analysis confirmed the suspected region of ZIKV infection for one of five samples and one sample failed to cluster with sequences from its suspected country of infection. Travel-acquired ZIKV samples depicted low genomic variability relative to HCV samples. A floating patristic distance threshold classified all pre-2000 ZIKV sequences into separate clusters, while only Cambodian, Peruvian, Malaysian, and South Korean sequences were similarly classifiable. While phylogenetic analysis of ZIKV data can identify the broad geographical region of ZIKV infection, ZIKV’s low genomic variability is likely to limit precise interpretations of phylogenetic analysis of the origins of travel-related cases.


Sign in / Sign up

Export Citation Format

Share Document