scholarly journals Mutation Patterns of Human SARS-CoV-2 and Bat RaTG13 Coronavirus Genomes Are Strongly Biased Towards C>U Transitions, Indicating Rapid Evolution in Their Hosts

Genes ◽  
2020 ◽  
Vol 11 (7) ◽  
pp. 761 ◽  
Author(s):  
Roman Matyášek ◽  
Aleš Kovařík

The pandemic caused by the spread of SARS-CoV-2 has led to considerable interest in its evolutionary origin and genome structure. Here, we analyzed mutation patterns in 34 human SARS-CoV-2 isolates and a closely related RaTG13 isolated from Rhinolophus affinis (a horseshoe bat). We also evaluated the CpG dinucleotide contents in SARS-CoV-2 and other human and animal coronavirus genomes. Out of 1136 single nucleotide variations (~4% divergence) between human SARS-CoV-2 and bat RaTG13, 682 (60%) can be attributed to C>U and U>C substitutions, far exceeding other types of substitutions. An accumulation of C>U mutations was also observed in SARS-CoV2 variants that arose within the human population. Globally, the C>U substitutions increased the frequency of codons for hydrophobic amino acids in SARS-CoV-2 peptides, while U>C substitutions decreased it. In contrast to most other coronaviruses, both SARS-CoV-2 and RaTG13 exhibited CpG depletion in their genomes. The data suggest that C-to-U conversion mediated by C deamination played a significant role in the evolution of the SARS-CoV-2 coronavirus. We hypothesize that the high frequency C>U transitions reflect virus adaptation processes in their hosts, and that SARS-CoV-2 could have been evolving for a relatively long period in humans following the transfer from animals before spreading worldwide.

2020 ◽  
Author(s):  
Roman Matyasek ◽  
Ales Kovarik

Abstract Background: The world pandemy caused by SARS-CoV-2 spreading has raised considerable interest about its evolutionary origin and genome structure. Here we analysed mutation patterns in 13 human SARS-COV-2 isolates and a closely related RaTG13 isolated from Rhinolophus affinis bat. We also evaluated the CpG dinucleotide contents in SARS-COV-2 and other human and animal coronavirus genomes. Results: Out of 1107 single nucleotide differences (c. 4% divergence) between human SARS-COV-2 and bat RaTG13, 672 (61%) can be attributed to C>U and U>T substitutions significantly (P<0.001) exceeding other types of SNPs. A similar trend was observed among the 13 sequenced SARS-COV-2 genomes. Accumulation of C>U mutations was also observed in a highly variable subregion encoding the ACE2 receptor contact domain. Contrast to most other coronaviruses both SARS-COV-2 and RaTG13 exhibited CpG depletion in their genomes. Conclusion: The data support that the C-to-U conversion played a significant role in the evolution of pathogenic RNA coronaviruses including SARS-COV-2. These mutations apparently also influenced amino acid composition of the SARS-Cov-2 spike protein domain receptor implicated in virus pathogenicity. We propose that SARS-COV-2 was evolving relatively long in humans following the transfer from animals before spreading world-wide.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Pengcheng Du ◽  
Nan Ding ◽  
Jiarui Li ◽  
Fujie Zhang ◽  
Qi Wang ◽  
...  

Abstract The spread of SARS-CoV-2 in Beijing before May, 2020 resulted from transmission following both domestic and global importation of cases. Here we present genomic surveillance data on 102 imported cases, which account for 17.2% of the total cases in Beijing. Our data suggest that all of the cases in Beijing can be broadly classified into one of three groups: Wuhan exposure, local transmission and overseas imports. We classify all sequenced genomes into seven clusters based on representative high-frequency single nucleotide polymorphisms (SNPs). Genomic comparisons reveal higher genomic diversity in the imported group compared to both the Wuhan exposure and local transmission groups, indicating continuous genomic evolution during global transmission. The imported group show region-specific SNPs, while the intra-host single nucleotide variations present as random features, and show no significant differences among groups. Epidemiological data suggest that detection of cases at immigration with mandatory quarantine may be an effective way to prevent recurring outbreaks triggered by imported cases. Notably, we also identify a set of novel indels. Our data imply that SARS-CoV-2 genomes may have high mutational tolerance.


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Jiao Fan ◽  
Yige Ding ◽  
Chao Ren ◽  
Ziguo Song ◽  
Jie Yuan ◽  
...  

AbstractCytosine or adenine base editors (CBEs or ABEs) hold great promise in therapeutic applications because they enable the precise conversion of targeted base changes without generating of double-strand breaks. However, both CBEs and ABEs induce substantial off-target DNA editing, and extensive off-target RNA single nucleotide variations in transfected cells. Therefore, the potential effects of deaminases induced by DNA base editors are of great importance for their clinical applicability. Here, the transcriptome-wide deaminase effects on gene expression and splicing is examined. Differentially expressed genes (DEGs) and differential alternative splicing (DAS) events, induced by base editors, are identified. Both CBEs and ABEs generated thousands of DEGs and hundreds of DAS events. For engineered CBEs or ABEs, base editor-induced variants had little effect on the elimination of DEGs and DAS events. Interestingly, more DEGs and DAS events are observed as a result of over expressions of cytosine and adenine deaminases. This study reveals a previously overlooked aspect of deaminase effects in transcriptome-wide gene expression and splicing, and underscores the need to fully characterize such effects of deaminase enzymes in base editor platforms.


PLoS ONE ◽  
2012 ◽  
Vol 7 (5) ◽  
pp. e36212 ◽  
Author(s):  
Raja Mazumder ◽  
Krishna Sudeep Morampudi ◽  
Mona Motwani ◽  
Sona Vasudevan ◽  
Radoslav Goldman

PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e2933 ◽  
Author(s):  
Hoseong Choi ◽  
Yeonhwa Jo ◽  
Ju-Yeon Yoon ◽  
Seung-Kook Choi ◽  
Won Kyong Cho

Viroids are the smallest infectious agents, and their genomes consist of a short single strand of RNA that does not encode any protein.Chrysanthemum stunt viroid(CSVd), a member of the familyPospiviroidae, causes chrysanthemum stunt disease. Here, we report the genomic variations of CSVd to understand the sequence variability of CSVd in different chrysanthemum cultivars. We randomly sampled 36 different chrysanthemum cultivars and examined the infection of CSVd in each cultivar by reverse transcription polymerase chain reaction (RT-PCR). Eleven cultivars were infected by CSVd. Cloning followed by Sanger sequencing successfully identified a total of 271 CSVd genomes derived from 12 plants from 11 cultivars. They were further classified into 105 CSVd variants. Each single chrysanthemum plant had a different set of CSVd variants. Moreover, different single plants from the same cultivar had different sets of CSVd variants but identical consensus genome sequences. A phylogenetic tree using 12 consensus genome sequences revealed three groups of CSVd genomes, while six different groups were defined by the phylogenetic analysis using 105 variants. Based on the consensus CSVd genome, by combining all variant sequences, we identified 99 single-nucleotide variations (SNVs) as well as three nucleotide positions showing high mutation rates. Although 99 SNVs were identified, most CSVd genomes in this study were derived from variant 1, which is identical to known CSVd SK1 showing pathogenicity.


2020 ◽  
Author(s):  
Shuo Li ◽  
Zorawar Noor ◽  
Weihua Zeng ◽  
Xiaohui Ni ◽  
Zuyang Yuan ◽  
...  

AbstractLiquid biopsy using cell-free DNA (cfDNA) is attractive for a wide range of clinical applications, including cancer detection, locating, and monitoring. However, developing these applications requires precise and sensitive calling of somatic single nucleotide variations (SNVs) from cfDNA sequencing data. To date, no SNV caller addresses all the special challenges of cfDNA to provide reliable results. Here we present cfSNV, a revolutionary somatic SNV caller with five innovative techniques to overcome and exploit the unique properties of cfDNA. cfSNV provides hierarchical mutation profiling, thanks to cfDNA’s complete coverage of the clonal landscape, and multi-layer error suppression. In both simulated datasets and real patient data, we demonstrate that cfSNV is superior to existing tools, especially for low-frequency somatic SNVs. We also show how the five novel techniques contribute to its performance. Further, we demonstrate a clinical application using cfSNV to select non-small-cell lung cancer patients for immunotherapy treatment.


Author(s):  
Jouni Sirén ◽  
Jean Monlong ◽  
Xian Chang ◽  
Adam M. Novak ◽  
Jordan M. Eizenga ◽  
...  

ABSTRACTWe introduce Giraffe, a pangenome short read mapper that can efficiently map to a collection of haplotypes threaded through a sequence graph. Giraffe, part of the variation graph toolkit (vg)1, maps reads to thousands of human genomes at around the same speed BWA-MEM2 maps reads to a single reference genome, while maintaining comparable accuracy to VG-MAP, vg’s original mapper. We have developed efficient genotyping pipelines using Giraffe. We demonstrate improvements in genotyping for single nucleotide variations (SNVs), insertions and deletions (indels) and structural variations (SVs) genome-wide. We use Giraffe to genotype and phase 167 thousands structural variations ascertained from long read studies in 5,202 human genomes sequenced with short reads, including the complete 1000 Genomes Project dataset, at an average cost of $1.50 per sample. We determine the frequency of these variations in diverse human populations, characterize their complex allelic variations and identify thousands of expression quantitative trait loci (eQTLs) driven by these variations.


Sign in / Sign up

Export Citation Format

Share Document