strand bias
Recently Published Documents


TOTAL DOCUMENTS

65
(FIVE YEARS 5)

H-INDEX

23
(FIVE YEARS 0)

2021 ◽  
Author(s):  
Metin Balaban ◽  
Nishat Anjum Bristy ◽  
Ahnaf Faisal ◽  
Md Shamsuzzoha Bayzid ◽  
Siavash Mirarab

While aligning sequences has been the dominant approach for determining homology prior to phylogenetic inference, alignment-free methods have much appeal in terms of simplifying the process of inference, especially when analyzing genome-wide data. Furthermore, alignment-free methods present the only option for some emerging forms of data such as genome skims, which cannot be assembled. Despite the appeal, alignment-free methods have not been competitive with alignment-based methods in terms of accuracy. One limitation of alignment-free methods is that they typically rely on simplified models of sequence evolution such as Jukes-Cantor. It is possible to compute pairwise distances under more complex models by computing frequencies of base substitutions provided that these quantities can be estimated in the alignment-free setting. A particular limitation is that for many forms of genome-wide data, which arguably present the best use case for alignment-free methods, the strand of DNA sequences is unknown. Under such conditions, the so-called no-strand bias models are the most complex models that can be used. Here, we show how to calculate distances under a no-strain bias restriction of the General Time Reversible (GTR) model called TK4 without relying on alignments. The method relies on replacing letters in the input sequences, and subsequent computation of Jaccard indices between k-mer sets. For the method to work on large genomes, we also need to compute the number of k-mer mismatches after replacement due to random chance. We show in simulation that these alignment-free distances can be highly accurate when genomes evolve under the assumed models, and we examine the effectiveness of the method on real genomic data.


2021 ◽  
Author(s):  
Bert Hubert

ABSTRACTGC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew.This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included. The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.


Viruses ◽  
2021 ◽  
Vol 13 (2) ◽  
pp. 148
Author(s):  
Khalil Geballa-Koukoulas ◽  
Julien Andreani ◽  
Bernard La Scola ◽  
Guillaume Blanc

Kaumoebavirus infects the amoeba Vermamoeba vermiformis and has recently been described as a distant relative of the African swine fever virus. To characterize the diversity and evolution of this novel viral genus, we report here on the isolation and genome sequencing of a second strain of Kaumoebavirus, namely LCC10. Detailed analysis of the sequencing data suggested that its 362-Kb genome is linear with covalently closed hairpin termini, so that DNA forms a single continuous polynucleotide chain. Comparative genomic analysis indicated that although the two sequenced Kaumoebavirus strains share extensive gene collinearity, 180 predicted genes were either gained or lost in only one genome. As already observed in another distant relative, i.e., Faustovirus, which infects the same host, the center and extremities of the Kaumoebavirus genome exhibited a higher rate of sequence divergence and the major capsid protein gene was colonized by type-I introns. A possible role of the Vermamoeba host in the genesis of these evolutionary traits is hypothesized. The Kaumoebavirus genome exhibited a significant gene strand bias over the two-third of genome length, a feature not seen in the other members of the “extended Asfarviridae” clade. We suggest that this gene strand bias was induced by a putative single origin of DNA replication located near the genome extremity that imparted a selective force favoring the genes positioned on the leading strand.


2020 ◽  
Author(s):  
Ilias Georgakopoulos-Soares ◽  
Ioannis Mouratidis ◽  
Guillermo E Parada ◽  
Navneet Matharu ◽  
Martin Hemberg ◽  
...  

Abstract DNA strand asymmetries can have a major effect on several biological functions, including replication, transcription and transcription factor binding. As such, DNA strand asymmetries and mutational strand bias can provide information about biological function. However, a versatile tool to explore this does not exist. Here, we present Asymmetron, a user-friendly computational tool that performs statistical analysis and visualizations for the evaluation of strand asymmetries. Asymmetron takes as input DNA features provided with strand annotation and outputs strand asymmetries for consecutive occurrences of a single DNA feature or between pairs of features. We illustrate the use of Asymmetron by identifying transcriptional and replicative strand asymmetries of germline structural variant breakpoints. We also show that the orientation of the binding sites of 45% of human transcription factors analyzed have a significant DNA strand bias in transcribed regions, that is also corroborated in ChIP-seq analyses, and is likely associated with transcription. In summary, we provide a novel tool to assess DNA strand asymmetries and show how it can be used to derive new insights across a variety of biological disciplines.


2020 ◽  
Author(s):  
Mareike Herzog ◽  
Elisa Alonso-Perez ◽  
Israel Salguero ◽  
Jonas Warringer ◽  
David J. Adams ◽  
...  

ABSTRACTA single amino acid residue change in the exonuclease domain of human DNA polymerase ε, P286R, is associated with the development of colorectal cancers, and has been shown to impart a mutagenic phenotype. Perhaps unexpectedly, the corresponding Pol ε allele in the yeast Saccharomyces cerevisiae (pol2-P301R), was found to drive greater mutagenesis than exonuclease-deficient Pol ε (pol2-4), a phenotype sometimes termed ultra-mutagenesis. By studying the impact on mutation frequency, type, replication-strand bias, and sequence context, we show that ultra-mutagenesis is commonly observed in cells carrying a range of cancer-associated Pol ε exonuclease domain alleles. Similarities between mutations generated by these alleles and those generated in pol2-4 cells indicate a shared mechanism of mutagenesis that yields a mutation pattern similar to cancer Signature 14. Comparison of POL2 ultra-mutator with pol2-M644G, a mutant in the polymerase domain decreasing Pol ε fidelity, revealed unexpected analogies in the sequence context and strand bias of mutations. Analysis of mutational patterns unique to exonuclease domain mutant cells suggests that backtracking of the polymerase, when the mismatched primer end cannot be accommodated in the proofreading domain, results in the observed increase in insertions and T>A mutations in specific sequence contexts.


2018 ◽  
Vol 9 ◽  
Author(s):  
Ekaterina Minskaia ◽  
Barbara C. Saraiva ◽  
Maria M. V. Soares ◽  
Rita I. Azevedo ◽  
Ruy M. Ribeiro ◽  
...  
Keyword(s):  
T Cell ◽  

2016 ◽  
Vol 37 (8) ◽  
pp. 810-816 ◽  
Author(s):  
Ross R. Keller ◽  
Shelley A. Gestl ◽  
Amy Q. Lu ◽  
Alicia Hoke ◽  
David J. Feith ◽  
...  
Keyword(s):  

2014 ◽  
Vol 15 (4) ◽  
pp. 723-736 ◽  
Author(s):  
Anders Gonçalves da Silva ◽  
William Barendse ◽  
James W. Kijas ◽  
Wes C. Barris ◽  
Sean McWilliam ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document