scholarly journals Structure of the space of taboo-free sequences

2020 ◽  
Vol 81 (4-5) ◽  
pp. 1029-1057
Author(s):  
Cassius Manuel ◽  
Arndt von Haeseler

Abstract Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphabet with taboos, that is, with prohibited substrings. The taboo-set is referred to as $$\mathbb {T}$$ T and any allowed string as a taboo-free string. We consider the so-called Hamming graph $$\varGamma _n(\mathbb {T})$$ Γ n ( T ) , whose vertices are taboo-free strings of length n and whose edges connect two taboo-free strings if their Hamming distance equals one. Any (random) walk on this graph describes the evolution of a DNA sequence that avoids taboos. We describe the construction of the vertex set of $$\varGamma _n(\mathbb {T})$$ Γ n ( T ) . Then we state conditions under which $$\varGamma _n(\mathbb {T})$$ Γ n ( T ) and its suffix subgraphs are connected. Moreover, we provide an algorithm that determines if all these graphs are connected for an arbitrary $$\mathbb {T}$$ T . As an application of the algorithm, we show that about $$87\%$$ 87 % of bacteria listed in REBASE have a taboo-set that induces connected taboo-free Hamming graphs, because they have less than four type II restriction enzymes. On the other hand, four properly chosen taboos are enough to disconnect one suffix subgraph, and consequently connectivity of taboo-free Hamming graphs could change depending on the composition of restriction sites.

2019 ◽  
Author(s):  
Cassius Manuel ◽  
Arndt von Haeseler

AbstractModels of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition sequence can be lethal. Motivated by this observation, we studied the set of strings over a finite alphabet with taboos, that is, with prohibited substrings. The taboo-set is referred to as and any allowed string as a taboo-free string. We consider the graph whose vertices are taboo-free strings of length n and whose edges connect two taboo-free strings if their Hamming distance equals 1. Any (random) walk on this graph describes the evolution of a DNA sequence that avoids deleterious taboos. We describe the construction of the vertex set of . Then we state conditions under which and its suffix subgraphs are connected. Moreover, we provide a simple algorithm that can determine, for an arbitrary , if all these graphs are connected. We concluded that bacterial taboo-free Hamming graphs are nearly always connected, although 4 properly chosen taboos are enough to disconnect one of its suffix subgraphs.


2001 ◽  
Vol 311 (3) ◽  
pp. 515-527 ◽  
Author(s):  
Susan E. Milsom ◽  
Stephen E. Halford ◽  
Michelle L. Embleton ◽  
Mark D. Szczelkun

2013 ◽  
Vol 88 (2) ◽  
pp. 286-296 ◽  
Author(s):  
DANIEL R. HAWTIN ◽  
NEIL I. GILLESPIE ◽  
CHERYL E. PRAEGER

AbstractWe consider a code to be a subset of the vertex set of a Hamming graph. We examine elusive pairs, code-group pairs where the code is not determined by knowledge of its set of neighbours. We construct a new infinite family of elusive pairs, where the group in question acts transitively on the set of neighbours of the code. In these examples, the alphabet size always divides the length of the code. We show that there is no elusive pair for the smallest set of parameters that does not satisfy this condition. We also pose several questions regarding elusive pairs.


2001 ◽  
Vol 29 (4) ◽  
pp. 363-373 ◽  
Author(s):  
S. E. Halford

Type II restriction endonucleases recognize specific DNA sequences and cleave both strands of the DNA at fixed locations at or near their recognition sites. Many of these enzymes are dimeric proteins that recognize, in symmetrical fashion, palindromic DNA sequences. They generally catalyse independent reactions at each recognition site on the DNA, although in some cases they act processively; cutting the DNA first at one site, then translocating along the DNA to another site and cutting that before leaving the DNA. The way in which the degree of processivity varies with the length of DNA between the sites can reveal the mechanism of translocation. In contrast with the common view that proteins move along DNA by ‘sliding’, the principal mode of transfer of the EcoRV endonuclease is by ‘hopping’ and ‘jumping’, i.e. the dissociation of the protein from one site followed by its re-association with another site in the same DNA molecule, either close to or distant from the original site. Other type II restriction enzymes require two copies of their recognition sites for their DNA cleavage reactions. Many of these enzymes, such as SfiI, are tetramers with two DNA-binding surfaces. SfiI has no activity when bound to just one recognition site, and instead both DNA-binding surfaces have to be filled before it becomes active. Although the two sites can be on separate DNA molecules, SfiI acts optimally with two sites on the same DNA, where it traps the DNA between the sites in a loop. SfiI thus constitutes a test system for the analysis of DNA looping.


2006 ◽  
Vol 3 (1) ◽  
pp. 13-17 ◽  
Author(s):  
Yang Ke-Qiang ◽  
Wang Yue-Jin ◽  
Zhang Jin-Jin ◽  
Wang Xi-Ping ◽  
Wan Yi-Zhen ◽  
...  

AbstractThe cloning and sequencing of two specific molecular markers linked to grape (Vitis vinifera) seedless gene indicated that the full lengths of the markers, 39970524-5-564 (GenBank accession number: AY327513) and 39970524-6-1538 (GenBank accession number: AY327514), were 564 bp and 1538 bp, respectively. The restriction sites of sequences 39970524-5-564 and 39970524-6-1538 were analysed with Wingene231 software. The results indicated that there were 29 and 130 restriction sites in 39970524-5-564 and 39970524-6-1538, respectively, digested by restriction enzymes with recognition sites of six or more than six bases. There was no cleavage site in 39970524-5-564 and there was one site at 135 bp of 39970524-6-1538 when these two markers were digested by EcoRI. Both markers can not be digested by HindIII. Therefore, the full length of 39970524-5-564 and of the two bands of 39970524-6-1538 (1400 bp and 135 bp) could be cut out from the pGEMR-T Easy Vector by EcoRI. The labelled DNA of 39970524-5-564, used as a probe for Southern blotting with genomic DNA from Red Globe, Flame Seedless, Thompson Seedless, four seedless hybrids and three seeded hybrids, showed a unique hybrid band present in all seedless individuals, but absent in all seeded individuals. It was further indicated that the specific marker 39970524-5-564 was derived from the genome of seedless grapes.


Genetics ◽  
1994 ◽  
Vol 137 (3) ◽  
pp. 883-889 ◽  
Author(s):  
N T Miyashita ◽  
N Mori ◽  
K Tsunewaki

Abstract Restriction map variation in two 5-6-kb chloroplast DNA regions of five diploid Aegilops species in the section Sitopsis and two wild tetraploid wheats, Triticum dicoccoides and Triticum araraticum, was investigated with a battery of four-cutter restriction enzymes. A single accession each of Triticum durum, Triticum timopheevi and Triticum aestivum was included as a reference. More than 250 restriction sites were scored, of which only seven sites were found polymorphic in Aegilops speltoides. No restriction site polymorphisms were detected in all of the other diploid and tetraploid species. In addition, six insertion/deletion polymorphisms were detected, but they were mostly unique or species-specific. Estimated nucleotide diversity was 0.0006 for A. speltoides, and 0.0000 for all the other investigated species. In A. speltoides, none of Tajima's D values was significant, indicating no clear deviation from the neutrality of molecular polymorphisms. Significant non-random association was detected for three combinations out of 10 possible pairs between polymorphic restriction sites in A. speltoides. Phylogenetic relationship among all the plastotypes (plastid genotype) suggested the diphyletic origin of T. dicoccoides and T. araraticum. A plastotype of one A. speltoides accession was identical to the major type of T. araraticum (T. timopheevi inclusively). Three of the plastotypes found in the Sitopsis species are very similar, but not identical, to that of T. dicoccoides, T. durum and T. aestivum.


Genetics ◽  
1992 ◽  
Vol 130 (3) ◽  
pp. 629-638
Author(s):  
Y I Yeom ◽  
K Abe ◽  
K Artzt

Abstract Active gene transcription is known to promote genetic change in neighboring DNA. We reasoned that the change would be readily heritable if transcription was occurring in germ cells or early embryonic cells before the germ cells are set aside. The H-2K region of the major histocompatibility complex (MHC) provides a good vehicle for testing this hypothesis because it is replete with such genes. We have compared the amount of polymorphism in 240 kb of DNA contiguous with H-2K and 150 kb of DNA flanking a homologous duplicated region in t-haplotypes and inbred strains. Using 90 probes and three restriction enzymes, we find a staggering difference in the amount of polymorphism in the H-2K region vs. the duplicated region (26% vs. 0%) of t-haplotypes. The disparity in the rate of divergence between the two regions indicates that the spatial distribution of genes and their expression pattern might be important factors in sequence evolution. Since t-haplotypes normally show extremely limited variability among themselves due to their recent divergence from a single ancestor, these results imply that the mutation rate in the H-2K region is unusually high. This is in apparent contradiction to the current view that the MHC loci have evolved at the same rate as other loci. The implications for the evolution of the H-2K gene are discussed.


2021 ◽  
Vol 28 (2) ◽  
Author(s):  
Jia Huang

The Norton product is defined on each eigenspace of a distance regular graph by the orthogonal projection of the entry-wise product. The resulting algebra, known as the Norton algebra, is a commutative nonassociative algebra that is useful in group theory due to its interesting automorphism group. We provide a formula for the Norton product on each eigenspace of a Hamming graph using linear characters. We construct a large subgroup of automorphisms of the Norton algebra of a Hamming graph and completely describe the automorphism group in some cases. We also show that the Norton product on each eigenspace of a Hamming graph is as nonassociative as possible, except for some special cases in which it is either associative or equally as nonassociative as the so-called double minus operation previously studied by the author, Mickey, and Xu. Our results restrict to the hypercubes and extend to the halved and/or folded cubes, the bilinear forms graphs, and more generally, all Cayley graphs of finite abelian groups.


1985 ◽  
Vol 112 (2) ◽  
pp. 265-267 ◽  
Author(s):  
Maria Luisa Porceddu ◽  
Maria Giuseppa Corda ◽  
Enrico Sanna ◽  
Giovanni Biggio
Keyword(s):  
Type Ii ◽  

Sign in / Sign up

Export Citation Format

Share Document