alphabet size
Recently Published Documents


TOTAL DOCUMENTS

67
(FIVE YEARS 16)

H-INDEX

10
(FIVE YEARS 2)

2021 ◽  
Vol 68 (5) ◽  
pp. 1-39
Author(s):  
Bernhard Haeupler ◽  
Amirbehshad Shahrasbi

We introduce synchronization strings, which provide a novel way to efficiently deal with synchronization errors, i.e., insertions and deletions. Synchronization errors are strictly more general and much harder to cope with than more commonly considered Hamming-type errors, i.e., symbol substitutions and erasures. For every ε > 0, synchronization strings allow us to index a sequence with an ε^{-O(1)}-size alphabet, such that one can efficiently transform k synchronization errors into (1 + ε)k Hamming-type errors. This powerful new technique has many applications. In this article, we focus on designing insdel codes, i.e., error correcting block codes (ECCs) for insertion-deletion channels. While ECCs for both Hamming-type errors and synchronization errors have been intensely studied, the latter has largely resisted progress. As Mitzenmacher puts it in his 2009 survey [30]: “Channels with synchronization errors...are simply not adequately understood by current theory. Given the near-complete knowledge we have for channels with erasures and errors...our lack of understanding about channels with synchronization errors is truly remarkable.” Indeed, it took until 1999 for the first insdel codes with constant rate, constant distance, and constant alphabet size to be constructed, and only since 2016 are there constructions of constant rate insdel codes for asymptotically large noise rates. Even in the asymptotically large or small noise regimes, these codes are polynomially far from the optimal rate-distance tradeoff. This makes the understanding of insdel codes up to this work equivalent to what was known for regular ECCs after Forney introduced concatenated codes in his doctoral thesis 50 years ago. A straightforward application of our synchronization strings-based indexing method gives a simple black-box construction that transforms any ECC into an equally efficient insdel code with only a small increase in the alphabet size.
This instantly transfers much of the highly developed understanding for regular ECCs into the realm of insdel codes. Most notably, for the complete noise spectrum, we obtain efficient “near-MDS” insdel codes, which get arbitrarily close to the optimal rate-distance tradeoff given by the Singleton bound. In particular, for any δ ∈ (0,1) and ε > 0, we give a family of insdel codes achieving a rate of 1 - δ - ε over a constant-size alphabet that efficiently corrects a δ fraction of insertions or deletions.


2021 ◽  
Author(s):  
Mira Gonen ◽  
Michael Langberg ◽  
Alex Sprintson

Author(s):  
Stefano Crespi Reghizzi ◽  
Antonio Restivo ◽  
Pierluigi San Pietro

Author(s):  
Nobuya Kimoto ◽  
Shigetaka Nakamura ◽  
Ken Komiya ◽  
Kenzo Fujimoto ◽  
Satoshi Kobayashi

2020 ◽  
Vol 21 (19) ◽  
pp. 7392
Author(s):  
Peter R. Wills ◽  
Charles W. Carter

We recently observed that errors in gene replication and translation could be seen qualitatively to behave analogously to the impedances in acoustical and electronic energy transducing systems. We develop here quantitative relationships necessary to confirm that analogy and to place it into the context of the minimization of dissipative losses of both chemical free energy and information. The formal developments include expressions for the information transferred from a template to a new polymer, I_σ; an impedance parameter, Z; and an effective alphabet size, n_eff; all of which have non-linear dependences on the fidelity parameter, q, and the alphabet size, n. Surfaces of these functions over the {n,q} plane reveal key new insights into the origin of coding. Our conclusion is that the emergence and evolutionary refinement of information transfer in biology follow principles previously identified to govern physical energy flows, strengthening analogies (i) between chemical self-organization and biological natural selection, and (ii) between the course of evolutionary trajectories and the most probable pathways for time-dependent transitions in physics. Matching the informational impedance of translation to the four-letter alphabet of genes uncovers a pivotal role for the redundancy of triplet codons in preserving as much intrinsic genetic information as possible, especially in early stages when the coding alphabet size was small.


2020 ◽  
pp. 104614 ◽  
Author(s):  
Henk Don ◽  
Hans Zantema ◽  
Michiel de Bondt

2020 ◽  
Vol 66 (3) ◽  
pp. 1474-1481
Author(s):  
Hamed Narimani ◽  
Mohammadali Khosravifard
