scholarly journals An Improved Algorithm for The k-Dyck Edit Distance Problem

2022 ◽  
pp. 3650-3669
Author(s):  
Dvir Fried ◽  
Shay Golan ◽  
Tomasz Kociumaka ◽  
Tsvi Kopelowitz ◽  
Ely Porat ◽  
...  
2019 ◽  
Vol 106 ◽  
pp. 225-235 ◽  
Author(s):  
Mostafa Darwiche ◽  
Donatello Conte ◽  
Romain Raveaux ◽  
Vincent T’Kindt

Author(s):  
Maren Brand ◽  
Nguyen Khoa Tran ◽  
Philipp Spohr ◽  
Sven Schrinner ◽  
Gunnar W. Klau

AbstractWe consider the homo-edit distance problem, which is the minimum number of homo-deletions or homo-insertions to convert one string into another. A homo-insertion is the insertion of a string of equal characters into another string, while a homo-deletion is the inverse operation. We show how to compute the homo-edit distance of two strings in polynomial time: We first demonstrate that the problem is equivalent to computing a common subsequence of the two input strings with a minimum number of homo-deletions and then present a dynamic programming solution for the reformulated problem.2012 ACM Subject ClassificationApplied computing → Bioinformatics; Applied computing → Molecular sequence analysis; Theory of computation → Dynamic programming


2020 ◽  
Author(s):  
Samuel Briand ◽  
Christophe Dessimoz ◽  
Nadia El-Mabrouk ◽  
Yannis Nevers

AbstractMotivationComparing trees is a basic task for many purposes, and especially in phylogeny where different tree reconstruction tools may lead to different trees, likely representing contradictory evolutionary information. While a large variety of pairwise measures of similarity or dissimilarity have been developed for comparing trees with no information on internal nodes, very few address the case of inner node-labeled trees. Yet such trees are common; for instance reconciled gene trees have inner nodes labeled with the type of event giving rise to them, typically speciation or duplication. Recently, we proposed a formulation of the Labeled Robinson Foulds edit distance with edge extensions, edge contractions between identically labeled nodes, and node label flips. However, this distance proved difficult to compute, in particular because shortest edit paths can require contracting “good” edges, i.e. edges present in the two trees.ResultsHere, we report on a different formulation of the Labeled Robinson Foulds edit distance — based on node insertion, deletion and label substitution — which we show can be computed in linear time. The new formulation also maintains other desirable properties: being a metric, reducing to Robinson Foulds for unlabeled trees and maintaining an intuitive interpretation. The new distance is computable for an arbitrary number of label types, thus making it useful for applications involving not only speciations and duplications, but also horizontal gene transfers and further events associated with the internal nodes of the tree. To illustrate the utility of the new distance, we use it to study the impact of taxon sampling on labeled gene tree inference, and conclude that denser taxon sampling yields better trees.


Sign in / Sign up

Export Citation Format

Share Document