String Algorithms

Data Structures to Represent a Set of k -long DNA Sequences

ACM Computing Surveys ◽

10.1145/3445967 ◽

2021 ◽

Vol 54 (1) ◽

pp. 1-22

Author(s):

Rayan Chikhi ◽

Jan Holub ◽

Paul Medvedev

Keyword(s):

Data Structures ◽

Dna Sequences ◽

Sequencing Data ◽

String Algorithms ◽

Fixed Length ◽

The Past

The analysis of biological sequencing data has been one of the biggest applications of string algorithms. The approaches used in many such applications are based on the analysis of k -mers, which are short fixed-length strings present in a dataset. While these approaches are rather diverse, storing and querying a k -mer set has emerged as a shared underlying component. A set of k -mers has unique features and applications that, over the past 10 years, have resulted in many specialized approaches for its representation. In this survey, we give a unified presentation and comparison of the data structures that have been proposed to store and query a k -mer set. We hope this survey will serve as a resource for researchers in the field as well as make the area more accessible to researchers outside the field.

Download Full-text

Predecessor Search, String Algorithms and Data Structures

Encyclopedia of Algorithms ◽

10.1007/978-1-4939-2864-4_632 ◽

2016 ◽

pp. 1605-1611

Author(s):

Djamal Belazzougui

Keyword(s):

Data Structures ◽

Algorithms And Data Structures ◽

String Algorithms ◽

Search String

Download Full-text

Undergraduate Topics in Computer Science - Guide to Competitive Programming ◽

10.1007/978-3-030-39357-1_14 ◽

2020 ◽

pp. 243-261

Author(s):

Antti Laaksonen

Keyword(s):

String Algorithms

Download Full-text

Predecessor Search, String Algorithms and Data Structures

Encyclopedia of Algorithms ◽

10.1007/978-3-642-27848-8_632-2 ◽

2015 ◽

pp. 1-8

Author(s):

Djamal Belazzougui

Keyword(s):

Data Structures ◽

Algorithms And Data Structures ◽

String Algorithms ◽

Search String

Download Full-text

Fuzzy String Matching Procedure

The Open Bioinformatics Journal ◽

10.2174/1875036202013010050 ◽

2020 ◽

Vol 13 (1) ◽

pp. 50-56

Author(s):

Zekâi Şen

Keyword(s):

Probability Distribution ◽

Fuzzy Number ◽

String Matching ◽

Distribution Functions ◽

Number Representation ◽

String Algorithms ◽

Collective Behaviors ◽

Text String ◽

Probability Distribution Functions ◽

Random Variability

Background: There are different methodologies for DNA comparison based on two string algorithms, which are dependent on crisp logical principles, where there is no room for verbal (linguistic) uncertainty. These are successfully applicable procedures in DNA bioinformatics researches even by taking into consideration probabilistic random variability components based on the probability distribution functions of various types. Objective: The main purpose of this paper is to review first briefly all available DNA string matching methodologies that are based on crisp logic and then to suggest a new method based on the fuzzy logic rules and application. Methods: There are different methodologies for DNA comparison based on two string algorithms, which are dependent on crisp logical principles, where there is no room for verbal (linguistic) uncertainty. These are successfully applicable procedures in DNA bioinformatics researchers even by taking into consideration probabilistic random variability components based on the probability distribution functions of various types. Results: Fuzzy number representation of each gene implies some sort of uncertainty or unhealthiness in some or all the genes. Their better identifications can be achieved on the basis of fuzzy numbers with different membership degrees, which imply the unhealthiness or healthiness of the genes and their collective behaviors. Conclusion: After the development of fuzzy number representation of the text string coupled with crisp pattern string their relationships are searched at different shift operations, and hence, the possibility of defaulters are identified in the text string with a certain degree of membership.

Download Full-text

Some Applications of String Algorithms in Human-Computer Interaction

Algorithms and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-642-12476-1_14 ◽

2010 ◽

pp. 196-209 ◽

Cited By ~ 6

Author(s):

Kari-Jouko Räihä

Keyword(s):

Human Computer Interaction ◽

String Algorithms ◽

Computer Interaction

Download Full-text

String Algorithms

Algorithmic Aspects of Bioinformatics - Natural Computing Series ◽

10.1007/978-3-540-71913-7_4 ◽

2007 ◽

pp. 37-79

Keyword(s):

String Algorithms

Download Full-text

Parallel Implementation of Median String Algorithms.

10.1109/sccc54552.2021.9650389 ◽

2021 ◽

Author(s):

Pedro Mirabal ◽

Ignacio Lincolao-Venegas ◽

Mario Castillo-Sanhueza ◽

Jose Abreu

Keyword(s):

Parallel Implementation ◽

String Algorithms ◽

Median String

Download Full-text

A Survey on Shortest Unique Substring Queries

Algorithms ◽

10.3390/a13090224 ◽

2020 ◽

Vol 13 (9) ◽

pp. 224

Author(s):

Paniz Abedin ◽

M. Oğuzhan Külekci ◽

Shama V. Thankachan

Keyword(s):

Information Retrieval ◽

String Algorithms ◽

Active Line ◽

Recent Developments

The shortest unique substring (SUS) problem is an active line of research in the field of string algorithms and has several applications in bioinformatics and information retrieval. The initial version of the problem was proposed by Pei et al. [ICDE’13]. Over the years, many variants and extensions have been pursued, which include positional-SUS, interval-SUS, approximate-SUS, palindromic-SUS, range-SUS, etc. In this article, we highlight some of the key results and summarize the recent developments in this area.

Download Full-text

BEYOND STRING ALGORITHMS: PROTEIN SEQUENCE ANALYSIS USINGWAVELET TRANSFORMS

Analysis of Biological Data - Science, Engineering, and Biology Informatics ◽

10.1142/9789812708892_0005 ◽

2007 ◽

pp. 109-131 ◽

Cited By ~ 1

Author(s):

Arun Krishnan ◽

Kuo-Bin Li

Keyword(s):

Sequence Analysis ◽

Protein Sequence ◽

Protein Sequence Analysis ◽

String Algorithms

Download Full-text