Calculating PSSM probabilities with lazy dynamic programming

Position-specific scoring matrices are one way to represent approximate string patterns, which are commonly encountered in the field of bioinformatics. An important problem that arises with their application is calculating the statistical significance of matches. We review the currently most efficient algorithm for this task, and show how it can be implemented in Haskell, taking advantage of the built-in non-strictness of the language. The resulting program turns out to be an instance of dynamic programming, using lists rather the typical dynamic programming matrix.

Download Full-text

Data dependency reduction in Dynamic Programming matrix

2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE) ◽

10.1109/jcsse.2011.5930126 ◽

2011 ◽

Cited By ~ 2

Author(s):

Guillermo Delgado ◽

Chatchawit Aporntewan

Keyword(s):

Dynamic Programming ◽

Data Dependency ◽

Dynamic Programming Matrix

Download Full-text

Perbandingan Algoritma Cheapest Insertion Heuristics dan Pemrograman Dinamis untuk Penyelesaian Traveling Salesman Problem

JES-MAT (Jurnal Edukasi dan Sains Matematika) ◽

10.25134/jes-mat.v1i1.242 ◽

2015 ◽

Vol 1 (1) ◽

Author(s):

Anggar Titis Prayitno

Keyword(s):

Dynamic Programming ◽

Traveling Salesman Problem ◽

Calculation Result ◽

Efficient Algorithm ◽

Time Complexity ◽

Traveling Salesman ◽

Algorithm Efficiency ◽

Heuristics Algorithm ◽

Complexity Algorithm

ABSTRACT Traveling Salesman Problem (TSP) is one of combinatorics optimation problem to find the possible shorthest path that can be obtained if a salesman visit each city exactly once and return to the starting city. The shorthest path searching can be done by Cheapest Insertion Heuristics algorithm and Dynamic Programming. Each algorithm has different efficiency to find shorthest path. Algorithm efficiency is determined based on time complexity. Algorithm wich has the smallest time complexity is the most efficient algorithm. Based on the calculation result, the time complexity of Cheapest Insertion Heuristics algorithm is and Dynamic Programming is . Therefore, for Cheapest Insertion Heuristics Algorithm is more efficient algorithm than Dynamic Programming in TSP solving. Keywords : Traveling Salesman Problem, Cheapest Insertion Heuristics Algorithm, Dynamic Programming, and Algorithm time complexity.

Download Full-text

Dynamic Programming Algorithms Applied to Musical Counterpoint in Process Composition: An Example Using Henri Pousseur’s Scambi

10.20944/preprints202006.0359.v1 ◽

2020 ◽

Cited By ~ 1

Author(s):

Louis J. Cochrane ◽

Derek Gatherer

Keyword(s):

Dynamic Programming ◽

Electronic Music ◽

Dynamic Programming Algorithm ◽

Pairwise Alignment ◽

Distance Matrix ◽

Programming Algorithm ◽

Biological Sequence ◽

Dynamic Programming Matrix ◽

Process Composition ◽

Programming Algorithms

The Needleman-Wunsch process is a classic tool in bioinformatics, being a dynamic programming algorithm that performs a pairwise alignment of two input biological sequences, either protein or nucleic acid. A distance matrix between the tokens used in the sequences is also required as input. The distance matrix is used to generate a positional pairwise similarity matrix between the input sequences, which is in turn used to generate a dynamic programming matrix. The best path through the dynamic programming matrix is navigated using a traceback procedure that maximises similarity, inserting gaps as necessary. Needleman-Wunsch can align both nucleic acids or proteins, which use alphabets of size 4 and 20 tokens respectively. It can also be applied to any other kind of sequence where distance matrices can be specified. Here, we apply it to chains of Pousseur’s Scambi electronic music fragments, of which there are 32, and which Pousseur categorised by their sonic properties, thus permitting the consecutive construction of distance, similarity and dynamic programming matrices. Traceback through the dynamic programming matrix thus produces contrapuntal duet compositions in which two Scambi chains are played in the maximally euphonious manner, providing also an illustration of the principles of biological sequence alignment in sound.

Download Full-text

Panning for Genes—A Visual Strategy for Identifying Novel Gene Orthologs and Paralogs

Genome Research ◽

10.1101/gr.9.4.373 ◽

1999 ◽

Vol 9 (4) ◽

pp. 373-382

Author(s):

Jacques D. Retief ◽

Kevin R. Lynch ◽

William R. Pearson

Keyword(s):

Glutathione Transferase ◽

Expressed Sequence Tag ◽

Statistical Significance ◽

Query Sequence ◽

Gene Families ◽

Protein Alignment ◽

Significant Similarity ◽

Protein Superfamilies ◽

Scoring Matrices ◽

Gene Orthologs

We have developed a rapid visual method for identifying novel members of gene families. Starting with an evolutionary tree, 20–50 protein query sequences for a gene family are selected from different branches of the tree. These query sequences are used to search the GenBank and expressed sequence tag (EST) DNA databases and their nightly updates using the tfastx3 or tfasty3 programs. The results of all 20–50 searches are collated and resorted to highlight EST or genomic sequences that share significant similarity with the query sequences. The statistical significance of each DNA/protein alignment is plotted, highlighting the portion of the query sequence that is present in the database sequence and the percent identity in the aligned region. The collated results for database sequences are linked using the WWW to the underlying scores and alignments; these links can also be used to perform additional searches to characterize the novel sequence further. With traditional “deep” scoring matrices (BLOSUM50) one can search for previously unrecognized families of large protein superfamilies. Alternatively, by using query sequences and EST libraries from the same species (e.g., human or mouse) together with “shallow” scoring matrices and filters that remove high-identity sequences, one can highlight new paralogs of previously described subfamilies. Using query sequences from the glutathione transferase superfamily, we identified two novel mammalian glutathione transferase families that were recognized previously only in plants. Using query sequences from known mammalian glutathione transferase subfamilies, we identified new candidate paralogs from the mouse class-mu, class-pi, and class-theta families.

Download Full-text

An efficient algorithm for control action sequences in FMS using dynamic programming

SICE 2000. Proceedings of the 39th SICE Annual Conference. International Session Papers (IEEE Cat. No.00TH8545) ◽

10.1109/sice.2000.889683 ◽

2002 ◽

Cited By ~ 1

Author(s):

Jae Won Choi ◽

Jae Weon Choi

Keyword(s):

Dynamic Programming ◽

Efficient Algorithm ◽

Control Action ◽

Action Sequences

Download Full-text

An Efficient Algorithm for Mapping of Reads to a Genome Graph Using an Index Based on Hash Tables and Dynamic Programming

BIOPHYSICS ◽

10.1134/s0006350918030193 ◽

2018 ◽

Vol 63 (3) ◽

pp. 311-317 ◽

Cited By ~ 2

Author(s):

S. N. Petrov ◽

L. A. Uroshlev ◽

A. S. Kasyanov ◽

V. Yu. Makeev

Keyword(s):

Dynamic Programming ◽

Efficient Algorithm ◽

Hash Tables ◽

A Genome ◽

Genome Graph

Download Full-text

Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences

Nucleic Acids Research ◽

10.1093/nar/gkf438 ◽

2002 ◽

Vol 30 (14) ◽

pp. 3214-3224 ◽

Cited By ~ 69

Author(s):

M. C. Frith

Keyword(s):

Statistical Significance ◽

Nucleotide Sequences ◽

Scoring Matrices

Download Full-text

A FAST DYNAMIC PROGRAMMING ALGORITHM FOR STOPE BOUNDARY LAYOUT FOR UNDERGROUND MINE

International Journal of Advanced Research ◽

10.21474/ijar01/13378 ◽

2021 ◽

Vol 9 (09) ◽

pp. 86-90

Author(s):

Sakirudeen A. Abdulsalaam ◽

Keyword(s):

Dynamic Programming ◽

Efficient Algorithm ◽

Dynamic Programming Algorithm ◽

Underground Mine ◽

Three Dimensions ◽

Programming Algorithm ◽

Mining Site ◽

Physical Constraints ◽

Ore Body ◽

Fast Dynamic

We developed an efficient algorithm that generates optimal stope layout for an underground mine. After a mining site has been identified and an exploration has been done, the data gathered is analysed and a modelling technique is applied to produce an ore body. The ore body is divided into thousands of mining blocks in three dimensions. The blocks are assigned values per tonne. The miners desire a stope layout which maximizes the mine value. In this paper, we present a fast algorithm that generates the stope layout efficiently without violating the physical constraints.

Download Full-text