Parallelization of the dynamic programming algorithm for solving the longest common subsequence problem

To obtain the target webpages from many webpages, we proposed a Method for Filtering Pages by Similarity Degree based on Dynamic Programming (MFPSDDP). The method needs to use one of three same relationships proposed between two nodes, so we give the definition of the three same relationships. The biggest innovation of MFPSDDP is that it does not need to know the structures of webpages in advance. First, we address the design ideas with queue and double threads. Then, a dynamic programming algorithm for calculating the length of the longest common subsequence and a formula for calculating similarity are proposed. Further, for obtaining detailed information webpages from 200,000 webpages downloaded from the famous website “www.jd.com”, we choose the same relationship Completely Same Relationship (CSR) and set the similarity threshold to 0.2. The Recall Ratio (RR) of MFPSDDP is in the middle in the four filtering methods compared. When the number of webpages filtered is nearly 200,000, the PR of MFPSDDP is highest in the four filtering methods compared, which can reach 85.1%. The PR of MFPSDDP is 13.3 percentage points higher than the PR of a Method for Filtering Pages by Containing Strings (MFPCS).

Download Full-text

Automata Technique for The LCS Problem

Journal of Computer Science and Cybernetics ◽

10.15625/1813-9663/35/1/13293 ◽

2019 ◽

Vol 35 (1) ◽

pp. 21-37

Author(s):

Trường Huy Nguyễn

Keyword(s):

Dynamic Programming ◽

Parallel Algorithms ◽

Time Complexity ◽

Dynamic Programming Algorithm ◽

Longest Common Subsequence ◽

Programming Algorithm ◽

Worst Case ◽

Common Subsequence ◽

Input Strings ◽

Upper Estimate

In this paper, we introduce two eﬃcient algorithms in practice for computing the length of a longest common subsequence of two strings, using automata technique, in sequential and parallel ways. For two input strings of lengths m and n with m ≤ n, the parallel algorithm uses k processors (k ≤ m) and costs time complexity O(n) in the worst case, where k is an upper estimate of the length of a longest common subsequence of the two strings. These results are based on the Knapsack Shaking approach proposed by P. T. Huy et al. in 2002. Experimental results show that for the alphabet of size 256, our sequential and parallel algorithms are about 65.85 and 3.41m times faster than the standard dynamic programming algorithm proposed by Wagner and Fisher in 1974, respectively.

Download Full-text

Dynamic programming algorithms for the mosaic longest common subsequence problem

Information Processing Letters ◽

10.1016/j.ipl.2006.11.006 ◽

2007 ◽

Vol 102 (2-3) ◽

pp. 99-103 ◽

Cited By ~ 16

Author(s):

Kuo-Si Huang ◽

Chang-Biau Yang ◽

Kuo-Tsung Tseng ◽

Yung-Hsing Peng ◽

Hsing-Yen Ann

Keyword(s):

Dynamic Programming ◽

Longest Common Subsequence ◽

Longest Common Subsequence Problem ◽

Common Subsequence ◽

Programming Algorithms

Download Full-text

Time Efficient Segmented Technique for Dynamic Programming Based Algorithms with FPGA Implementation

Journal of Circuits System and Computers ◽

10.1142/s021812661950227x ◽

2019 ◽

Vol 28 (13) ◽

pp. 1950227

Author(s):

Talal Bonny ◽

Ridhwan Al Debsi ◽

Mohamed Basel Almourad

Keyword(s):

Dynamic Programming ◽

Sequence Alignment ◽

Input Parameter ◽

Dynamic Programming Algorithm ◽

Computation Time ◽

Longest Common Subsequence ◽

Programming Algorithm ◽

Optimization Approach ◽

Alignment Algorithm ◽

Optimal Sequence

Although dynamic programming (DP) is an optimization approach used to solve a complex problem fast, the time required to solve it is still not efficient and grows polynomially with the size of the input. In this contribution, we improve the computation time of the dynamic programming based algorithms by proposing a novel technique, which is called “SDP: Segmented Dynamic programming”. SDP finds the best way of splitting the compared sequences into segments and then applies the dynamic programming algorithm to each segment individually. This will reduce the computation time dramatically. SDP may be applied to any dynamic programming based algorithm to improve its computation time. As case studies, we apply the SDP technique on two different dynamic programming based algorithms; “Needleman–Wunsch (NW)”, the widely used program for optimal sequence alignment, and the LCS algorithm, which finds the “Longest Common Subsequence” between two input strings. The results show that applying the SDP technique in conjunction with the DP based algorithms improves the computation time by up to 80% in comparison to the sole DP algorithms, but with small or ignorable degradation in comparing results. This degradation is controllable and it is based on the number of split segments as an input parameter. However, we compare our results with the well-known heuristic FASTA sequence alignment algorithm, “GGSEARCH”. We show that our results are much closer to the optimal results than the “GGSEARCH” algorithm. The results are valid independent from the sequences length and their level of similarity. To show the functionality of our technique on the hardware and to verify the results, we implement it on the Xilinx Zynq-7000 FPGA.

Download Full-text