DCA: An efficient implementation of the divide-and-conquer approach to simultaneous multiple sequence alignment

A new performance assessment model is proposed for multiple sequence alignment. The strategy builds on the beam construction of the DC-BTA algorithm, a divide-and-conquer alignment method that uses beams. Beams are blocks of nearly identical columns and contribute the largest similarity weight to the sequences. A formula that sums all beam areas covering a sequence assigns a value, or weight, to that sequence. The total beam area is then taken as a fraction of the whole alignment, giving a rate between 0 and 1 that assesses performance. Because the beam areas are easy to collect, this scheme is a simple and effective assessment policy within DC-BTA.
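The abstract does not give the beam-area formula, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes a beam is a run of at least `min_width` consecutive columns in which at least a fraction `threshold` of the rows share the same non-gap residue, and that a beam's area is the number of agreeing cells it contains. The names `column_agreement`, `beam_rate`, `threshold` and `min_width` are hypothetical and are not taken from DC-BTA.

```python
from collections import Counter


def column_agreement(alignment, col):
    """Count how many rows carry the most common non-gap residue in a column."""
    residues = [row[col] for row in alignment if row[col] != "-"]
    if not residues:
        return 0
    return Counter(residues).most_common(1)[0][1]


def beam_rate(alignment, threshold=0.8, min_width=2):
    """Return (assumed) total beam area divided by alignment area, in [0, 1].

    Assumes a non-empty alignment whose rows all have the same length.
    The beam definition used here is an illustrative assumption.
    """
    rows = len(alignment)
    cols = len(alignment[0])
    need = threshold * rows
    total_area = 0
    run_area = 0   # agreeing cells in the current run of conserved columns
    run_width = 0  # number of columns in the current run
    # Iterate one step past the last column so a trailing run is closed too.
    for col in range(cols + 1):
        agree = column_agreement(alignment, col) if col < cols else 0
        if col < cols and agree >= need:
            run_area += agree
            run_width += 1
        else:
            if run_width >= min_width:
                total_area += run_area
            run_area = 0
            run_width = 0
    return total_area / (rows * cols)


if __name__ == "__main__":
    example = ["ACGT-A", "ACGTTA", "ACGAGA"]
    print(beam_rate(example))
```

In this toy alignment only the first three columns form a beam under the assumed definition; the single conserved final column is too narrow to count. The rate is the covered share of the 18 alignment cells.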


Improving the divide-and-conquer approach to sum-of-pairs multiple sequence alignment

Applied Mathematics Letters ◽

10.1016/s0893-9659(97)00013-x ◽

1997 ◽

Vol 10 (2) ◽

pp. 67-73 ◽

Cited By ~ 13

Author(s):

J. Stoye ◽

S.W. Perrey ◽

A.W.M. Dress

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Divide And Conquer ◽

Multiple Sequence


A Divide-and-Conquer Method for Multiple Sequence Alignment on Multi-core Computers

Communications in Computer and Information Science - Parallel Computational Fluid Dynamics ◽

10.1007/978-3-642-53962-6_41 ◽

2014 ◽

pp. 460-469

Author(s):

Xiangyuan Zhu

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Divide And Conquer ◽

Multiple Sequence


An efficient algorithm for multiple sequence alignment based on ant colony optimisation and divide‐and‐conquer method

New Zealand Journal of Agricultural Research ◽

10.1080/00288230709510330 ◽

2007 ◽

Vol 50 (5) ◽

pp. 617-626 ◽

Cited By ~ 4

Author(s):

Wei Liu ◽

Ling Chen ◽

Juan Chen

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Efficient Algorithm ◽

Ant Colony ◽

Divide And Conquer ◽

Ant Colony Optimisation ◽

Multiple Sequence


Multiple Sequence Alignment by Ant Colony Optimization and Divide-and-Conquer

Computational Science – ICCS 2006 - Lecture Notes in Computer Science ◽

10.1007/11758525_88 ◽

2006 ◽

pp. 646-653 ◽

Cited By ~ 4

Author(s):

Yixin Chen ◽

Yi Pan ◽

Juan Chen ◽

Wei Liu ◽

Ling Chen

Keyword(s):

Ant Colony Optimization ◽

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Ant Colony ◽

Divide And Conquer ◽

Multiple Sequence


Multiple sequence alignment with the divide-and-conquer method

Gene ◽

10.1016/s0378-1119(98)00097-3 ◽

1998 ◽

Vol 211 (2) ◽

pp. GC45-GC56 ◽

Cited By ~ 54

Author(s):

Jens Stoye

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Divide And Conquer ◽

Multiple Sequence


A Greedy Clustering Algorithm for Multiple Sequence Alignment

International Journal of Cognitive Informatics and Natural Intelligence ◽

10.4018/ijcini.20211001oa28 ◽

2021 ◽

Vol 15 (4) ◽

pp. 0-0

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Clustering Algorithm ◽

Optimization Procedure ◽

Search Space ◽

Divide And Conquer ◽

Biological Sequence ◽

Multiple Sequence ◽

Biological Sequence Analysis ◽

Np Hard Problem

This paper presents a strategy to tackle the Multiple Sequence Alignment (MSA) problem, one of the most important tasks in biological sequence analysis. MSA aligns sequences in their entirety to derive relationships and common characteristics among a set of protein or nucleotide sequences. The MSA problem has been proved to be NP-hard. The proposed strategy builds on the well-known divide-and-conquer paradigm: sequences are clustered as a preliminary step to improve the final alignment, and this decomposition can be used as an optimization procedure with any MSA aligner to explore promising alignments in the search space. The authors propose aligning the clusters in a parallel and distributed way in order to benefit from parallel architectures. The strategy was tested on classical benchmarks such as BAliBASE, Sabre, Prefab4 and Oxm, and the experimental results show that it compares well with other aligners.
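The abstract does not specify how the sequences are clustered, so the sketch below only illustrates what a greedy clustering pre-step could look like. It assumes similarity is measured as the Jaccard index of k-mer sets and that each sequence joins the first cluster whose founding sequence is similar enough. The function names, `threshold` and `k` are hypothetical choices and are not the authors' algorithm.

```python
def kmers(seq, k=3):
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard index of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def greedy_cluster(seqs, threshold=0.5, k=3):
    """Greedily assign each sequence to the first sufficiently similar cluster.

    Each cluster is represented by the k-mer set of its first member
    (an illustrative assumption); a sequence that matches no existing
    cluster starts a new one.
    """
    clusters = []  # list of (representative k-mer set, member list)
    for seq in seqs:
        profile = kmers(seq, k)
        for representative, members in clusters:
            if jaccard(profile, representative) >= threshold:
                members.append(seq)
                break
        else:
            clusters.append((profile, [seq]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    data = ["ACGTACGT", "ACGTACGA", "TTTTGGGG", "TTTTGGGC"]
    for group in greedy_cluster(data):
        print(group)
```

Each resulting cluster could then be handed to any aligner independently, which is what makes the decomposition a natural fit for parallel execution.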


Hybrid Genetics Algorithms for Multiple Sequence Alignment

Handbook of Research on Modern Optimization Algorithms and Applications in Engineering and Economics - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-4666-9644-0.ch013 ◽

2016 ◽

pp. 346-366 ◽

Cited By ~ 1

Author(s):

John Tsiligaridis

Keyword(s):

Genetic Algorithm ◽

Tabu Search ◽

Traveling Salesman Problem ◽

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Hybrid Genetic Algorithm ◽

Traveling Salesman ◽

Divide And Conquer ◽

Multiple Sequence ◽

The Traveling Salesman Problem

The purpose of this chapter is to present a set of algorithms, and their efficiency, for the consistency-based Multiple Sequence Alignment (MSA) problem. Building on the strength and adaptability of the Genetic Algorithm (GA), two approaches are developed depending on the MSA type. The first approach, for unrelated sequences (no consistency), uses a Hybrid Genetic Algorithm (GA_TS) that incorporates Tabu Search (TS). The Traveling Salesman Problem (TSP) is also applied to determine MSA orders. The second approach, for sequences with consistency, uses a hybrid GA based on the Divide and Conquer principle (DCP), which can save space. A consistent dot matrices (CDM) algorithm discovers consistency and creates the MSA. The proposed GA (GA_TS_VS) also uses TS but works with partitions. In conclusion, GAs are stochastic approaches that prove very beneficial for MSA in terms of performance.


MAGUS: Multiple sequence Alignment using Graph clUStering

Bioinformatics ◽

10.1093/bioinformatics/btaa992 ◽

2020 ◽

Author(s):

Vladimir Smirnov ◽

Tandy Warnow

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Large Scale ◽

Graph Clustering ◽

Divide And Conquer ◽

Supplementary Information ◽

Sequence Alignments ◽

Multiple Sequence ◽

Full Dataset ◽

A New Technique

Abstract

Motivation: The estimation of large multiple sequence alignments (MSAs) is a basic bioinformatics challenge. Divide-and-conquer is a useful approach that has been shown to improve the scalability and accuracy of MSA estimation in established methods such as SATé and PASTA. In these divide-and-conquer strategies, a sequence dataset is divided into disjoint subsets, alignments are computed on the subsets using base MSA methods (e.g. MAFFT), and then merged together into an alignment on the full dataset.

Results: We present MAGUS, Multiple sequence Alignment using Graph clUStering, a new technique for computing large-scale alignments. MAGUS is similar to PASTA in that it uses nearly the same initial steps (starting tree, similar decomposition strategy, and MAFFT to compute subset alignments), but then merges the subset alignments using the Graph Clustering Merger, a new method for combining disjoint alignments that we present in this study. Our study, on a heterogeneous collection of biological and simulated datasets, shows that MAGUS produces improved accuracy and is faster than PASTA on large datasets, and matches it on smaller datasets.

Availability and implementation: MAGUS: https://github.com/vlasmirnov/MAGUS

Supplementary information: Supplementary data are available at Bioinformatics online.
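The divide, align and merge pipeline described in this abstract can be outlined as a small skeleton. This is a sketch of the control flow only, under loud assumptions: `split_disjoint` chunks names in sorted order rather than using a starting tree, `pad_align` is a placeholder that right-pads with gaps instead of calling a real aligner such as MAFFT, and `pad_merge` is a trivial stand-in that is not the Graph Clustering Merger. All function names are hypothetical.

```python
def split_disjoint(names, max_size):
    """Partition sequence names into disjoint consecutive subsets."""
    return [names[i:i + max_size] for i in range(0, len(names), max_size)]


def pad_align(seqs):
    """Placeholder 'aligner': right-pad every sequence with gaps.

    A real pipeline would call a base MSA method here instead.
    """
    width = max(len(s) for s in seqs.values())
    return {name: s.ljust(width, "-") for name, s in seqs.items()}


def pad_merge(sub_alignments):
    """Placeholder merger: pool all rows and pad them to a common width."""
    merged = {}
    for sub in sub_alignments:
        merged.update(sub)
    return pad_align(merged)


def divide_and_conquer_msa(seqs, max_size, align=pad_align, merge=pad_merge):
    """Divide into disjoint subsets, align each subset, then merge."""
    subsets = split_disjoint(sorted(seqs), max_size)
    sub_alignments = [align({n: seqs[n] for n in subset}) for subset in subsets]
    return merge(sub_alignments)


if __name__ == "__main__":
    toy = {"a": "ACGT", "b": "AC", "c": "ACGTAA", "d": "A"}
    for name, row in divide_and_conquer_msa(toy, max_size=2).items():
        print(name, row)
```

Passing the aligner and merger as parameters keeps the skeleton independent of any particular tool, so a real base method or merging step could be substituted without changing the surrounding flow.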


A Greedy Clustering Algorithm for Multiple Sequence Alignment

International Journal of Cognitive Informatics and Natural Intelligence ◽

10.4018/ijcini.20211001.oa41 ◽

2021 ◽

Vol 15 (4) ◽

pp. 1-17

Author(s):

Rabah Lebsir ◽

Abdesslem Layeb ◽

Tahi Fariza

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Clustering Algorithm ◽

Optimization Procedure ◽

Search Space ◽

Divide And Conquer ◽

Biological Sequence ◽

Multiple Sequence ◽

Biological Sequence Analysis ◽

Np Hard Problem

