Parallelization of Pairwise Alignment and Neighbor-Joining Algorithm in Progressive Multiple Sequence Alignment

Progressive multiple sequence alignment ClustalW is a widely used heuristic method for computing multiple sequence alignment (MSA). It has three stages: distance matrix computation using pairwise alignment, guide tree reconstruction using neighbor-joining and progressive alignment. To accelerate computing for large data, the progressive MSA algorithm needs to be parallelized. This research aims to identify, decompose and implement the pairwise alignment and neighbor-joining in progressive MSA using message passing, shared memory and hybrid programming model in the computer cluster. The experimental results obtained shared memory programming model as the best scenario implementation with speed up up to 12 times.

Download Full-text

Progressive multiple sequence alignment with indel evolution

BMC Bioinformatics ◽

10.1186/s12859-018-2357-1 ◽

2018 ◽

Vol 19 (1) ◽

Cited By ~ 1

Author(s):

Massimo Maiolo ◽

Xiaolei Zhang ◽

Manuel Gil ◽

Maria Anisimova

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Multiple Sequence ◽

Progressive Multiple Sequence Alignment

Download Full-text

Multiple Sequence Alignment Optimization Using Meta-Heuristic Techniques

Data Analytics in Medicine ◽

10.4018/978-1-7998-1204-3.ch031 ◽

2020 ◽

pp. 565-579 ◽

Cited By ~ 1

Author(s):

Mohamed Issa ◽

Aboul Ella Hassanien

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Phylogenetic Trees ◽

Pairwise Alignment ◽

Accurate Method ◽

Alignment Algorithm ◽

Bacterial Foraging Optimization ◽

Multiple Sequence ◽

Speed Up ◽

Dna Fragment Assembly

Sequence alignment is a vital process in many biological applications such as Phylogenetic trees construction, DNA fragment assembly and structure/function prediction. Two kinds of alignment are pairwise alignment which align two sequences and Multiple Sequence alignment (MSA) that align sequences more than two. The accurate method of alignment is based on Dynamic Programming (DP) approach which suffering from increasing time exponentially with increasing the length and the number of the aligned sequences. Stochastic or meta-heuristics techniques speed up alignment algorithm but with near optimal alignment accuracy not as that of DP. Hence, This chapter aims to review the recent development of MSA using meta-heuristics algorithms. In addition, two recent techniques are focused in more deep: the first is Fragmented protein sequence alignment using two-layer particle swarm optimization (FTLPSO). The second is Multiple sequence alignment using multi-objective based bacterial foraging optimization algorithm (MO-BFO).

Download Full-text

A Hybrid Flow for Multiple Sequence Alignment with a BLASTn Based Pairwise Alignment Processor

2018 IEEE International Symposium on Circuits and Systems (ISCAS) ◽

10.1109/iscas.2018.8351254 ◽

2018 ◽

Author(s):

Mao-Jan Lin ◽

Chih-Yu Chang ◽

Yu-Cheng Li ◽

Nae-Chyun Chen ◽

Yi-Chang Lu

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Pairwise Alignment ◽

Multiple Sequence

Download Full-text

A Novel Method for Progressive Multiple Sequence Alignment Based on Lempel-Ziv

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-10677-4_17 ◽

2009 ◽

pp. 151-158 ◽

Cited By ~ 3

Author(s):

Guoli Ji ◽

Congting Ye ◽

Zijiang Yang ◽

Zhenya Guo

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Multiple Sequence ◽

Novel Method ◽

Progressive Multiple Sequence Alignment

Download Full-text

Progressive Alignment Method Using Genetic Algorithm for Multiple Sequence Alignment

IEEE Transactions on Evolutionary Computation ◽

10.1109/tevc.2011.2162849 ◽

2012 ◽

Vol 16 (5) ◽

pp. 615-631 ◽

Cited By ~ 36

Author(s):

Farhana Naznin ◽

Ruhul Sarker ◽

Daryl Essam

Keyword(s):

Genetic Algorithm ◽

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Alignment Method ◽

Progressive Alignment ◽

Multiple Sequence

Download Full-text

Efficient mapping of genomic sequences to optimize multiple pairwise alignment in hybrid cluster platforms

Journal of Integrative Bioinformatics ◽

10.1515/jib-2014-251 ◽

2014 ◽

Vol 11 (3) ◽

pp. 60-71

Author(s):

Alberto Montañola ◽

Concepció Roig ◽

Porfidio Hernández

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Pairwise Alignment ◽

Experimental Results ◽

Genomic Sequences ◽

Multiple Sequence ◽

Optimal Amount ◽

New Challenges ◽

Best Parameters ◽

Available Resources

Summary Multiple sequence alignment (MSA), used in biocomputing to study similarities between different genomic sequences, is known to require important memory and computation resources. Nowadays, researchers are aligning thousands of these sequences, creating new challenges in order to solve the problem using the available resources efficiently. Determining the efficient amount of resources to allocate is important to avoid waste of them, thus reducing the economical costs required in running for example a specific cloud instance. The pairwise alignment is the initial key step of the MSA problem, which will compute all pair alignments needed. We present a method to determine the optimal amount of memory and computation resources to allocate by the pairwise alignment, and we will validate it through a set of experimental results for different possible inputs. These allow us to determine the best parameters to configure the applications in order to use effectively the available resources of a given system.

Download Full-text

CLUSTAL W (improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice)

Encyclopedia of Genetics, Genomics, Proteomics and Informatics ◽

10.1007/978-1-4020-6754-9_3188 ◽

2008 ◽

pp. 376-377 ◽

Cited By ~ 13

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Weight Matrix ◽

Multiple Sequence ◽

Progressive Multiple Sequence Alignment

Download Full-text

MULTIPLE SEQUENCE ALIGNMENT USING AN EXHAUSTIVE AND GREEDY ALGORITHM

Journal of Bioinformatics and Computational Biology ◽

10.1142/s021972000500103x ◽

2005 ◽

Vol 03 (02) ◽

pp. 243-255 ◽

Cited By ~ 1

Author(s):

YI WANG ◽

KUO-BIN LI

Keyword(s):

Greedy Algorithm ◽

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Multiple Alignment ◽

Initial Alignment ◽

Progressive Alignment ◽

Multiple Sequence ◽

Java Programming ◽

Multiple Alignments ◽

Objective Score

We describe an exhaustive and greedy algorithm for improving the accuracy of multiple sequence alignment. A simple progressive alignment approach is employed to provide initial alignments. The initial alignment is then iteratively optimized against an objective function. For any working alignment, the optimization involves three operations: insertions, deletions and shuffles of gaps. The optimization is exhaustive since the algorithm applies the above operations to all eligible positions of an alignment. It is also greedy since only the operation that gives the best improving objective score will be accepted. The algorithms have been implemented in the EGMA (Exhaustive and Greedy Multiple Alignment) package using Java programming language, and have been evaluated using the BAliBASE benchmark alignment database. Although EGMA is not guaranteed to produce globally optimized alignment, the tests indicate that EGMA is able to build alignments with high quality consistently, compared with other commonly used iterative and non-iterative alignment programs. It is also useful for refining multiple alignments obtained by other methods.

Download Full-text

A New Quantum Cuckoo Search Algorithm for Multiple Sequence Alignment

Journal of Intelligent Systems ◽

10.1515/jisys-2013-0052 ◽

2014 ◽

Vol 23 (3) ◽

pp. 261-275 ◽

Cited By ~ 4

Author(s):

Widad Kartous ◽

Abdesslem Layeb ◽

Salim Chikhi

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Search Algorithm ◽

Cuckoo Search ◽

Cuckoo Search Algorithm ◽

Initial Population ◽

Biological Sequences ◽

Alignment Method ◽

Progressive Alignment ◽

Multiple Sequence

AbstractMultiple sequence alignment (MSA) is one of the major problems that can be encountered in the bioinformatics field. MSA consists in aligning a set of biological sequences to extract the similarities between them. Unfortunately, this problem has been shown to be NP-hard. In this article, a new algorithm was proposed to deal with this problem; it is based on a quantum-inspired cuckoo search algorithm. The other feature of the proposed approach is the use of a randomized progressive alignment method based on a hybrid global/local pairwise algorithm to construct the initial population. The results obtained by this hybridization are very encouraging and show the feasibility and effectiveness of the proposed solution.

Download Full-text