Ant Colony Optimization Methodfor Multiple Sequence Alignment

One of the main tasks of structural biology is comparing the structure of proteins. Comparisons of protein structure can determine their functional similarities. Multigraph alignment is a useful tool for identifying functional similarities based on structural analysis. This article proposes a new algorithm for aligning protein binding sites called ACOTS-MGA. This algorithm is based on the memetic scheme. It uses the ACO method to construct a set of solutions, then selects the best solution for implementing Tabu Search to improve the solution quality. Experimental results have shown that ACOTS-MGA outperforms state-of-the-art algorithms while producing alignments of better quality.KeywordsMultiple Graph Alignment, Tabu Search, Ant Colony Optimization, local search, memetic algorithm, SMMAS pheromone update rule, protein active sitesReferencesE. Todd, C. A. Orengo, and J. M. Thornton, “Evolution of function in protein superfamilies, from a structural perspective,” J. Mol. Biol., vol. 307, no. 4, pp. 1113–1143, Apr. 2001.S. F. Altschul et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucleic Acids Res., vol. 25, pp. 3389–3402, 1997.R. C. Edgar, “MUSCLE: multiple sequence alignment with high accuracy and high throughput,” Nucleic Acids Res., vol. 32, no. 5, pp. 1792–1797, Mar. 2004.J. D. Thompson, D. G. Higgins, and T. J. Gibson, “CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice,” Nucleic Acids Res., vol. 22, no. 22, pp. 4673–4680, Nov. 1994.M. Larkin, G. Blackshields, N. Brown, … R. C.-, and undefined 2007, “Clustal W and Clustal X version 2.0,” academic.oup.com.C. Notredame, D. G. Higgins, and J. Heringa, “T-coffee: a novel method for fast and accurate multiple sequence alignment,” J. Mol. Biol., vol. 302, no. 1, pp. 205–217, Sep. 2000.K. Sjolander, “Phylogenomic inference of protein molecular function: advances and challenges,” Bioinformatics, vol. 20, no. 2, pp. 170–179, Jan. 2004.T. Fober, M. Mernberger, G. Klebe, and E. Hüllermeier, “Evolutionary construction of multiple graph alignments for the structural analysis of biomolecules,” Bioinformatics, vol. 25, no. 16, pp. 2110–2117, 2009.M. Mernberger, G. Klebe, and E. Hullermeier, “SEGA: Semiglobal Graph Alignment for Structure-Based Protein Comparison,” IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 8, no. 5, pp. 1330–1343, Sep. 2011.D. Shasha, J. T. L. Wang, and R. Giugno, “Algorithmics and applications of tree and graph searching,” in Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS ’02, 2002, p. 39.R. V. Spriggs, P. J. Artymiuk, and P. Willett, “Searching for Patterns of Amino Acids in 3D Protein Structures,” J. Chem. Inf. Comput. Sci., vol. 43, no. 2, pp. 412–421, Mar. 2003.D. Conte, P. Foggia, C. Sansone, And M. Vento, “Thirty years of graph matching in pattern recognition,” Int. J. Pattern Recognit. Artif. Intell., vol. 18, no. 3, pp. 265–298, May 2004.K. Kinoshita and H. Nakamura, “Identification of the ligand binding sites on the molecular surface of proteins,” Protein Sci., vol. 14, no. 3, pp. 711–718, Mar. 2005.O. Kuchaiev and N. Pržulj, “Integrative network alignment reveals large regions of global network similarity in yeast and human,” Bioinformatics, vol. 27, 2011.Xifeng Yan, Feida Zhu, Jiawei Han, and P. S. Yu, “Searching Substructures with Superimposed Distance,” in 22nd International Conference on Data Engineering (ICDE’06), 2006, pp. 88–88.X. Yan, P. S. Yu, and J. Han, “Substructure similarity search in graph databases,” in Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD ’05, 2005, p. 766.S. Zhang, M. Hu, and J. Yang, “TreePi: A Novel Graph Indexing Method,” in 2007 IEEE 23rd International Conference on Data Engineering, 2007, pp. 966–975.A. E. Aladag and C. Erten, “SPINAL: scalable protein interaction network alignment,” Bioinformatics, vol. 29, pp. 917–924, 2013.S. Schmitt, D. Kuhn, and G. Klebe, “A New Method to Detect Related Function Among Proteins Independent of Sequence and Fold Homology,” J. Mol. Biol., vol. 323, no. 2, pp. 387–406, Oct. 2002.M. Hendlich, A. Bergner, J. Günther, and G. Klebe, “Relibase: Design and Development of a Database for Comprehensive Analysis of Protein–Ligand Interactions,” J. Mol. Biol., vol. 326, no. 2, pp. 607–620, Feb. 2003.N. Weskamp, E. Hüllermeier, D. Kuhn, and G. Klebe, “Multiple graph alignment for the structural analysis of protein active sites,” IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 4, no. 2, pp. 310–320, 2007.T. N. Ha, D. D. Dong, and H. X. Huan, “An efficient ant colony optimization algorithm for Multiple Graph Alignment,” in 2013 International Conference on Computing, Management and Telecommunications (ComManTel), 2013, pp. 386–391. F. Neri, Handbook of memetic algorithms, vol. 379. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011.M. Gong, Z. Peng, L. Ma, and J. Huang, “Global Biological Network Alignment by Using Efficient Memetic Algorithm,” IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 13, no. 6, pp. 1117–1129, Nov. 2016.J. M. Caldonazzo Garbelini, A. Y. Kashiwabara, and D. S. Sanches, “Sequence motif finder using memetic algorithm,” BMC Bioinformatics, vol. 19, 2018. L. Correa, B. Borguesan, C. Farfan, M. Inostroza-Ponta, and M. Dorn, “A Memetic Algorithm for 3-D Protein Structure Prediction Problem,” IEEE/ACM Trans. Comput. Biol. Bioinforma., pp. 1–1, 2016.H. Tran Ngoc, D. Do Duc, and H. Hoang Xuan, “A novel ant based algorithm for multiple graph alignment,” in 2014 International Conference on Advanced Technologies for Communications (ATC 2014), 2014, pp. 181–186. H. X. Huan, N. Linh-Trung, H.-T. Huynh, and others, “Solving the Traveling Salesman Problem with Ant Colony Optimization: A Revisit and New Efficient Algorithms,” REV J. Electron. Commun., vol. 2, no. 3–4, 2013. D. Do Duc, H. Q. Dinh, and H. Hoang Xuan, “On the Pheromone Update Rules of Ant Colony Optimization Approaches for the Job Shop Scheduling Problem,” 2008, pp. 153-160.

Download Full-text

Multiple sequence alignment algorithm based on a dispersion graph and ant colony algorithm

Journal of Computational Chemistry ◽

10.1002/jcc.21203 ◽

2009 ◽

Vol 30 (13) ◽

pp. 2031-2038 ◽

Cited By ~ 9

Author(s):

Weiyang Chen ◽

Bo Liao ◽

Wen Zhu ◽

Xuyu Xiang

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Ant Colony Algorithm ◽

Ant Colony ◽

Alignment Algorithm ◽

Multiple Sequence ◽

Sequence Alignment Algorithm

Download Full-text

An efficient algorithm for multiple sequence alignment based on ant colony optimisation and divide‐and‐conquer method

New Zealand Journal of Agricultural Research ◽

10.1080/00288230709510330 ◽

2007 ◽

Vol 50 (5) ◽

pp. 617-626 ◽

Cited By ~ 4

Author(s):

Wei Liu ◽

Ling Chen ◽

Juan Chen

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Efficient Algorithm ◽

Ant Colony ◽

Divide And Conquer ◽

Ant Colony Optimisation ◽

Multiple Sequence

Download Full-text

An ant colony algorithm for multiple sequence alignment in bioinformatics

Artificial Neural Nets and Genetic Algorithms ◽

10.1007/978-3-7091-0646-4_33 ◽

2003 ◽

pp. 182-186 ◽

Cited By ~ 12

Author(s):

Jonathan Moss ◽

Colin G. Johnson

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Ant Colony Algorithm ◽

Ant Colony ◽

Multiple Sequence

Download Full-text

Multiple Sequence Alignment and Profile Analysis of Protein Family Utsing Hidden Markov Model

International Journal of Scientific Research ◽

10.15373/22778179/june2013/66 ◽

2012 ◽

Vol 2 (6) ◽

pp. 208-211

Author(s):

Navjot Kaur ◽

◽

Rajbir Singh Cheema ◽

Harmandeep Singh Harmandeep Singh

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Profile Analysis ◽

Hidden Markov ◽

Protein Family ◽

Multiple Sequence

Download Full-text

Faculty Opinions recommendation of MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.731078852.793536612 ◽

2017 ◽

Author(s):

Feng Gao

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Online Service ◽

Multiple Sequence

Download Full-text

Computational Analysis of Therapeutic Enzyme Uricase from Different Source Organisms

Current Proteomics ◽

10.2174/1570164616666190617165107 ◽

2020 ◽

Vol 17 (1) ◽

pp. 59-77

Author(s):

Anand Kumar Nelapati ◽

JagadeeshBabu PonnanEttiyappan

Keyword(s):

Uric Acid ◽

Amino Acid ◽

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Protein Sequences ◽

Amino Acid Sequences ◽

Amino Acid Residues ◽

Multiple Sequence ◽

Physiochemical Properties ◽

Pharmaceutical Industries

Background:Hyperuricemia and gout are the conditions, which is a response of accumulation of uric acid in the blood and urine. Uric acid is the product of purine metabolic pathway in humans. Uricase is a therapeutic enzyme that can enzymatically reduces the concentration of uric acid in serum and urine into more a soluble allantoin. Uricases are widely available in several sources like bacteria, fungi, yeast, plants and animals.Objective:The present study is aimed at elucidating the structure and physiochemical properties of uricase by insilico analysis.Methods:A total number of sixty amino acid sequences of uricase belongs to different sources were obtained from NCBI and different analysis like Multiple Sequence Alignment (MSA), homology search, phylogenetic relation, motif search, domain architecture and physiochemical properties including pI, EC, Ai, Ii, and were performed.Results:Multiple sequence alignment of all the selected protein sequences has exhibited distinct difference between bacterial, fungal, plant and animal sources based on the position-specific existence of conserved amino acid residues. The maximum homology of all the selected protein sequences is between 51-388. In singular category, homology is between 16-337 for bacterial uricase, 14-339 for fungal uricase, 12-317 for plants uricase, and 37-361 for animals uricase. The phylogenetic tree constructed based on the amino acid sequences disclosed clusters indicating that uricase is from different source. The physiochemical features revealed that the uricase amino acid residues are in between 300- 338 with a molecular weight as 33-39kDa and theoretical pI ranging from 4.95-8.88. The amino acid composition results showed that valine amino acid has a high average frequency of 8.79 percentage compared to different amino acids in all analyzed species.Conclusion:In the area of bioinformatics field, this work might be informative and a stepping-stone to other researchers to get an idea about the physicochemical features, evolutionary history and structural motifs of uricase that can be widely used in biotechnological and pharmaceutical industries. Therefore, the proposed in silico analysis can be considered for protein engineering work, as well as for gout therapy.

Download Full-text