FAME: fast and memory efficient multiple sequences alignment tool through compatible chain of roots

2020 ◽  
Vol 36 (12) ◽  
pp. 3662-3668
Author(s):  
Etminan Naznooshsadat ◽  
Parvinnia Elham ◽  
Sharifi-Zarchi Ali

Abstract Motivation Multiple sequence alignment (MSA) is important and challenging problem of computational biology. Most of the existing methods can only provide a short length multiple alignments in an acceptable time. Nevertheless, when the researchers confront the genome size in the multiple alignments, the process has required a huge processing space/time. Accordingly, using the method that can align genome size rapidly and precisely has a great effect, especially on the analysis of the very long alignments. Herein, we have proposed an efficient method, called FAME, which vertically divides sequences from the places that they have common areas; then they are arranged in consecutive order. Then these common areas are shifted and placed under each other, and the subsequences between them are aligned using any existing MSA tool. Results The results demonstrate that the combination of FAME and the MSA methods and deploying minimizer are capable to be executed on personal computer and finely align long length sequences with much higher sum-of-pair (SP) score compared to the standalone MSA tools. As we select genomic datasets with longer length, the SP score of the combinatorial methods is gradually improved. The calculated computational complexity of methods supports the results in a way that combining FAME and the MSA tools leads to at least four times faster execution on the datasets. Availability and implementation The source code and all datasets and run-parameters are accessible free on http://github.com/naznoosh/msa. Supplementary information Supplementary data are available at Bioinformatics online.

2019 ◽  
Vol 35 (20) ◽  
pp. 3970-3980 ◽  
Author(s):  
Mathilde Carpentier ◽  
Jacques Chomilier

Abstract Motivation Multiple sequence alignment programs have proved to be very useful and have already been evaluated in the literature yet not alignment programs based on structure or both sequence and structure. In the present article we wish to evaluate the added value provided through considering structures. Results We compared the multiple alignments resulting from 25 programs either based on sequence, structure or both, to reference alignments deposited in five databases (BALIBASE 2 and 3, HOMSTRAD, OXBENCH and SISYPHUS). On the whole, the structure-based methods compute more reliable alignments than the sequence-based ones, and even than the sequence+structure-based programs whatever the databases. Two programs lead, MAMMOTH and MATRAS, nevertheless the performances of MUSTANG, MATT, 3DCOMB, TCOFFEE+TM_ALIGN and TCOFFEE+SAP are better for some alignments. The advantage of structure-based methods increases at low levels of sequence identity, or for residues in regular secondary structures or buried ones. Concerning gap management, sequence-based programs set less gaps than structure-based programs. Concerning the databases, the alignments of the manually built databases are more challenging for the programs. Availability and implementation All data and results presented in this study are available at: http://wwwabi.snv.jussieu.fr/people/mathilde/download/AliMulComp/. Supplementary information Supplementary data are available at Bioinformatics online.


Author(s):  
Gondo Mastutik ◽  
Juniastuti Juniastuti ◽  
Ali Rohman ◽  
Mochamad Amin ◽  
Poernomo Boedi Setiawan

Chronic activivity of Hepatitis B Virus (HBV) infection can lead to liver cirrhosis or hepatocellular carcinoma. The objective of thisstudy was to know by analyzing the distribution of HBV genotypes and subtypes from hepatitis B patients suffering from chronic activehepatitis B infection in Surabaya. The HBV genotypes were determined by comparing the S gene sequences to those kept in the GeneBank. The phylogenetic tree was constructed by means of the unweighted-pair group method using arithmetic averages. Furthermore,the subtypes were deduced based on the prediction of amino acid residues 116 to 183 of HBsAg on multiple sequences alignment withClustalW2. This study involved 20 sera obtained from patients suffering chronic active hepatitis B infection. After PCR and sequencing,it was found that 13 samples could be used for sequence analysis. The results showed that all sequences were clustered into HBV genotypeB. The subtype adw2 was identified from 12 of 13 sequences, whereas one (1) belonged to ayw1. The subtype adw2 is most prevalent inIndonesia, namely in the islands of Sumatra, Java, South Kalimantan, Bali, Lombok, Ternate, and Morotai, while ayw1 is found in theislands of Nusa Tenggara and Moluccas. Based on this study, it was found that the patients with HBV subtype adw2 were from Surabaya, whereas with ayw1 was from Nusa Tenggara. It can be concluded that the HBV infected patients with chronic active hepatitis B inSurabaya have the genotype B with subtype adw2 which was originally from Surabaya, whereas, ayw1 was a patient originally fromNusa Tenggara.


2002 ◽  
Vol 7 (2) ◽  
pp. 139-144
Author(s):  
Shi Feng ◽  
Huang Jing ◽  
Mo Zhong-xi ◽  
Zheng Hui-rao

Author(s):  
Zne-Jung Lee ◽  
Chou-Yuan Lee ◽  
Huei-Lung Yu ◽  
Kuan-Hung Liu ◽  
Shun-Feng Su

Sign in / Sign up

Export Citation Format

Share Document