DNA sequence comparison based on Tabular Representation

DNA sequence comparison remains as one of the critical steps in the analysis of phylogenetic relationships between species. In order to get quantitative comparison, we want to devise an algorithm that would use the tabular representation of DNA sequences. The tabular approach of representation captures the essence of the base composition and distribution of the sequence. In this contribution, we take the tabular notation for DNA sequences and then these tables are compared to find the similarity/dissimilarity measure of the sequences. We have developed algorithms for comparing DNA sequences. These programs help us to search similar segments of sequences, calculate similarity scores and identify repetitions based on local sequence similarity. There are two approaches: one is to find the exact similarity and another is to find the measurement for similarity. The first approach is more sensitive, which can be used to search DNA sequence similarities only if complete matches occurred and can compare exactly similar sequences only. This approach violates if a single mismatch for any base character appears so it is not a general solution. To find the miss matches along with the matches we have suggested another approach which compiles the information matrix based on matches and miss matches. This approach is quiet general in terms of sequences which have a large fragment common with less no of dissimilar base characters. This alternate approach includes an additional step in the calculation of the similarity score that denotes multiple regions of similarity between sequences. For both these approaches computer programs are prepared and tested on data sets. These programs can be used to evaluate the significance of similarity scores using a shuffling method that preserves local sequence composition. In addition, these programs have been generalized to allow comparison of DNA sequences based on a variety of alternative scoring matrices. We have been developing tools for the analysis of protein The method is very simple and fast, and it can be used to analyze both short and long DNA sequences. The utility of this method is tested on the several sequences of species and the results are consistent with that reported.

Download Full-text

A New Approach for DNA Sequence Similarity Analysis based on Triplets of Nucleic Acid Bases

International Journal of Nanotechnology and Molecular Computation ◽

10.4018/978-1-60960-064-8.ch006 ◽

2010 ◽

Vol 2 (4) ◽

pp. 1-11

Author(s):

Dan Wei ◽

Qingshan Jiang ◽

Sheng Li

Keyword(s):

Nucleic Acid ◽

Dna Sequence ◽

Dna Sequences ◽

Sequence Similarity ◽

Similarity Analysis ◽

Biological Sequence ◽

Nucleic Acid Bases ◽

New Approach ◽

Characteristic Distribution ◽

Sequence Similarity Analysis

Similarity analysis of DNA sequences is a fundamental research area in Bioinformatics. The characteristic distribution of L-tuple, which is the tuple of length L, reflects the valuable information contained in a biological sequence and thus may be used in DNA sequence similarity analysis. However, similarity analysis based on characteristic distribution of L-tuple is not effective for the comparison of highly conservative sequences. In this paper, a new similarity measurement approach based on Triplets of Nucleic Acid Bases (TNAB) is introduced for DNA sequence similarity analysis. The new approach characterizes both the content feature and position feature of a DNA sequence using the frequency and position of occurrence of TNAB in the sequence. The experimental results show that the approach based on TNAB is effective for analysing DNA sequence similarity.

Download Full-text

Parallel Megabase DNA Sequence Comparison with OpenCL

2015 IEEE 22nd International Conference on High Performance Computing (HiPC) ◽

10.1109/hipc.2015.13 ◽

2015 ◽

Cited By ~ 1

Author(s):

Marco Antonio C. de Figueiredo ◽

Edans F. de O. Sandes ◽

Alba Cristina M. A. de Melo

Keyword(s):

Dna Sequence ◽

Sequence Comparison ◽

Dna Sequence Comparison

Download Full-text

Phylogenetic relationships ofSalmonellabased on DNA sequence comparison ofatpDencoding the Î² subunit of ATP synthase

FEMS Microbiology Letters ◽

10.1111/j.1574-6968.1998.tb12933.x ◽

1998 ◽

Vol 161 (1) ◽

pp. 89-96

Author(s):

Henrik Christensen ◽

John Elmerdahl Olsen

Keyword(s):

Dna Sequence ◽

Atp Synthase ◽

Phylogenetic Relationships ◽

Sequence Comparison ◽

Dna Sequence Comparison

Download Full-text

Maximum-likelihood estimation of the statistical distribution of Smith-Waterman local sequence similarity scores

Bulletin of Mathematical Biology ◽

10.1016/s0092-8240(05)80176-4 ◽

1992 ◽

Vol 54 (1) ◽

pp. 59-75 ◽

Cited By ~ 9

Author(s):

R MOTT

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Sequence Similarity ◽

Likelihood Estimation ◽

Statistical Distribution ◽

Local Sequence ◽

Similarity Scores

Download Full-text

DNA sequence comparison of human and mouse retinitis pigmentosa GTPase regulator (RPGR) identifies tissue-specific exons and putative regulatory elements

Human Genetics ◽

10.1007/s004390100572 ◽

2001 ◽

Vol 109 (3) ◽

pp. 271-278 ◽

Cited By ~ 11

Author(s):

Renate Kirschner ◽

Deniz Erturk ◽

Christina Zeitz ◽

Selen Sahin ◽

Juliane Ramser ◽

...

Keyword(s):

Retinitis Pigmentosa ◽

Dna Sequence ◽

Sequence Comparison ◽

Regulatory Elements ◽

Tissue Specific ◽

Retinitis Pigmentosa Gtpase Regulator ◽

Human And Mouse ◽

Dna Sequence Comparison

Download Full-text

Ordered index seed algorithm for intensive DNA sequence comparison

2008 IEEE International Symposium on Parallel and Distributed Processing ◽

10.1109/ipdps.2008.4536172 ◽

2008 ◽

Cited By ~ 2

Author(s):

Dominique Lavenier

Keyword(s):

Dna Sequence ◽

Sequence Comparison ◽

Dna Sequence Comparison

Download Full-text

A Burrows-Wheeler Transform Based Method for DNA Sequence Comparison

Computational Biology and Bioinformatics ◽

10.11648/j.cbb.20140203.11 ◽

2014 ◽

Vol 2 (3) ◽

pp. 33

Author(s):

Chun Li

Keyword(s):

Dna Sequence ◽

Sequence Comparison ◽

Burrows Wheeler Transform ◽

Dna Sequence Comparison

Download Full-text

Long DNA Sequence Comparison on Multicore Architectures

Euro-Par 2010 - Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-15291-7_24 ◽

2010 ◽

pp. 247-259 ◽

Cited By ~ 6

Author(s):

Friman Sánchez ◽

Felipe Cabarcas ◽

Alex Ramirez ◽

Mateo Valero

Keyword(s):

Dna Sequence ◽

Sequence Comparison ◽

Multicore Architectures ◽

Dna Sequence Comparison

Download Full-text

Desulfovibrio africanus subsp. uniflagellum subsp. nov., a sulfate-reducing bacterium from a uranium-contaminated subsurface aquifer

INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY ◽

10.1099/ijs.0.006668-0 ◽

2010 ◽

Vol 60 (4) ◽

pp. 880-886 ◽

Cited By ~ 14

Author(s):

I. Nydia Castañeda-Carrión ◽

Cody S. Sheik ◽

Lee R. Krumholz

Keyword(s):

Dna Sequence ◽

Dna Sequences ◽

Type Strain ◽

Sequence Similarity ◽

Gene Clusters ◽

Contaminated Site ◽

Amino Acid Sequence Similarity ◽

Trna Genes ◽

Rrna Gene ◽

Small Plasmid

The bacterial strain SR-1T was isolated from subsurface sediments of a uranium-contaminated site in Shiprock, New Mexico, USA. Cells are vibrioid and motile by means of a single polar flagellum. Strain SR-1T grows on sulfate, oxidizing formate, lactate and H2, but not malate, and ferments pyruvate. The DNA sequences of the 16S rRNA gene and the 16S–23S internal transcribed spacer of strain SR-1T showed 99.9 and 99.4 % similarity, respectively, to those of the type strain Desulfovibrio africanus DSM 2603T. The DNA sequence of the ITS region is 300 bases in length and contains two tRNA genes (tRNAIle, tRNAAla). The partial DNA sequence of the dsrAB gene showed 94.6 % amino acid sequence similarity to that of D. africanus. The DNA G+C content of strain SR-1T was 62.4 mol% and it showed 72 % DNA–DNA similarity to D. africanus. DNA typing methods that target gene clusters and whole genomes revealed characteristic genomic fingerprints for strain SR-1T. A small plasmid was detected by gel electrophoresis. On the basis of distinct phenotypic and genotypic characteristics, strain SR-1T represents a novel subspecies of D. africanus, for which the name Desulfovibrio africanus subsp. uniflagellum subsp. nov. is proposed. The type strain is SR-1T (=JCM 15510T =LS KCTC 5649T).

Download Full-text

DNA Sequence Comparison Viewers

Encyclopedia of Genetics, Genomics, Proteomics and Informatics ◽

10.1007/978-1-4020-6754-9_4708 ◽

2008 ◽

pp. 539-539

Keyword(s):

Dna Sequence ◽

Sequence Comparison ◽

Dna Sequence Comparison

Download Full-text