A novel alignment-free DNA sequence similarity analysis approach based on top-k n-gram match-up

2020 ◽  
Vol 100 ◽  
pp. 107693
Author(s):  
Emre Delibaş ◽  
Ahmet Arslan ◽  
Abdulkadir Şeker ◽  
Banu Diri
Author(s):  
Dan Wei ◽  
Qingshan Jiang ◽  
Sheng Li

Similarity analysis of DNA sequences is a fundamental research area in Bioinformatics. The characteristic distribution of L-tuple, which is the tuple of length L, reflects the valuable information contained in a biological sequence and thus may be used in DNA sequence similarity analysis. However, similarity analysis based on characteristic distribution of L-tuple is not effective for the comparison of highly conservative sequences. In this paper, a new similarity measurement approach based on Triplets of Nucleic Acid Bases (TNAB) is introduced for DNA sequence similarity analysis. The new approach characterizes both the content feature and position feature of a DNA sequence using the frequency and position of occurrence of TNAB in the sequence. The experimental results show that the approach based on TNAB is effective for analysing DNA sequence similarity.


2011 ◽  
Vol 7 ◽  
pp. EBO.S7364 ◽  
Author(s):  
Xingqin Qi ◽  
Qin Wu ◽  
Yusen Zhang ◽  
Eddie Fuller ◽  
Cun-Quan Zhang

2008 ◽  
Vol 46 (3) ◽  
pp. 395-401 ◽  
Author(s):  
C. Meintanis ◽  
K.I. Chalkou ◽  
K. Ar. Kormas ◽  
D.S. Lymperopoulou ◽  
E.A. Katsifas ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document