Sequence Alignment and Sequence Database Search

Author(s):  
Martin Vingron
2018 ◽  
Vol 18 (2) ◽  
pp. 652-663 ◽  
Author(s):  
Kristian E. Swearingen ◽  
Jimmy K. Eng ◽  
David Shteynberg ◽  
Vladimir Vigdorovich ◽  
Timothy A. Springer ◽  
...  

PROTEOMICS ◽  
2006 ◽  
Vol 6 (4) ◽  
pp. 1121-1132 ◽  
Author(s):  
Gun Wook Park ◽  
Kyung-Hoon Kwon ◽  
Jin Young Kim ◽  
Jeong Hwa Lee ◽  
Sung-Ho Yun ◽  
...  

PROTEOMICS ◽  
2012 ◽  
Vol 13 (1) ◽  
pp. 22-24 ◽  
Author(s):  
Jimmy K. Eng ◽  
Tahmina A. Jahan ◽  
Michael R. Hoopmann

PROTEOMICS ◽  
2014 ◽  
Vol 14 (23-24) ◽  
pp. 2742-2749 ◽  
Author(s):  
Heejin Park ◽  
Junwoo Bae ◽  
Hyunwoo Kim ◽  
Sangok Kim ◽  
Hokeun Kim ◽  
...  

2020 ◽  
Author(s):  
Siddharth Jain ◽  
Xiongye Xiao ◽  
Paul Bogdan ◽  
Jehoshua Bruck

ABSTRACTEvolution is a process of change where mutations in the viral RNA are selected based on their fitness for replication and survival. Given that current phylogenetic analysis of SARS-CoV-2 identifies new viral clades after they exhibit evolutionary selections, one wonders whether we can identify the viral selection and predict the emergence of new viral clades? Inspired by the Kolmogorov complexity concept, we propose a generative complexity (algorithmic) framework capable to analyze the viral RNA sequences by mapping the multiscale nucleotide dependencies onto a state machine, where states represent subsequences of nucleotides and state-transition probabilities encode the higher order interactions between these states. We apply computational learning and classification techniques to identify the active state-transitions and use those as features in clade classifiers to decipher the transient mutations (still evolving within a clade) and stable mutations (typical to a clade). As opposed to current analysis tools that rely on the edit distance between sequences and require sequence alignment, our method is computationally local, does not require sequence alignment and is robust to random errors (substitution, insertions and deletions). Relying on the GISAID viral sequence database, we demonstrate that our method can predict clade emergence, potentially aiding with the design of medications and vaccines.


Sign in / Sign up

Export Citation Format

Share Document