Investigating Molecular Recognition Through Large-scale Analysis of Protein Sequences and Structures

2000 ◽  
Author(s):  
Mark Gerstein
2015 ◽  
Vol 43 (5) ◽  
pp. 807-811 ◽  
Author(s):  
François D. Richard ◽  
Andrey V. Kajava

Tandem repeats (TRs) are frequently not perfect, containing a number of mutations accumulated during evolution. One of the main problems is to distinguish between the sequences that contain highly imperfect TRs and the aperiodic sequences. The majority of proteins with TRs in sequences have repetitive arrangements in their 3D structures. Therefore, the 3D structures of proteins can be used as a benchmarking criterion for TR detection in sequences. Different TR detection tools use their own scoring procedures to determine the boundary between repetitive and non-repetitive protein sequences. Here we described these scoring functions and benchmark them by using known structural TRs. Our survey shows that none of the existing scoring procedures are able to achieve an appropriate separation between genuine structural TRs and non-TR regions. This suggests that if we want to obtain a collection of structurally and functionally meaningful TRs from a large scale analysis of proteomes, the TR scoring metrics need to be improved.


2021 ◽  
Author(s):  
Mehdi A. Beniddir ◽  
Kyo Bin Kang ◽  
Grégory Genta-Jouve ◽  
Florian Huber ◽  
Simon Rogers ◽  
...  

This review highlights the key computational tools and emerging strategies for metabolite annotation, and discusses how these advances will enable integrated large-scale analysis to accelerate natural product discovery.


Sign in / Sign up

Export Citation Format

Share Document