aPRBind: protein–RNA interface prediction by combining sequence and I-TASSER model-based structural features learned with convolutional neural networks

Author(s):  
Yang Liu ◽  
Weikang Gong ◽  
Yanpeng Zhao ◽  
Xueqing Deng ◽  
Shan Zhang ◽  
...  

Abstract Motivation Protein–RNA interactions play a critical role in various biological processes. The accurate prediction of RNA-binding residues in proteins has been one of the most challenging and intriguing problems in the field of computational biology. The existing methods still have a relatively low accuracy especially for the sequence-based ab-initio methods. Results In this work, we propose an approach aPRBind, a convolutional neural network-based ab-initio method for RNA-binding residue prediction. aPRBind is trained with sequence features and structural ones (particularly including residue dynamics information and residue–nucleotide propensity developed by us) that are extracted from the predicted structures by I-TASSER. The analysis of feature contributions indicates the sequence features are most important, followed by dynamics information, and the sequence and structural features are complementary in binding site prediction. The performance comparison of our method with other peer ones on benchmark dataset shows that aPRBind outperforms some state-of-the-art ab-initio methods. Additionally, aPRBind can give a better prediction for the modeled structures with TM-score≥0.5, and meanwhile since the structural features are not very sensitive to the refined 3D structures, aPRBind has only a marginal dependence on the accuracy of the structure model, which allows aPRBind to be applied to the RNA-binding site prediction for the modeled or unbound structures. Availability and implementation The source code is available at https://github.com/ChunhuaLiLab/aPRbind. Supplementary information Supplementary data are available at Bioinformatics online.

2021 ◽  
Vol 19 (02) ◽  
pp. 2150006
Author(s):  
Fatemeh Nazem ◽  
Fahimeh Ghasemi ◽  
Afshin Fassihi ◽  
Alireza Mehri Dehnavi

Binding site prediction for new proteins is important in structure-based drug design. The identified binding sites may be helpful in the development of treatments for new viral outbreaks in the world when there is no information available about their pockets with COVID-19 being a case in point. Identification of the pockets using computational methods, as an alternative method, has recently attracted much interest. In this study, the binding site prediction is viewed as a semantic segmentation problem. An improved 3D version of the U-Net model based on the dice loss function is utilized to predict the binding sites accurately. The performance of the proposed model on the independent test datasets and SARS-COV-2 shows the segmentation model could predict the binding sites with a more accurate shape than the recently published deep learning model, i.e. DeepSite. Therefore, the model may help predict the binding sites of proteins and could be used in drug design for novel proteins.


2020 ◽  
pp. 49-69
Author(s):  
Himanshu Avashthi ◽  
Ambuj Srivastava ◽  
Dev Bukhsh Singh

Sign in / Sign up

Export Citation Format

Share Document