A Neural Network Based Approach for GPCR Protein Prediction Using Pattern Discovery

Author(s):  
Tsang Ing Ren ◽  
George D. C. Calvalcanti ◽  
Francisco Nascimento Junior ◽  
Gabriela Espadas
2008 ◽  
Author(s):  
Francisco Nascimento Junior ◽  
Ing Ren Tsang ◽  
George D.C. Cavalcanti

2021 ◽  
Vol 22 (S6) ◽  
Author(s):  
Xinnan Dai ◽  
Fan Xu ◽  
Shike Wang ◽  
Piyushkumar A. Mundra ◽  
Jie Zheng

Abstract Background Recent advances in simultaneous measurement of RNA and protein abundances at single-cell level provide a unique opportunity to predict protein abundance from scRNA-seq data using machine learning models. However, existing machine learning methods have not considered relationship among the proteins sufficiently. Results We formulate this task in a multi-label prediction framework where multiple proteins are linked to each other at the single-cell level. Then, we propose a novel method for single-cell RNA to protein prediction named PIKE-R2P, which incorporates protein–protein interactions (PPI) and prior knowledge embedding into a graph neural network. Compared with existing methods, PIKE-R2P could significantly improve prediction performance in terms of smaller errors and higher correlations with the gold standard measurements. Conclusion The superior performance of PIKE-R2P indicates that adding the prior knowledge of PPI to graph neural networks can be a powerful strategy for cross-modality prediction of protein abundances at the single-cell level.


2021 ◽  
Author(s):  
Chengbin Hu ◽  
Yiru Qin ◽  
Chuan Ye ◽  
Jiao Jin ◽  
Ting Zhou ◽  
...  

Abstract Background: Many proteins or partial regions of proteins do not have stable and well-defined three-dimensional structures in vitro. Understanding Intrinsically Disorder Proteins (IDPs) is significant for interpreting biological function as well as studying many diseases. Although more than 70 disorder predictors have been invented, many existing predictors are limited on the characteristics of proteins and do not have very high accuracy. Therefore, it is critical to formulate new strategies on disorder protein prediction. Results: Here, we propose a machine learning meta-strategy to improve the accuracy of disordered proteins and disordered regions prediction. We first use logistic forward parameter selection to select eight most significant predictors from the current available IDP predictors. Then we design a novel meta-strategy using several machine learning models, including Decision-tree based algorithm, Naive Bayes, Random forest, and Convolutional Neural Network (CNN). By applying different strategies, the results suggest Random forest can improve the predicted single amino acid accuracy significantly to 93.35%. Using the combination vector data of eight most significant predictors as input, the Convolution Neural Network can improve the whole protein prediction to 95.62%. Conclusion: According to the performance of our machine learning meta-strategy, the Random forest and CNN models can improve the accuracy to predict IDPs.


2020 ◽  
Author(s):  
Chengbin Hu ◽  
Yiru Qin ◽  
Chuan Ye ◽  
Jiao jin ◽  
Ting Zhou ◽  
...  

Background: Many proteins or partial regions of proteins do not have stable and well-defined three-dimensional structures in vitro. Understanding intrinsically disorder proteins (IDPs) is significant for interpreting biological function as well as studying many diseases. Although more than 70 disorder predictors have been invented, many existing predictors are limited on the characteristics of proteins and do not have very high accuracy. Therefore, it is critical to formulate new strategies on disorder protein prediction. Results: Here, we propose a machine learning meta-strategy to improve the accuracy of disordered proteins and disordered regions prediction. We first use logistic forward parameter selection to select eight most significant predictors from the current available IDP predictors. Then we design a novel meta-strategy using several machine learning models, including Decision-tree based algorithm, Naive Bayes, Random forest, and Convolutional Neural Network (CNN). By applying different strategies, the results suggest Random forest can improve the predicted single amino acid accuracy significantly to 93.35%. Using the combination vector data of eight most significant predictors as input, the Convolution Neural Network can improve the whole protein prediction to 95.62%. Conclusion: According to the performance of our machine learning meta-strategy, the Random forest and CNN models can improve the accuracy to predict intrinsically disorder proteins.


2000 ◽  
Vol 25 (4) ◽  
pp. 325-325
Author(s):  
J.L.N. Roodenburg ◽  
H.J. Van Staveren ◽  
N.L.P. Van Veen ◽  
O.C. Speelman ◽  
J.M. Nauta ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document