Improving MHC-peptide binding prediction from sparse data with a multi-allele graphical model and active learning

2016 ◽  
Author(s):  
Li Jiang
2021 ◽  
Vol 12 ◽  
Author(s):  
Ido Springer ◽  
Nili Tickotsky ◽  
Yoram Louzoun

IntroductionPredicting the binding specificity of T Cell Receptors (TCR) to MHC-peptide complexes (pMHCs) is essential for the development of repertoire-based biomarkers. This affinity may be affected by different components of the TCR, the peptide, and the MHC allele. Historically, the main element used in TCR-peptide binding prediction was the Complementarity Determining Region 3 (CDR3) of the beta chain. However, recently the contribution of other components, such as the alpha chain and the other V gene CDRs has been suggested. We use a highly accurate novel deep learning-based TCR-peptide binding predictor to assess the contribution of each component to the binding.MethodsWe have previously developed ERGO-I (pEptide tcR matchinG predictiOn), a sequence-based T-cell receptor (TCR)-peptide binding predictor that employs natural language processing (NLP) -based methods. We improved it to create ERGO-II by adding the CDR3 alpha segment, the MHC typing, V and J genes, and T cell type (CD4+ or CD8+) as to the predictor. We then estimate the contribution of each component to the prediction.Results and DiscussionERGO-II provides for the first time high accuracy prediction of TCR-peptide for previously unseen peptides. For most tested peptides and all measures of binding prediction accuracy, the main contribution was from the beta chain CDR3 sequence, followed by the beta chain V and J and the alpha chain, in that order. The MHC allele was the least contributing component. ERGO-II is accessible as a webserver at http://tcr2.cs.biu.ac.il/ and as a standalone code at https://github.com/IdoSpringer/ERGO-II.


2008 ◽  
Vol 9 (1) ◽  
pp. 8 ◽  
Author(s):  
Hong Lin ◽  
Surajit Ray ◽  
Songsak Tongchusak ◽  
Ellis L Reinherz ◽  
Vladimir Brusic

Author(s):  
Pandjassarame Kangueane ◽  
Bing Zhao ◽  
Meena K. Sakharkar

2017 ◽  
Author(s):  
Yeeleng S. Vang ◽  
Xiaohui Xie

AbstractMany biological processes are governed by protein-ligand interactions. One such example is the recognition of self and non-self cells by the immune system. This immune response process is regulated by the major histocompatibility complex (MHC) protein which is encoded by the human leukocyte antigen (HLA) complex. Understanding the binding potential between MHC and peptides can lead to the design of more potent, peptide-based vaccines and immunotherapies for infectious autoimmune diseases.We apply machine learning techniques from the natural language processing (NLP) domain to address the task of MHC-peptide binding prediction. More specifically, we introduce a new distributed representation of amino acids, name HLA-Vec, that can be used for a variety of downstream proteomic machine learning tasks. We then propose a deep convolutional neural network architecture, name HLA-CNN, for the task of HLA class I-peptide binding prediction. Experimental results show combining the new distributed representation with our HLA-CNN architecture acheives state-of-the-art results in the majority of the latest two Immune Epitope Database (IEDB) weekly automated benchmark datasets. We further apply our model to predict binding on the human genome and identify 15 genes with potential for self binding. Codes are available at https://github.com/uci-cbcl/HLA-bind.


2020 ◽  
Vol 12 (6) ◽  
pp. 1383-1395
Author(s):  
Ivan V Grebenkin ◽  
Andrey Evgen'evich Alekseenko ◽  
Nikolay A Gaivoronskiy ◽  
Mikhail G Ignatov ◽  
Andrey Maksimovich Kazennov ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document