A Basic Molecular Analysis of the Diabetic Antigen GAD by Homology Modelling. Principles of the Method and Understanding of Antigenicity and Binding Sites

Pteridines ◽  
2007 ◽  
Vol 18 (1) ◽  
pp. 79-94
Author(s):  
Marco Wiltgen ◽  
Gernot P. Tilz

Abstract Functional specificity of a protein is linked to its structure. A growing section of bioinformatics deals with the prediction and visualization of protein 3D structures. In homology modelling, a protein sequence with an unknown structure is aligned with sequences of known protein structures. By exploiting structural information from the known configurations, the new structure can be predicted. In this introductory paper, we will present the principles of homology modelling and demonstrate the method used, by determining the structure of the enzyme glutamic decarboxylase (GAD 65). This protein is an autoantigen involved in several human autoimmune diseases. We will illustrate the different steps in structure prediction of GAD 65 by use of two experimentally determined structures of pig kidney DOPA decarboxylase (one structure in complex with the inhibitor carbidopa) as templates. The resulting model of GAD 65 provides detailed information about the active site of the protein and selected epitopes. By analysis of the interactions between the DOPA decarboxylase with the inhibitor carbidopa, the residues of the GAD 65 active site can be identified via the sequence alignment between DOPA and GAD 65. The locations of known epitopes in the molecule are visualized in special representations giving insights into mechanisms of antigenicity. Hydrophobicity analysis gives first hints for the adherence ability of GAD 65 to the cell membrane. Homology modelling is at present one of the most efficient techniques to provide accurate structural models of proteins. It is expected that in few years, for every new determined protein sequence, at least one member with a known structure of the same protein family will be available, which will steadily increase the importance and applicability of homology modelling.

2014 ◽  
Vol 70 (a1) ◽  
pp. C491-C491
Author(s):  
Jürgen Haas ◽  
Alessandro Barbato ◽  
Tobias Schmidt ◽  
Steven Roth ◽  
Andrew Waterhouse ◽  
...  

Computational modeling and prediction of three-dimensional macromolecular structures and complexes from their sequence has been a long standing goal in structural biology. Over the last two decades, a paradigm shift has occurred: starting from a large "knowledge gap" between the huge number of protein sequences compared to a small number of experimentally known structures, today, some form of structural information – either experimental or computational – is available for the majority of amino acids encoded by common model organism genomes. Methods for structure modeling and prediction have made substantial progress of the last decades, and template based homology modeling techniques have matured to a point where they are now routinely used to complement experimental techniques. However, computational modeling and prediction techniques often fall short in accuracy compared to high-resolution experimental structures, and it is often difficult to convey the expected accuracy and structural variability of a specific model. Retrospectively assessing the quality of blind structure prediction in comparison to experimental reference structures allows benchmarking the state-of-the-art in structure prediction and identifying areas which need further development. The Critical Assessment of Structure Prediction (CASP) experiment has for the last 20 years assessed the progress in the field of protein structure modeling based on predictions for ca. 100 blind prediction targets per experiment which are carefully evaluated by human experts. The "Continuous Model EvaluatiOn" (CAMEO) project aims to provide a fully automated blind assessment for prediction servers based on weekly pre-released sequences of the Protein Data Bank PDB. CAMEO has been made possible by the development of novel scoring methods such as lDDT, which are robust against domain movements to allow for automated continuous structure comparison without human intervention.


2019 ◽  
Vol 20 (10) ◽  
pp. 2442 ◽  
Author(s):  
Teppei Ikeya ◽  
Peter Güntert ◽  
Yutaka Ito

To date, in-cell NMR has elucidated various aspects of protein behaviour by associating structures in physiological conditions. Meanwhile, current studies of this method mostly have deduced protein states in cells exclusively based on ‘indirect’ structural information from peak patterns and chemical shift changes but not ‘direct’ data explicitly including interatomic distances and angles. To fully understand the functions and physical properties of proteins inside cells, it is indispensable to obtain explicit structural data or determine three-dimensional (3D) structures of proteins in cells. Whilst the short lifetime of cells in a sample tube, low sample concentrations, and massive background signals make it difficult to observe NMR signals from proteins inside cells, several methodological advances help to overcome the problems. Paramagnetic effects have an outstanding potential for in-cell structural analysis. The combination of a limited amount of experimental in-cell data with software for ab initio protein structure prediction opens an avenue to visualise 3D protein structures inside cells. Conventional nuclear Overhauser effect spectroscopy (NOESY)-based structure determination is advantageous to elucidate the conformations of side-chain atoms of proteins as well as global structures. In this article, we review current progress for the structure analysis of proteins in living systems and discuss the feasibility of its future works.


2019 ◽  
Vol 35 (22) ◽  
pp. 4854-4856 ◽  
Author(s):  
James D Stephenson ◽  
Roman A Laskowski ◽  
Andrew Nightingale ◽  
Matthew E Hurles ◽  
Janet M Thornton

Abstract Motivation Understanding the protein structural context and patterning on proteins of genomic variants can help to separate benign from pathogenic variants and reveal molecular consequences. However, mapping genomic coordinates to protein structures is non-trivial, complicated by alternative splicing and transcript evidence. Results Here we present VarMap, a web tool for mapping a list of chromosome coordinates to canonical UniProt sequences and associated protein 3D structures, including validation checks, and annotating them with structural information. Availability and implementation https://www.ebi.ac.uk/thornton-srv/databases/VarMap. Supplementary information Supplementary data are available at Bioinformatics online.


Science ◽  
2021 ◽  
pp. eabj8754
Author(s):  
Minkyung Baek ◽  
Frank DiMaio ◽  
Ivan Anishchenko ◽  
Justas Dauparas ◽  
Sergey Ovchinnikov ◽  
...  

DeepMind presented remarkably accurate predictions at the recent CASP14 protein structure prediction assessment conference. We explored network architectures incorporating related ideas and obtained the best performance with a three-track network in which information at the 1D sequence level, the 2D distance map level, and the 3D coordinate level is successively transformed and integrated. The three-track network produces structure predictions with accuracies approaching those of DeepMind in CASP14, enables the rapid solution of challenging X-ray crystallography and cryo-EM structure modeling problems, and provides insights into the functions of proteins of currently unknown structure. The network also enables rapid generation of accurate protein-protein complex models from sequence information alone, short circuiting traditional approaches which require modeling of individual subunits followed by docking. We make the method available to the scientific community to speed biological research.


2021 ◽  
Author(s):  
Qianmu Yuan ◽  
Sheng Chen ◽  
Jiahua Rao ◽  
Shuangjia Zheng ◽  
Huiying Zhao ◽  
...  

AbstractMotivationProtein-DNA interactions play crucial roles in the biological systems, and identifying protein-DNA binding sites is the first step for mechanistic understanding of various biological activities (such as transcription and repair) and designing novel drugs. How to accurately identify DNA-binding residues from only protein sequence remains a challenging task. Currently, most existing sequence-based methods only consider contextual features of the sequential neighbors, which are limited to capture spatial information.ResultsBased on the recent breakthrough in protein structure prediction by AlphaFold2, we propose an accurate predictor, GraphSite, for identifying DNA-binding residues based on the structural models predicted by AlphaFold2. Here, we convert the binding site prediction problem into a graph node classification task and employ a transformerbased variant model to take the protein structural information into account. By leveraging predicted protein structures and graph transformer, GraphSite substantially improves over the latest sequence-based and structure-based methods. The algorithm was further confirmed on the independent test set of 196 proteins, where GraphSite surpasses the state-of-the-art structure-based method by 12.3% in AUPR and 9.3% in MCC, [email protected]


2021 ◽  
Author(s):  
Kyungyong Seong ◽  
Ksenia V Krasileva

AbstractMagnaporthe oryzae relies on a diverse collection of secreted effector proteins to reprogram the host metabolic and immune responses for the pathogen’s benefit. Characterization of the effectors is thus critical for understanding the biology and host infection mechanisms of this phytopathogen. In rapid, divergent effector evolution, structural information has the potential to illuminate the unknown aspects of effectors that sequence analyses alone cannot reveal. It has recently become feasible to reliably predict the protein structures without depending on homologous templates. In this study, we tested structure modeling on 1854 secreted proteins from M. oryzae and evaluated success and obstacles involved in effector structure prediction. With sensitive homology search and structure-based clustering, we defined both distantly related homologous groups and structurally related analogous groups. With this dataset, we propose sequence-unrelated, structurally similar effectors are a common theme in M. oryzae and possibly in other phytopathogens. We incorporated the predicted models for structure-based annotations, molecular docking and evolutionary analyses to demonstrate how the predicted structures can deepen our understanding of effector biology. We also provide new experimentally testable structure-derived hypotheses of effector functions. Collectively, we propose that computational structural genomic approaches can now be an integral part of studying effector biology and provide valuable resources that were inaccessible before the advent of reliable, machine learning-based structure prediction.


2014 ◽  
Vol 11 (95) ◽  
pp. 20131147 ◽  
Author(s):  
Agnel Praveen Joseph ◽  
Alexandre G. de Brevern

Protein folding has been a major area of research for many years. Nonetheless, the mechanisms leading to the formation of an active biological fold are still not fully apprehended. The huge amount of available sequence and structural information provides hints to identify the putative fold for a given sequence. Indeed, protein structures prefer a limited number of local backbone conformations, some being characterized by preferences for certain amino acids. These preferences largely depend on the local structural environment. The prediction of local backbone conformations has become an important factor to correctly identifying the global protein fold. Here, we review the developments in the field of local structure prediction and especially their implication in protein fold recognition.


eLife ◽  
2015 ◽  
Vol 4 ◽  
Author(s):  
Sergey Ovchinnikov ◽  
Lisa Kinch ◽  
Hahnbeom Park ◽  
Yuxing Liao ◽  
Jimin Pei ◽  
...  

The prediction of the structures of proteins without detectable sequence similarity to any protein of known structure remains an outstanding scientific challenge. Here we report significant progress in this area. We first describe de novo blind structure predictions of unprecendented accuracy we made for two proteins in large families in the recent CASP11 blind test of protein structure prediction methods by incorporating residue–residue co-evolution information in the Rosetta structure prediction program. We then describe the use of this method to generate structure models for 58 of the 121 large protein families in prokaryotes for which three-dimensional structures are not available. These models, which are posted online for public access, provide structural information for the over 400,000 proteins belonging to the 58 families and suggest hypotheses about mechanism for the subset for which the function is known, and hypotheses about function for the remainder.


2018 ◽  
Author(s):  
Maher M. Kassem ◽  
Lars B. Christoffersen ◽  
Andrea Cavalli ◽  
Kresten Lindorff-Larsen

AbstractBased on the development of new algorithms and growth of sequence databases, it has recently become possible to build robust and informative higher-order statistical sequence models based on large sets of aligned protein sequences. By disentangling direct and indirect effects, such models have proven useful to assess phenotypic landscapes, determine protein-protein interaction sites, and in de novo structure prediction. In the context of structure prediction, the sequence models are used to find pairs of residues that co-vary during evolution, and hence are likely to be in spatial proximity in the functional native protein. The accuracy of these algorithms, however, drop dramatically when the number of sequences in the alignment is small, and thus the highest ranking pairs may include a substantial number of false positive predictions. We have developed a method that we termed CE-YAPP (CoEvolution-YAPP), that is based on YAPP (Yet Another Peak Processor), which has been shown to solve a similar problem in NMR spectroscopy. By simultaneously performing structure prediction and contact assignment, CE-YAPP uses structural self-consistency as a filter to remove false positive contacts. At the same time CE-YAPP solves another problem, namely how many contacts to choose from the ordered list of covarying amino acid pairs. Our results show that CE-YAPP consistently and substantially improves contact prediction from multiple sequence alignments, in particular for proteins that are difficult targets. We further show that CE-YAPP can be integrated with many different contact prediction methods, and thus will benefit also from improvements in algorithms for sequence analyses. Finally, we show that the structures determined from CE-YAPP are also in better agreement with those determined using traditional methods in structural biology.Author summaryHomologous proteins generally have similar functions and three-dimensional structures. This in turn means that it is possible to extract structural information from a detailed analysis of a multiple sequence alignment of a protein sequence. In particular, it has been shown that global statistical analyses of such sequence alignments allows one to find pairs of residues that have covaried during evolution, and that such pairs are likely to be in close contact in the folded protein structure. Although these insights have led to important developments in our ability to predict protein structures, these methods generally result in many false positive contacts predicted when the number of homologous sequences is not large. To deal with this issue, we have developed CE-YAPP, a method that can take a noisy set of predicted contacts as input and robustly detect many incorrectly predicted contacts within these. More specifically, our method performs simultaneous structure prediction and contact assignment so as to use structural self-consistency as a filter for erroneous predictions. In this way, CE-YAPP improves contact and structure predictions, and thus advances our ability to extract structural information from analyses of the evolutionary record of a protein.


Sign in / Sign up

Export Citation Format

Share Document