protein folds
Recently Published Documents


TOTAL DOCUMENTS

223
(FIVE YEARS 18)

H-INDEX

49
(FIVE YEARS 3)

2021 ◽  
Author(s):  
Shintaro Minami ◽  
Naohiro Kobayashi ◽  
Toshihiko Sugiki ◽  
Toshio Nagashima ◽  
Toshimichi Fujiwara ◽  
...  

Most naturally occurring protein folds have likely been discovered. The question is whether natural evolution has exhaustively sampled almost all possible protein folds, or whether a large fraction of the possible folds remains unexplored. To address this question, we introduce a set of rules for β-sheet topology to predict novel folds, and carry out the systematic de novo protein design for the novel folds predicted by the rules. The rules predicted eight novel αβ-folds with a four-stranded β-sheet, including a knot-forming one. We designed proteins for all the predicted αβ-folds and found that all the designs are monomeric with high thermal stability and fold into the structures close to the design models, demonstrating the ability of the set of rules to predict novel αβ-folds. The rules also predicted about twelve thousand novel αβ-folds with five- to eight-stranded β-sheets ; the number is far exceeding the number of αβ-folds observed so far. This result suggests that the enormous number of αβ-folds are possible but have not emerged or become extinct due to evolutionary bias. The predicted novel folds should open up the possibility of designing functional proteins of our interests.


2021 ◽  
Author(s):  
T Linsky ◽  
K Noble ◽  
A Tobin ◽  
R Crow ◽  
Lauren Carter ◽  
...  

Nature only samples a small fraction in sequence space, yet many more amino acid combinations can fold into stable proteins. Furthermore, small structural variations in a single fold, which may only be a few amino acids different from the next homolog, define their molecular function. Hence, to design proteins with novel molecular functionalities, such as molecular recognition, methods to control and sample shape diversity are necessary. To explore this space, we developed and experimentally validated a computational platform that can design a wide variety of small protein folds while sampling high shape diversity. We designed and evaluated about 30,000 de novo protein designs of 7 different folds. Among these designs, about 6,200 stable proteins were identified, with predicted structures having first-of-its-kind minimalized thioredoxin. Obtained data revealed more protein folding rules, such as helix connecting loops, which were in nature. Beyond providing a resource database for protein engineering, our data presents a large training data set for machine learning. We developed a high-accuracy classifier to predict the stability of our designed proteins. The methods and the wide range of new protein shapes provide a basis for the design of new protein function without compromising stability.


2021 ◽  
Vol 2 (1) ◽  
pp. 63-76
Author(s):  
Xingjian Xu ◽  
Igor Dikiy ◽  
Matthew R. Evans ◽  
Leandro P. Marcelino ◽  
Kevin H. Gardner

Abstract. Recent research on fold-switching metamorphic proteins has revealed some notable exceptions to Anfinsen's hypothesis of protein folding. We have previously described how a single point mutation can enable a well-folded protein domain, one of the two PAS (Per-ARNT-Sim) domains of the human ARNT (aryl hydrocarbon receptor nuclear translocator) protein, to interconvert between two conformers related by a slip of an internal β strand. Using this protein as a test case, we advance the concept of a “fragile fold”, a protein fold that can reversibly rearrange into another fold that differs by a substantial number of hydrogen bonds, entailing reorganization of single secondary structure elements to more drastic changes seen in metamorphic proteins. Here we use a battery of biophysical tests to examine several factors affecting the equilibrium between the two conformations of the switching ARNT PAS-B Y456T protein. Of note is that we find that factors which impact the HI loop preceding the shifted Iβ strand affect both the equilibrium levels of the two conformers and the denatured state which links them in the interconversion process. Finally, we describe small molecules that selectively bind to and stabilize the wild-type conformation of ARNT PAS-B. These studies form a toolkit for studying fragile protein folds and could enable ways to modulate the biological functions of such fragile folds, both in natural and engineered proteins.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Joaquim Aguirre-Plans ◽  
Alberto Meseguer ◽  
Ruben Molina-Fernandez ◽  
Manuel Alejandro Marín-López ◽  
Gaurav Jumde ◽  
...  

Abstract Background Statistical potentials, also named knowledge-based potentials, are scoring functions derived from empirical data that can be used to evaluate the quality of protein folds and protein–protein interaction (PPI) structures. In previous works we decomposed the statistical potentials in different terms, named Split-Statistical Potentials, accounting for the type of amino acid pairs, their hydrophobicity, solvent accessibility and type of secondary structure. These potentials have been successfully used to identify near-native structures in protein structure prediction, rank protein docking poses, and predict PPI binding affinities. Results Here, we present the SPServer, a web server that applies the Split-Statistical Potentials to analyze protein folds and protein interfaces. SPServer provides global scores as well as residue/residue-pair profiles presented as score plots and maps. This level of detail allows users to: (1) identify potentially problematic regions on protein structures; (2) identify disrupting amino acid pairs in protein interfaces; and (3) compare and analyze the quality of tertiary and quaternary structural models. Conclusions While there are many web servers that provide scoring functions to assess the quality of either protein folds or PPI structures, SPServer integrates both aspects in a unique easy-to-use web server. Moreover, the server permits to locally assess the quality of the structures and interfaces at a residue level and provides tools to compare the local assessment between structures. Server address https://sbi.upf.edu/spserver/.


2021 ◽  
Author(s):  
Xingjian Xu ◽  
Igor Dikiy ◽  
Matthew R. Evans ◽  
Leandro P. Marcelino ◽  
Kevin H. Gardner

Abstract. Recent research on fold-switching metamorphic proteins has revealed some notable exceptions to Anfinsen's hypothesis of protein folding. We have previously described how a single point mutation can enable a well-folded protein domain, one of the two PAS (Per-ARNT-Sim) domains of the human ARNT (aryl hydrocarbon receptor nuclear translocator) protein, to interconvert between two conformers related by a slip of an internal beta-strand. Using this protein as a test case, we advance the concept of a fragile fold, a protein fold that can reversibly rearrange into another fold that differs by a substantial number of hydrogen bonds, entailing reorganization of single secondary structure elements to more drastic changes seen in metamorphic proteins. Here we use a battery of biophysical tests to examine several factors affecting the equilibrium between the two conformations of the switching ARNT PAS-B Y456T protein. Of note, we find that factors which impact the HI loop preceding the shifted I(beta)-strand affect both the equilibrium levels of the two conformers and the denatured state which links them in the interconversion process. Finally, we describe small molecules that selectively bind to and stabilize the wildtype conformation of ARNT PAS-B. These studies form a toolkit for studying fragile protein folds and could enable ways to modulate the biological functions of such fragile folds, both in natural and engineered proteins.


2020 ◽  
Vol 16 (10) ◽  
pp. e1008285
Author(s):  
Pengfei Tian ◽  
Robert B. Best

2020 ◽  
Author(s):  
Pengfei Tian ◽  
Robert B. Best

AbstractMost foldable protein sequences adopt only a single native fold. Recent protein design studies have, however, created protein sequences which fold into different structures apon changes of environment, or single point mutation, the best characterized example being the switch between the folds of the GA and GB binding domains of streptococcal protein G. To obtain further insight into the design of sequences which can switch folds, we have used a computational model for the fitness landscape of a single fold, built from the observed sequence variation of protein homologues. We have recently shown that such coevolutionary models can be used to design novel foldable sequences. By appropriately combining two of these models to describe the joint fitness landscape of GA and GB, we are able to describe the propensity of a given sequence for each of the two folds. We have successfully tested the combined model against the known series of designed GA/GB hybrids. Using Monte Carlo simulations on this landscape, we are able to identify pathways of mutations connecting the two folds. In the absence of a requirement for domain stability, the most frequent paths go via sequences in which neither domain is stably folded, reminiscent of the propensity for certain intrinsically disordered proteins to fold into different structures according to context. Even if the folded state is required to be stable, we find that there is nonetheless still a wide range of sequences which are close to the transition region and therefore likely fold switches, consistent with recent estimates that fold switching may be more widespread than had been thought.Author SummaryWhile most proteins self-assemble (or “fold”) to a unique three-dimensional structure, a few have been identified that can fold into two distinct structures. These so-called “metamorphic” proteins that can switch folds have attracted a lot of recent interest, and it has been suggested that they may be much more widespread than currently appreciated. We have developed a computational model that captures the propensity of a given protein sequence to fold into either one of two specific structures (GA and GB), in order to investigate which sequences are able to fold to both GA and GB (“switch sequences”), versus just one of them. Our model predicts that there is a large number of switch sequences that could fold into both structures, but also that the most likely such sequences are those for which the folded structures have low stability, in agreement with available experimental data. This also suggests that intrinsically disordered proteins which can fold into different structures on binding may provide an evolutionary path in sequence space between protein folds.


Sign in / Sign up

Export Citation Format

Share Document