scholarly journals Dead-end elimination with perturbations (DEEPer): A provable protein design algorithm with continuous sidechain and backbone flexibility

2012 ◽  
Vol 81 (1) ◽  
pp. 18-39 ◽  
Author(s):  
Mark A. Hallen ◽  
Daniel A. Keedy ◽  
Bruce R. Donald
2018 ◽  
Author(s):  
Mark A. Hallen

AbstractProtein design algorithms must search an enormous conformational space to identify favorable conformations. As a result, those that perform this search with guarantees of accuracy generally start with a conformational pruning step, such as dead-end elimination (DEE). However, the mathematical assumptions of DEE-based pruning algorithms have up to now severely restricted the biophysical model that can feasibly be used in protein design. To lift these restrictions, I propose to prune local unrealistic geometries (PLUG) using a linear programming-based method. PLUG’s biophysical model consists only of well-known lower bounds on interatomic distances. PLUG is intended as pre-processing for energy-based protein design calculations, whose biophysical model need not support DEE pruning. Based on 96 test cases, PLUG is at least as effective at pruning as DEE for larger protein designs—the type that most require pruning. When combined with the LUTE protein design algorithm, PLUG greatly facilitates designs that account for continuous entropy, large multistate designs with continuous flexibility, and designs with extensive continuous backbone flexibility and advanced non-pairwise energy functions. Many of these designs are tractable only with PLUG, either for empirical reasons (LUTE’s machine learning step achieves an accurate fit only after PLUG pruning), or for theoretical reasons (many energy functions are fundamentally incompatible with DEE).


2013 ◽  
Author(s):  
Eleisha L. Jackson ◽  
Noah Ollikainen ◽  
Arthur W. Covert III ◽  
Tanja Kortemme ◽  
Claus O. Wilke

Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.


2019 ◽  
Vol 36 (1) ◽  
pp. 122-130
Author(s):  
Jelena Vucinic ◽  
David Simoncini ◽  
Manon Ruffini ◽  
Sophie Barbe ◽  
Thomas Schiex

Abstract Motivation Structure-based computational protein design (CPD) plays a critical role in advancing the field of protein engineering. Using an all-atom energy function, CPD tries to identify amino acid sequences that fold into a target structure and ultimately perform a desired function. The usual approach considers a single rigid backbone as a target, which ignores backbone flexibility. Multistate design (MSD) allows instead to consider several backbone states simultaneously, defining challenging computational problems. Results We introduce efficient reductions of positive MSD problems to Cost Function Networks with two different fitness definitions and implement them in the Pompd (Positive Multistate Protein design) software. Pompd is able to identify guaranteed optimal sequences of positive multistate full protein redesign problems and exhaustively enumerate suboptimal sequences close to the MSD optimum. Applied to nuclear magnetic resonance and back-rubbed X-ray structures, we observe that the average energy fitness provides the best sequence recovery. Our method outperforms state-of-the-art guaranteed computational design approaches by orders of magnitudes and can solve MSD problems with sizes previously unreachable with guaranteed algorithms. Availability and implementation https://forgemia.inra.fr/thomas.schiex/pompd as documented Open Source. Supplementary information Supplementary data are available at Bioinformatics online.


2016 ◽  
Vol 44 (5) ◽  
pp. 1523-1529 ◽  
Author(s):  
James T. MacDonald ◽  
Paul S. Freemont

The computational algorithms used in the design of artificial proteins have become increasingly sophisticated in recent years, producing a series of remarkable successes. The most dramatic of these is the de novo design of artificial enzymes. The majority of these designs have reused naturally occurring protein structures as ‘scaffolds’ onto which novel functionality can be grafted without having to redesign the backbone structure. The incorporation of backbone flexibility into protein design is a much more computationally challenging problem due to the greatly increased search space, but promises to remove the limitations of reusing natural protein scaffolds. In this review, we outline the principles of computational protein design methods and discuss recent efforts to consider backbone plasticity in the design process.


2009 ◽  
Vol 20 (4) ◽  
pp. 420-428 ◽  
Author(s):  
Daniel J Mandell ◽  
Tanja Kortemme

2019 ◽  
Author(s):  
Fabian Sesterhenn ◽  
Che Yang ◽  
Jaume Bonet ◽  
Johannes T Cramer ◽  
Xiaolin Wen ◽  
...  

AbstractDe novo protein design has been successful in expanding the natural protein repertoire. However, most de novo proteins lack biological function, presenting a major methodological challenge. In vaccinology, the induction of precise antibody responses remains a cornerstone for next-generation vaccines. Here, we present a novel protein design algorithm, termed TopoBuilder, with which we engineered epitope-focused immunogens displaying complex structural motifs. Both in mice and non-human primates, cocktails of three de novo designed immunogens induced robust neutralizing responses against the respiratory syncytial virus. Furthermore, the immunogens refocused pre-existing antibody responses towards defined neutralization epitopes. Overall, our de novo design approach opens the possibility of targeting specific epitopes for vaccine and therapeutic antibody development, and more generally will be applicable to design de novo proteins displaying complex functional motifs.


2013 ◽  
Author(s):  
Eleisha L. Jackson ◽  
Noah Ollikainen ◽  
Arthur W. Covert III ◽  
Tanja Kortemme ◽  
Claus O. Wilke

Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.


2007 ◽  
Vol 28 (13) ◽  
pp. 2122-2129 ◽  
Author(s):  
Chen Yanover ◽  
Menachem Fromer ◽  
Julia M. Shifman
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document