scholarly journals Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis

2019 ◽  
Vol 116 (33) ◽  
pp. 16367-16377 ◽  
Author(s):  
Alex Nisthal ◽  
Connie Y. Wang ◽  
Marie L. Ary ◽  
Stephen L. Mayo

The accurate prediction of protein stability upon sequence mutation is an important but unsolved challenge in protein engineering. Large mutational datasets are required to train computational predictors, but traditional methods for collecting stability data are either low-throughput or measure protein stability indirectly. Here, we develop an automated method to generate thermodynamic stability data for nearly every single mutant in a small 56-residue protein. Analysis reveals that most single mutants have a neutral effect on stability, mutational sensitivity is largely governed by residue burial, and unexpectedly, hydrophobics are the best tolerated amino acid type. Correlating the output of various stability-prediction algorithms against our data shows that nearly all perform better on boundary and surface positions than for those in the core and are better at predicting large-to-small mutations than small-to-large ones. We show that the most stable variants in the single-mutant landscape are better identified using combinations of 2 prediction algorithms and including more algorithms can provide diminishing returns. In most cases, poor in silico predictions were tied to compositional differences between the data being analyzed and the datasets used to train the algorithm. Finally, we find that strategies to extract stabilities from high-throughput fitness data such as deep mutational scanning are promising and that data produced by these methods may be applicable toward training future stability-prediction tools.

2018 ◽  
Author(s):  
Alex Nisthal ◽  
Connie Y. Wang ◽  
Marie L. Ary ◽  
Stephen L. Mayo

AbstractThe accurate prediction of protein stability upon sequence mutation is an important but unsolved challenge in protein engineering. Large mutational datasets are required to train computational predictors, but traditional methods for collecting stability data are either low-throughput or measure protein stability indirectly. Here, we develop an automated method to generate thermodynamic stability data for nearly every single mutant in a small 56-residue protein. Analysis reveals that most single mutants have a neutral effect on stability, mutational sensitivity is largely governed by residue burial, and unexpectedly, hydrophobics are the best tolerated amino acid type. Correlating the output of various stability prediction algorithms against our data shows that nearly all perform better on boundary and surface positions than for those in the core, and are better at predicting large to small mutations than small to large ones. We show that the most stable variants in the single mutant landscape are better identified using combinations of two prediction algorithms, and that including more algorithms can provide diminishing returns. In most cases, poor in silico predictions were tied to compositional differences between the data being analyzed and the datasets used to train the algorithm. Finally, we find that strategies to extract stabilities from high-throughput fitness data such as deep mutational scanning are promising and that data produced by these methods may be applicable toward training future stability prediction tools.Significance StatementUsing liquid-handling automation, we constructed and measured the thermodynamic stability of almost every single mutant of protein G (Gβ1), a small domain. This self-consistent dataset is the largest of its kind and offers unique opportunities on two fronts: (i) insight into protein domain properties such as positional sensitivity and incorporated amino acid tolerance, and (ii) service as a validation set for future efforts in protein stability prediction. As Gβ1 is a model system for protein folding and design, and its single mutant landscape has been measured by deep mutational scanning, we expect our dataset to serve as a reference for studies aimed at extracting stability information from fitness data or developing novel high-throughput stability assays.


2021 ◽  
Author(s):  
Yashas Samaga B L ◽  
Shampa Raghunathan ◽  
U. Deva Priyakumar

<div>Engineering proteins to have desired properties by mutating amino acids at specific sites is commonplace. Such engineered proteins must be stable to function. Experimental methods used to determine stability at throughputs required to scan the protein sequence space thoroughly are laborious. To this end, many machine learning based methods have been developed to predict thermodynamic stability changes upon mutation. These methods have been evaluated for symmetric consistency by testing with hypothetical reverse mutations. In this work, we propose transitive data augmentation, evaluating transitive consistency, and a new machine learning based method, first of its kind, that incorporates both symmetric and transitive properties into the architecture. Our method, called SCONES, is an interpretable neural network that estimates a residue's contributions towards protein stability dG in its local structural environment. The difference between independently predicted contributions of the reference and mutant residues in a missense mutation is reported as dG. We show that this self-consistent machine learning architecture is immune to many common biases in datasets, relies less on data than existing methods, and is robust to overfitting.</div><div><br></div>


2001 ◽  
Vol 50 (11) ◽  
pp. 847-855 ◽  
Author(s):  
Junichi KOUCHI ◽  
Tatsuru TABOHASHI ◽  
Shoko YOKOYAMA ◽  
Fuminori HARUSAWA ◽  
Aritomo YAMAGUCHI ◽  
...  
Keyword(s):  

2001 ◽  
Vol 20 (1) ◽  
pp. 79-86 ◽  
Author(s):  
Tatsuru Tabohashi ◽  
Kazuhiko Tobita ◽  
Kazutami Sakamoto ◽  
Junichi Kouchi ◽  
Shoko Yokoyama ◽  
...  

2001 ◽  
Vol 47 (6) ◽  
pp. 470-475 ◽  
Author(s):  
Shi-Fa Wang ◽  
Takeshi Furuno ◽  
Zhi Cheng

2006 ◽  
Vol 34 (1) ◽  
pp. 13-21 ◽  
Author(s):  
Peter S. C. Wu ◽  
Kiyoshi Ozawa ◽  
Slobodan Jergic ◽  
Xun-Cheng Su ◽  
Nicholas E. Dixon ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document