lattice proteins
Recently Published Documents


TOTAL DOCUMENTS

53
(FIVE YEARS 6)

H-INDEX

14
(FIVE YEARS 2)

2021 ◽  
Vol 8 (3) ◽  
pp. 291-306
Author(s):  
Salomón J. Alas-Guardado ◽  
◽  
Pedro Pablo González-Pérez ◽  
Hiram Isaac Beltrán ◽  
◽  
...  

<abstract> <p>Many of the simplistic hydrophobic-polar lattice models, such as Dill's model (called <bold>Model 1</bold> herein), are aimed to fold structures through hydrophobic-hydrophobic interactions mimicking the well-known hydrophobic collapse present in protein structures. In this work, we studied 11 designed hydrophobic-polar sequences, S<sub>1</sub>-S<sub>8</sub> folded in 2D-square lattice, and S<sub>9</sub>-S<sub>11</sub> folded in 3D-cubic lattice. And to better fold these structures we have developed <bold>Model 2</bold> as an approximation to convex function aimed to weight hydrophobic-hydrophobic but also polar-polar contacts as an augmented version of <bold>Model 1</bold>. In this partitioned approach hydrophobic-hydrophobic ponderation was tuned as <italic>α</italic>-1 and polar-polar ponderation as <italic>α</italic>. This model is centered in preserving required hydrophobic substructure, and at the same time including polar-polar interactions, otherwise absent, to reach a better folding score now also acquiring the polar-polar substructure. In all tested cases the folding trials were better achieved with <bold>Model 2</bold>, using <italic>α</italic> values of 0.05, 0.1, 0.2 and 0.3 depending of sequence size, even finding optimal scores not reached with <bold>Model 1</bold>. An important result is that the better folding score, required the lower <italic>α</italic> weighting. And when <italic>α</italic> values above 0.3 are employed, no matter the nature of the hydrophobic-polar sequence, banning of hydrophobic-hydrophobic contacts started, thus yielding misfolding of sequences. Therefore, the value of <italic>α</italic> to correctly fold structures is the result of a careful weighting among hydrophobic-hydrophobic and polar-polar contacts.</p> </abstract>


Genetics ◽  
2020 ◽  
Vol 214 (4) ◽  
pp. 1047-1057 ◽  
Author(s):  
Jason Bertram ◽  
Joanna Masel

The “fitness” landscapes of genetic sequences are characterized by high dimensionality and “ruggedness” due to sign epistasis. Ascending from low to high fitness on such landscapes can be difficult because adaptive trajectories get stuck at low-fitness local peaks. Compounding matters, recent theoretical arguments have proposed that extremely long, winding adaptive paths may be required to reach even local peaks: a “maze-like” landscape topography. The extent to which peaks and mazes shape the mode and tempo of evolution is poorly understood, due to empirical limitations and the abstractness of many landscape models. We explore the prevalence, scale, and evolutionary consequences of landscape mazes in a biophysically grounded computational model of protein evolution that captures the “frustration” between “stability” and aggregation propensity. Our stability-aggregation landscape exhibits extensive sign epistasis and local peaks galore. Although this frequently obstructs adaptive ascent to high fitness and virtually eliminates reproducibility of evolutionary outcomes, many adaptive paths do successfully complete the ascent from low to high fitness, with hydrophobicity a critical mediator of success. These successful paths exhibit maze-like properties on a global landscape scale, in which taking an indirect path helps to avoid low-fitness local peaks. This delicate balance of “hard but possible” adaptation could occur more broadly in other biological settings where competing interactions and frustration are important.


2019 ◽  
Author(s):  
Jason Bertram ◽  
Joanna Masel

AbstractFitness landscapes are widely used to visualize the dynamics and long-term outcomes of evolution. The fitness landscapes of genetic sequences are characterized by high dimensionality and “ruggedness” due to sign epistasis. Ascending from low to high fitness on such landscapes can be difficult because adaptive trajectories get stuck at low-fitness local peaks. Compounding matters, recent computational complexity arguments have proposed that extremely long, winding adaptive paths may be required to even reach local peaks, a “maze-like” landscape topography. The extent to which peaks and mazes shape the mode and tempo of evolution is poorly understood due to empirical limitations and the abstractness of many landscape models. We develop a biophysically-grounded computational model of protein evolution based on two novel extensions of the classic hydrophobic-polar lattice model of protein folding. First, rather than just considering fold stability we account for the tradeoff between stability and aggregation propensity. Second, we use a “hydrophobic zipping” algorithm to kinetically generate ensembles of post-translationally folded structures. Our stability-aggregation fitness landscape exhibits extensive sign epistasis and local peaks galore. We confirm the postulated existence of maze-like topography in our biologically-grounded landscape. Although these landscape features frequently obstruct adaptive ascent to high fitness and virtually eliminate reproducibility of evolutionary outcomes, many adaptive paths do successfully complete the long ascent from low to high fitness. This delicate balance of “hard but possible” adaptation could occur more broadly provided that the optimal outcomes possible under a tradeoff are improved by rare constraint-breaking substitutions.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Jérôme Tubiana ◽  
Simona Cocco ◽  
Rémi Monasson

Statistical analysis of evolutionary-related protein sequences provides information about their structure, function, and history. We show that Restricted Boltzmann Machines (RBM), designed to learn complex high-dimensional data and their statistical features, can efficiently model protein families from sequence information. We here apply RBM to 20 protein families, and present detailed results for two short protein domains (Kunitz and WW), one long chaperone protein (Hsp70), and synthetic lattice proteins for benchmarking. The features inferred by the RBM are biologically interpretable: they are related to structure (residue-residue tertiary contacts, extended secondary motifs (α-helixes and β-sheets) and intrinsically disordered regions), to function (activity and ligand specificity), or to phylogenetic identity. In addition, we use RBM to design new protein sequences with putative properties by composing and 'turning up' or 'turning down' the different modes at will. Our work therefore shows that RBM are versatile and practical tools that can be used to unveil and exploit the genotype–phenotype relationship for protein families.


2019 ◽  
Vol 116 (3) ◽  
pp. 476a
Author(s):  
Austin H. Cheng ◽  
Cory J. Kim ◽  
Amy Y. Wang ◽  
Xuanye Zhu ◽  
Qizhang Jia ◽  
...  
Keyword(s):  

2019 ◽  
Vol 116 (3) ◽  
pp. 339a
Author(s):  
Xuanye Zhu ◽  
Qizhang Jia ◽  
Kateri H. DuBay
Keyword(s):  

2018 ◽  
Vol 44 (12) ◽  
pp. 1025-1030
Author(s):  
Matthew S. Wilson ◽  
Guangjie Shi ◽  
Thomas Wüst ◽  
Ying Wai Li ◽  
David P. Landau

Sign in / Sign up

Export Citation Format

Share Document