scholarly journals Learning the molecular grammar of protein condensates from sequence determinants and embeddings

2021 ◽  
Vol 118 (15) ◽  
pp. e2019053118
Author(s):  
Kadi L. Saar ◽  
Alexey S. Morgunov ◽  
Runzhang Qi ◽  
William E. Arter ◽  
Georg Krainer ◽  
...  

Intracellular phase separation of proteins into biomolecular condensates is increasingly recognized as a process with a key role in cellular compartmentalization and regulation. Different hypotheses about the parameters that determine the tendency of proteins to form condensates have been proposed, with some of them probed experimentally through the use of constructs generated by sequence alterations. To broaden the scope of these observations, we established an in silico strategy for understanding on a global level the associations between protein sequence and phase behavior and further constructed machine-learning models for predicting protein liquid–liquid phase separation (LLPS). Our analysis highlighted that LLPS-prone proteins are more disordered, less hydrophobic, and of lower Shannon entropy than sequences in the Protein Data Bank or the Swiss-Prot database and that they show a fine balance in their relative content of polar and hydrophobic residues. To further learn in a hypothesis-free manner the sequence features underpinning LLPS, we trained a neural network-based language model and found that a classifier constructed on such embeddings learned the underlying principles of phase behavior at a comparable accuracy to a classifier that used knowledge-based features. By combining knowledge-based features with unsupervised embeddings, we generated an integrated model that distinguished LLPS-prone sequences both from structured proteins and from unstructured proteins with a lower LLPS propensity and further identified such sequences from the human proteome at a high accuracy. These results provide a platform rooted in molecular principles for understanding protein phase behavior. The predictor, termed DeePhase, is accessible from https://deephase.ch.cam.ac.uk/.

2012 ◽  
Vol 560-561 ◽  
pp. 701-708 ◽  
Author(s):  
Lu Li ◽  
Jie Xu ◽  
Tao Fang ◽  
Jin Geng ◽  
Detlef Freitag ◽  
...  

Electrospinning combined with nonsolvent-induced phase separation is a simple and novel method to produce porous nanofibers. In the study, Poly (vinylpyrrolidone) (PVP) nanofibers were fabricated using an electrospinning approach complemented by compressed nitrogen (N). N2 was used as the nonsolvent of choice. Besides, the tun2ning of N2 pressure and temperature can impact the nanofibers’ morphologies by altering phase behavior and mass transfer. Also, the other parameters affecting electrospinning of polymer solution were discussed. The results were demonstrated by extending the technique to PVP/dichloromethane (DCM) and PVP/ethanol (EtOH) systems. And the binary phase equilibrium of solvents (dichloromethane or ethanol) and N simulated by ASPEN PLUS 2006 demonstrates that N is not a 2good choice for producing hollow or po2rous polymer nanofibers.


2016 ◽  
Vol 18 (21) ◽  
pp. 14252-14256 ◽  
Author(s):  
Julian Schulze ◽  
Johannes Möller ◽  
Jonathan Weine ◽  
Karin Julius ◽  
Nico König ◽  
...  

Dense protein solutions exhibit a reentrant liquid–liquid phase separation region at high pressures.


2007 ◽  
Vol 104 (9) ◽  
pp. 3165-3170 ◽  
Author(s):  
T. Baumgart ◽  
A. T. Hammond ◽  
P. Sengupta ◽  
S. T. Hess ◽  
D. A. Holowka ◽  
...  

2012 ◽  
Vol 287 (42) ◽  
pp. 34979-34991 ◽  
Author(s):  
Christopher C. Valley ◽  
Alessandro Cembran ◽  
Jason D. Perlmutter ◽  
Andrew K. Lewis ◽  
Nicholas P. Labello ◽  
...  

Of the 20 amino acids, the precise function of methionine (Met) remains among the least well understood. To establish a determining characteristic of methionine that fundamentally differentiates it from purely hydrophobic residues, we have used in vitro cellular experiments, molecular simulations, quantum calculations, and a bioinformatics screen of the Protein Data Bank. We show that approximately one-third of all known protein structures contain an energetically stabilizing Met-aromatic motif and, remarkably, that greater than 10,000 structures contain this motif more than 10 times. Critically, we show that as compared with a purely hydrophobic interaction, the Met-aromatic motif yields an additional stabilization of 1–1.5 kcal/mol. To highlight its importance and to dissect the energetic underpinnings of this motif, we have studied two clinically relevant TNF ligand-receptor complexes, namely TRAIL-DR5 and LTα-TNFR1. In both cases, we show that the motif is necessary for high affinity ligand binding as well as function. Additionally, we highlight previously overlooked instances of the motif in several disease-related Met mutations. Our results strongly suggest that the Met-aromatic motif should be exploited in the rational design of therapeutics targeting a range of proteins.


2018 ◽  
Vol 660 ◽  
pp. 77-81 ◽  
Author(s):  
Chanita Sungkapreecha ◽  
Mark J. Beily ◽  
Jörg Kressler ◽  
Walter W. Focke ◽  
René Androsch

2021 ◽  
Author(s):  
Yiming Tang ◽  
Santu Bera ◽  
Yifei Yao ◽  
Jiyuan Zeng ◽  
Zenghui Lao ◽  
...  

AbstractLiquid-liquid phase separation (LLPS) of proteins mediates the assembly of biomolecular condensates involved in physiological and pathological processes. Identifying the minimalistic building blocks and the sequence determinant of protein phase separation is of urgent importance but remains challenging due to the enormous sequence space and difficulties of existing methodologies in characterizing the phase behavior of ultrashort peptides. Here we demonstrate computational tools to efficiently quantify the microscopic fluidity and density of liquid-condensates/solid-aggregates and the temperature-dependent phase diagram of peptides. Utilizing our approaches, we comprehensively predict the LLPS abilities of all 400 dipeptide combinations of coded amino acids based on 492 micro-second molecular dynamics simulations, and observe the occurrences of spontaneous LLPS. We identify 54 dipeptides that form solid-like aggregates and three categories of dipeptides with high LLPS propensity. Our predictions are validated by turbidity assays and differential interference contrast (DIC) microscopy on four representative dipeptides (WW, QW, GF, and VI). Phase coexistence diagrams are constructed to explore the temperature dependence of LLPS. Our results reveal that aromatic moieties are crucial for a dipeptide to undergo LLPS, and hydrophobic and polar components are indispensable. We demonstrate for the first time that dipeptides, minimal but complete, possess multivalent interactions sufficient for LLPS, suggesting that LLPS is a general property of peptides/proteins, independent of their sequence length. This study provides a computational and experimental approach to the prediction and characterization of the phase behavior of minimalistic peptides, and will be helpful for understanding the sequence-dependence and molecular mechanism of protein phase separation.SignificanceProtein liquid-liquid phase separation (LLPS) is associated with human health and diseases. Identifying the minimalistic building blocks and sequence determinants of LLPS is of urgent importance but remains computationally challenging partially due to the lack of methodologies characterizing the liquid condensates. Herein we provide approaches to evaluate LLPS ability of dipeptides, and screen all 400 dipeptides by MD simulations combined with multi-bead-per-residue models which capture key interactions driving LLPS that are missing in one-bead-per-residue models. Three categories of LLPS dipeptides are identified and the experimentally-verified QW dipeptide is by far the smallest LLPS system. Our results suggest that dipeptides, minimal but complete, possess multivalent interactions sufficient for LLPS, and LLPS is a general property of peptides/proteins, independent of their length.


Author(s):  
T. M. Perdikari ◽  
N. Jovic ◽  
G. L. Dignon ◽  
Y. C. Kim ◽  
N. L. Fawzi ◽  
...  

AbstractBiomolecules undergo liquid-liquid phase separation (LLPS) resulting in the formation of multicomponent protein-RNA membraneless organelles in cells. However, the physiological and pathological role of post translational modifications (PTMs) on the biophysics of phase behavior is only beginning to be probed. To study the effect of PTMs on LLPS in silico, we extend our transferable coarse-grained model of intrinsically disordered proteins to include phosphorylated and acetylated amino acids. Using the parameters for modified amino acids available for fixed charge atomistic forcefields, we parameterize the size and atomistic hydropathy of the coarse-grained modified amino acid beads, and hence the interactions between the modified and natural amino acids. We then elucidate how the number and position of phosphorylated and acetylated residues alter the protein’s single chain compactness and its propensity to phase separate. We show that both the number and the position of phosphorylated threonines/serines or acetylated lysines can serve as a molecular on/off switch for phase separation in the well-studied disordered regions of FUS and DDX3X, respectively. We also compare modified residues to their commonly used PTM mimics for their impact on chain properties. Importantly, we show that the model can predict and capture experimentally measured differences in the phase behavior for position-specific modifications, showing that the position of modifications can dictate phase separation. In sum, this model will be useful for studying LLPS of post-translationally modified intrinsically disordered proteins and predicting how modifications control phase behavior with position-specific resolution.Statement of SignificancePost-translational modifications are important regulators of liquid-liquid phase separation (LLPS) which drives the formation of biomolecular condensates. Theoretical methods can be used to characterize the biophysical properties of intrinsically disordered proteins (IDPs). Our recent framework for molecular simulations using a Cα-centered coarse-grained model can predict the effect of various perturbations such as mutations (Dignon et al. PloS Comput. Biol, 2018) and temperature (Dignon et al, ACS Cent. Sci., 2019) on LLPS. Here, we expand this framework to incorporate modified residues like phosphothreonine, phosphoserine and acetylysine. This model will prove useful for simulating the phase separation of post-translationally modified IDPs and predicting how position-specific modifications can control phase behavior across the large family of proteins known to be phosphorylated and acetylated.


Sign in / Sign up

Export Citation Format

Share Document