scholarly journals Hydropathy patterning complements charge patterning to describe conformational preferences of disordered proteins

Author(s):  
Wenwei Zheng ◽  
Gregory Dignon ◽  
Matthew Brown ◽  
Young C. Kim ◽  
Jeetain Mittal

AbstractUnderstanding the conformational ensemble of an intrinsically disordered protein (IDP) is of great interest due to its relevance to critical intracellular functions and diseases. It is now well established that the polymer scaling behavior can provide a great deal of information about the conformational properties as well as liquid-liquid phase separation of an IDP. It is, therefore, extremely desirable to be able to predict an IDP’s scaling behavior from the protein sequence itself. The work in this direction so far has focused on highly charged proteins and how charge patterning can perturb their structural properties. As naturally occurring IDPs are composed of a significant fraction of uncharged amino acids, the rules based on charge content and patterning are only partially helpful in solving the problem. Here, we propose a new order parameter, sequence hydropathy decoration (SHD), which can provide a near quantitative understanding of scaling and structural properties of IDPs devoid of charged residues. We combine this with a charge patterning parameter, sequence charge decoration (SCD), to obtain a general equation, parameterized from extensive coarse-grained simulation data, for predicting protein dimensions from the sequence. We finally test this equation against available experimental data and find a semi-quantitative match in predicting the scaling behavior. We also provide guidance on how to extend this approach to experimental data, which should be feasible in the near future. Graphical TOC Entry

2021 ◽  
Author(s):  
Cecilia Chavez-Garcia ◽  
Jerome Henin ◽  
Mikko Karttunen

The malfunction of the Methyl CpG binding protein 2 (MeCP2) is associated to the Rett syndrome, one of the most common causes of cognitive impairment in females. MeCP2 is an intrinsically disordered protein (IDP), making its experimental characterization a challenge. There is currently no structure available for the full-length MeCP2 in any of the databases, and only the structure of its MBD domain has been solved. We used this structure to build a full-length model of MeCP2 by completing the rest of the protein via ab initio modelling. Using a combination of all-atom and coarse-grained simulations, we characterized its structure and dynamics as well as the conformational space sampled by the ID and TRD domains in the absence of the rest of the protein. The present work is the first computational study of the full-length protein. Two main conformations were sampled in the coarse-grained simulations: a globular structure similar to the one observed in the all-atom force field and a two-globule conformation. Our all-atom model is in good agreement with the available experimental data, predicting amino acid W104 to be buried, amino acids R111 and R133 to be solvent accessible, and having 4.1% of α-helix content, compared to the 4% found experimentally. Finally, we compared the model predicted by AlphaFold to our Modeller model. The model was not stable in water and underwent further folding. Together, these simulations provide a detailed (if perhaps incomplete) conformational ensemble of the full-length MeCP2, which is compatible with experimental data and can be the basis of further studies, e.g., on mutants of the protein or its interactions with its biological partners.


2021 ◽  
Author(s):  
Antonio B. Oliveira ◽  
Xingcheng Lin ◽  
Prakash Kulkarni ◽  
José N. Onuchic ◽  
Susmita Roy ◽  
...  

AbstractIntrinsically disordered proteins (IDPs) lack a rigid 3D structure and populate a polymorphic ensemble of conformations. Because of the lack of a reference conformation, their energy landscape representation in terms of reaction coordinates presents a daunting challenge. Here, our newly developed Energy Landscape Visualization Method (ELViM), a reaction coordinate-free approach, shows its prime application to explore frustrated energy landscapes of an intrinsically disordered protein, Prostate-Associated Gene 4 (PAGE4). PAGE4 is a transcriptional coactivator that potentiates the oncogene c-Jun. Two kinases, namely HIPK1 and CLK2, phosphorylate PAGE4 generating variants phosphorylated at different serine/threonine residues (HIPK1-PAGE4 and CLK2-PAGE4, respectively) with opposing functions. While HIPK1-PAGE4 predominantly phosphorylates Thr51 and potentiates c-Jun, CLK2-PAGE4 hyper-phosphorylates PAGE4 and attenuates transactivation. To understand the underlying mechanisms of conformational diversity among different phosphoforms, we have analyzed their atomistic trajectories simulated using AWSEM forcefield and the energy landscapes were elucidated using ELViM. This method allows us to identify and compare the population distributions of different conformational ensembles of PAGE4 phosphoforms using the same effective phase space. The results reveal a predominant conformational ensemble with an extended C-terminal segment of WT PAGE4, which exposes a functional residue Thr51, implying its potential of undertaking a fly-casting mechanism while binding to its cognate partner. In contrast, for HIPK1-PAGE4, a compact conformational ensemble enhances its population sequestering phosphorylated-Thr51. This clearly explains the experimentally observed weaker affinity of HIPK1-PAGE4 for c-Jun. ELViM appears as a powerful tool especially to analyze the highly-frustrated energy landscape representation of IDPs where appropriate reaction coordinates are hard to apprehend.


Author(s):  
Gregory-Neal W. Gomes ◽  
Mickaël Krzeminski ◽  
Ashley Namini ◽  
Erik. W. Martin ◽  
Tanja Mittag ◽  
...  

AbstractIntrinsically disordered proteins (IDPs) have fluctuating heterogeneous conformations, which makes structural characterization challenging, but of great interest, since their conformational ensembles are the link between their sequences and functions. An accurate description of IDP conformational ensembles depends crucially on the amount and quality of the experimental data, how it is integrated, and if it supports a consistent structural picture. We have used an integrative modelling approach to understand how conformational restraints imposed by the most common structural techniques for IDPs: Nuclear Magnetic Resonance (NMR) spectroscopy, Small-angle X-ray Scattering (SAXS), and single-molecule Förster Resonance Energy Transfer (smFRET) reach concordance on structural ensembles for Sic1 and phosphorylated Sic1 (pSic1). To resolve apparent discrepancies between smFRET and SAXS, we integrated SAXS data with non-smFRET (NMR) data and reserved the new smFRET data for Sic1 and pSic1 as an independent validation. The consistency of the SAXS/NMR restrained ensembles with smFRET, which was not guaranteed a priori, indicates that the perturbative effects of NMR or smFRET labels on the Sic1 and pSic1 ensembles are minimal. Furthermore, the mutual agreement with such a diverse set of experimental data suggest that details of the generated ensembles can now be examined with a high degree of confidence to reveal distinguishing features of Sic1 vs. pSic1. From the experimentally well supported ensembles, we find they are consistent with independent biophysical models of Sic1’s ultrasensitive binding to its partner Cdc4. Our results underscore the importance of integrative modelling in calculating and drawing biological conclusions from IDP conformational ensembles.


Entropy ◽  
2019 ◽  
Vol 21 (7) ◽  
pp. 662 ◽  
Author(s):  
Iva Pritišanac ◽  
Robert Vernon ◽  
Alan Moses ◽  
Julie Forman Kay

Bioinformatics and biophysical studies of intrinsically disordered proteins and regions (IDRs) note the high entropy at individual sequence positions and in conformations sampled in solution. This prevents application of the canonical sequence-structure-function paradigm to IDRs and motivates the development of new methods to extract information from IDR sequences. We argue that the information in IDR sequences cannot be fully revealed through positional conservation, which largely measures stable structural contacts and interaction motifs. Instead, considerations of evolutionary conservation of molecular features can reveal the full extent of information in IDRs. Experimental quantification of the large conformational entropy of IDRs is challenging but can be approximated through the extent of conformational sampling measured by a combination of NMR spectroscopy and lower-resolution structural biology techniques, which can be further interpreted with simulations. Conformational entropy and other biophysical features can be modulated by post-translational modifications that provide functional advantages to IDRs by tuning their energy landscapes and enabling a variety of functional interactions and modes of regulation. The diverse mosaic of functional states of IDRs and their conformational features within complexes demands novel metrics of information, which will reflect the complicated sequence-conformational ensemble-function relationship of IDRs.


2019 ◽  
Author(s):  
Mustapha Carab Ahmed ◽  
Ramon Crehuet ◽  
Kresten Lindorff-Larsen

AbstractThe level of compaction of an intrinsically disordered protein may affect both its physical and biological properties, and can be probed via different types of biophysical experiments. Small-angle X-ray scattering (SAXS) probe the radius of gyration (Rg) whereas pulsed-field-gradient nuclear magnetic resonance (NMR) diffusion, fluorescence correlation spectroscopy and dynamic light scattering experiments can be used to determine the hydrodynamic radius (Rh). Here we show how to calculate Rg and Rh from a computationally-generated conformational ensemble of an intrinsically disordered protein. We further describe how to use a Bayesian/Maximum Entropy procedure to integrate data from SAXS and NMR diffusion experiments, so as to derive conformational ensembles in agreement with those experiments.


2020 ◽  
Author(s):  
Yani Zhao ◽  
Robinson Cortes-Huerto ◽  
Kurt Kremer ◽  
Joseph F. Rudzinski

Intrinsically disordered proteins (IDPs) play an important role in an array of biological processes but present a number of fundamental challenges for computational modeling. Recently, simple polymer models have re-gained popularity for interpreting the experimental characterization of IDPs. Homopolymer theory provides a strong foundation for understanding generic features of phenomena ranging from single-chain conformational dynamics to the properties of entangled polymer melts, but is difficult to extend to the copolymer context. This challenge is magnified for proteins due to the variety of competing interactions and large deviations in side-chain properties. In this work, we apply a simple physics-based coarse-grained model for describing largely disordered conformational ensembles of peptides, based on the premise that sampling sterically-forbidden conformations can compromise the faithful description of both static and dynamical properties. The Hamiltonian of the employed model can be easily adjusted to investigate the impact of distinct interactions and sequence specificity on the randomness of the resulting conformational ensemble. In particular, starting with a bead-spring-like model and then adding more detailed interactions one by one, we construct a hierarchical set of models and perform a detailed comparison of their properties. Our analysis clarifies the role of generic attractions, electrostatics and side-chain sterics, while providing a foundation for developing efficient models for IDPs that retain an accurate description of the hierarchy of conformational dynamics, which is nontrivially influenced by interactions with surrounding proteins and solvent molecules.


2019 ◽  
Vol 116 (41) ◽  
pp. 20446-20452 ◽  
Author(s):  
Utsab R. Shrestha ◽  
Puneet Juneja ◽  
Qiu Zhang ◽  
Viswanathan Gurumoorthy ◽  
Jose M. Borreguero ◽  
...  

Intrinsically disordered proteins (IDPs) are abundant in eukaryotic proteomes, play a major role in cell signaling, and are associated with human diseases. To understand IDP function it is critical to determine their configurational ensemble, i.e., the collection of 3-dimensional structures they adopt, and this remains an immense challenge in structural biology. Attempts to determine this ensemble computationally have been hitherto hampered by the necessity of reweighting molecular dynamics (MD) results or biasing simulation in order to match ensemble-averaged experimental observables, operations that reduce the precision of the generated model because different structural ensembles may yield the same experimental observable. Here, by employing enhanced sampling MD we reproduce the experimental small-angle neutron and X-ray scattering profiles and the NMR chemical shifts of the disordered N terminal (SH4UD) of c-Src kinase without reweighting or constraining the simulations. The unbiased simulation results reveal a weakly funneled and rugged free energy landscape of SH4UD, which gives rise to a heterogeneous ensemble of structures that cannot be described by simple polymer theory. SH4UD adopts transient helices, which are found away from known phosphorylation sites and could play a key role in the stabilization of structural regions necessary for phosphorylation. Our findings indicate that adequately sampled molecular simulations can be performed to provide accurate physical models of flexible biosystems, thus rationalizing their biological function.


Author(s):  
Bin Chong ◽  
Yingguang Yang ◽  
Zi-Le Wang ◽  
Han Xing ◽  
Zhirong Liu

Intrinsically disordered proteins (IDPs) widely involve in human diseases and are thus attractive therapeutic targets. In practice, however, it is computationally prohibitive to dock large ligand libraries to thousands and...


Sign in / Sign up

Export Citation Format

Share Document