Protein structure prediction using sparse NOE and RDC restraints with Rosetta in CASP13

Mapping Intimacies ◽

10.1101/597724 ◽

2019 ◽

Author(s):

Georg Kuenze ◽

Jens Meiler

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Dipolar Coupling ◽

Nuclear Overhauser Effect ◽

Residual Dipolar Coupling ◽

Comparative Modeling ◽

Model Quality ◽

Blind Test

AbstractComputational methods that produce accurate protein structure models from limited experimental data, e.g. from nuclear magnetic resonance (NMR) spectroscopy, hold great potential for biomedical research. The NMR-assisted modeling challenge in CASP13 provided a blind test to explore the capabilities and limitations of current modeling techniques in leveraging NMR data which had high sparsity, ambiguity and error rate for protein structure prediction. We describe our approach to predict the structure of these proteins leveraging the Rosetta software suite. Protein structure models were predictedde novousing a two-stage protocol. First, low-resolution models were generated with the Rosettade novomethod guided by non-ambiguous nuclear Overhauser effect (NOE) contacts and residual dipolar coupling (RDC) restraints. Second, iterative model hybridization and fragment insertion with the Rosetta comparative modeling method was used to refine and regularize models guided by all ambiguous and non-ambiguous NOE contacts and RDCs. Nine out of 16 of the Rosettade novomodels had the correct fold (GDT-TS score >45) and in three cases high-resolution models were achieved (RMSD <3.5 Å). We also show that a meta-approach applying iterative Rosetta+NMR refinement on server-predicted models which employed non-NMR-contacts and structural templates leads to substantial improvement in model quality. Integrating these data-assisted refinement strategies with innovative non-data-assisted approaches which became possible in CASP13 such as high precision contact prediction will in the near future enable structure determination for large proteins that are outside of the realm of conventional NMR.

Download Full-text

State-of-the-art web services for de novo protein structure prediction

Briefings in Bioinformatics ◽

10.1093/bib/bbaa139 ◽

2020 ◽

Cited By ~ 1

Author(s):

Luciano A Abriata ◽

Matteo Dal Peraro

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Tertiary Structure ◽

De Novo ◽

State Of The Art ◽

Data Bank ◽

End Users ◽

Model Quality ◽

Uncharacterized Protein

Abstract Residue coevolution estimations coupled to machine learning methods are revolutionizing the ability of protein structure prediction approaches to model proteins that lack clear homologous templates in the Protein Data Bank (PDB). This has been patent in the last round of the Critical Assessment of Structure Prediction (CASP), which presented several very good models for the hardest targets. Unfortunately, literature reporting on these advances often lacks digests tailored to lay end users; moreover, some of the top-ranking predictors do not provide webservers that can be used by nonexperts. How can then end users benefit from these advances and correctly interpret the predicted models? Here we review the web resources that biologists can use today to take advantage of these state-of-the-art methods in their research, including not only the best de novo modeling servers but also datasets of models precomputed by experts for structurally uncharacterized protein families. We highlight their features, advantages and pitfalls for predicting structures of proteins without clear templates. We present a broad number of applications that span from driving forward biochemical investigations that lack experimental structures to actually assisting experimental structure determination in X-ray diffraction, cryo-EM and other forms of integrative modeling. We also discuss issues that must be considered by users yet still require further developments, such as global and residue-wise model quality estimates and sources of residue coevolution other than monomeric tertiary structure.

Download Full-text

De novo protein structure prediction by incremental inter-residue geometries prediction and model quality assessment using deep learning

10.1101/2022.01.11.475831 ◽

2022 ◽

Author(s):

Jun Liu ◽

Guangxing He ◽

Kailong Zhao ◽

Guijun Zhang

Keyword(s):

Protein Structure ◽

Quality Assessment ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Feedback Mechanism ◽

Geometric Constraints ◽

Model Quality ◽

Model Quality Assessment ◽

Loop Feedback

Motivation: The successful application of deep learning has promoted progress in protein model quality assessment. How to use model quality assessment to further improve the accuracy of protein structure prediction, especially not reliant on the existing templates, is helpful for unraveling the folding mechanism. Here, we investigate whether model quality assessment can be introduced into structure prediction to form a closed-loop feedback, and iteratively improve the accuracy of de novo protein structure prediction. Results: In this study, we propose a de novo protein structure prediction method called RocketX. In RocketX, a feedback mechanism is constructed through the geometric constraint prediction network GeomNet, the structural simulation module, and the model quality evaluation network EmaNet. In GeomNet, the co-evolutionary features extracted from MSA that search from the sequence databases are sent to an improved residual neural network to predict the inter-residue geometric constraints. The structure model is folded based on the predicted geometric constraints. In EmaNet, the 1D and 2D features are extracted from the folded model and sent to the deep residual neural network to estimate the inter-residue distance deviation and per-residue lDDT of the model, which will be fed back to GeomNet as dynamic features to correct the geometries prediction and progressively improve model accuracy. RocketX is tested on 483 benchmark proteins and 20 FM targets of CASP14. Experimental results show that the closed-loop feedback mechanism significantly contributes to the performance of RocketX, and the prediction accuracy of RocketX outperforms that of the state-of-the-art methods trRosetta (without templates) and RaptorX. In addition, the blind test results on CAMEO show that although no template is used, the prediction accuracy of RocketX on medium and hard targets is comparable to the advanced methods that integrate templates.

Download Full-text

Protein structure prediction using sparse dipolar coupling data

Nucleic Acids Research ◽

10.1093/nar/gkh204 ◽

2004 ◽

Vol 32 (2) ◽

pp. 551-561 ◽

Cited By ~ 13

Author(s):

Y. Qu

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Dipolar Coupling

Download Full-text

Building a Better Fragment Library for De Novo Protein Structure Prediction

PLoS ONE ◽

10.1371/journal.pone.0123998 ◽

2015 ◽

Vol 10 (4) ◽

pp. e0123998 ◽

Cited By ~ 13

Author(s):

Saulo H. P. de Oliveira ◽

Jiye Shi ◽

Charlotte M. Deane

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Fragment Library

Download Full-text

Generalized protein structure prediction based on combination of fold-recognition with de novo folding and evaluation of models

Proteins Structure Function and Bioinformatics ◽

10.1002/prot.20723 ◽

2005 ◽

Vol 61 (S7) ◽

pp. 84-90 ◽

Cited By ~ 72

Author(s):

Andrzej Koliński ◽

Janusz M. Bujnicki

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Fold Recognition ◽

De Novo Folding

Download Full-text

De novo protein structure prediction using ultra-fast molecular dynamics simulation

PLoS ONE ◽

10.1371/journal.pone.0205819 ◽

2018 ◽

Vol 13 (11) ◽

pp. e0205819 ◽

Cited By ~ 6

Author(s):

Ngaam J. Cheung ◽

Wookyung Yu

Keyword(s):

Molecular Dynamics ◽

Protein Structure ◽

Molecular Dynamics Simulation ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Dynamics Simulation

Download Full-text

Sampling Bottlenecks in De novo Protein Structure Prediction

Journal of Molecular Biology ◽

10.1016/j.jmb.2009.07.063 ◽

2009 ◽

Vol 393 (1) ◽

pp. 249-260 ◽

Cited By ~ 68

Author(s):

David E. Kim ◽

Ben Blum ◽

Philip Bradley ◽

David Baker

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo

Download Full-text

Protein structure prediction and design in a biologically-realistic implicit membrane

10.1101/630715 ◽

2019 ◽

Author(s):

Rebecca F. Alford ◽

Patrick J. Fleming ◽

Karen G. Fleming ◽

Jeffrey J. Gray

Keyword(s):

Protein Structure ◽

Amino Acid ◽

Membrane Proteins ◽

Membrane Protein ◽

Protein Structure Prediction ◽

Protein Design ◽

Structure Prediction ◽

De Novo ◽

Computational Design ◽

Amino Acid Distribution

ABSTRACTProtein design is a powerful tool for elucidating mechanisms of function and engineering new therapeutics and nanotechnologies. While soluble protein design has advanced, membrane protein design remains challenging due to difficulties in modeling the lipid bilayer. In this work, we developed an implicit approach that captures the anisotropic structure, shape of water-filled pores, and nanoscale dimensions of membranes with different lipid compositions. The model improves performance in computational bench-marks against experimental targets including prediction of protein orientations in the bilayer, ΔΔG calculations, native structure dis-crimination, and native sequence recovery. When applied to de novo protein design, this approach designs sequences with an amino acid distribution near the native amino acid distribution in membrane proteins, overcoming a critical flaw in previous membrane models that were prone to generating leucine-rich designs. Further, the proteins designed in the new membrane model exhibit native-like features including interfacial aromatic side chains, hydrophobic lengths compatible with bilayer thickness, and polar pores. Our method advances high-resolution membrane protein structure prediction and design toward tackling key biological questions and engineering challenges.Significance StatementMembrane proteins participate in many life processes including transport, signaling, and catalysis. They constitute over 30% of all proteins and are targets for over 60% of pharmaceuticals. Computational design tools for membrane proteins will transform the interrogation of basic science questions such as membrane protein thermodynamics and the pipeline for engineering new therapeutics and nanotechnologies. Existing tools are either too expensive to compute or rely on manual design strategies. In this work, we developed a fast and accurate method for membrane protein design. The tool is available to the public and will accelerate the experimental design pipeline for membrane proteins.

Download Full-text