A de novo protein structure prediction by iterative partition sampling, topology adjustment, and residue-level distance deviation optimization

Bioinformatics ◽

10.1093/bioinformatics/btab620 ◽

2021 ◽

Author(s):

Jun Liu ◽

Kai-Long Zhao ◽

Guang-Xing He ◽

Liu-Jing Wang ◽

Xiao-Gen Zhou ◽

...

Keyword(s):

Protein Structure ◽

Dihedral Angle ◽

Structure Prediction ◽

Prediction Accuracy ◽

Closed Loop ◽

De Novo ◽

Residue Level ◽

Fragment Assembly ◽

Assembly Method ◽

Distance Deviation

Motivation: With the great progress of deep learning-based inter-residue contact/distance prediction, the discrete space formed by fragment assembly cannot satisfy the distance constraint well. Thus, the optimal solution of the continuous space may not be achieved. Designing an effective closed-loop continuous dihedral angle optimization strategy that complements the discrete fragment assembly is crucial to improve the performance of the distance-assisted fragment assembly method. Results: In this article, we propose a de novo protein structure prediction method called IPTDFold based on closed-loop iterative partition sampling, topology adjustment and residue-level distance deviation optimization. First, local dihedral angle crossover and mutation operators are designed to explore the conformational space extensively and achieve information exchange between the conformations in the population. Then, a dihedral angle rotation model of the loop region with partial inter-residue distance constraints is constructed, and the rotation angle satisfying the constraints is obtained by a differential evolution algorithm, so as to adjust the spatial position relationship between the secondary structures. Lastly, the residue distance deviation is evaluated according to the difference between the conformation and the predicted distance, and the dihedral angle of the residue is optimized with biased probability. The final model is generated by iterating the above three steps. IPTDFold is tested on 462 benchmark proteins, 24 FM targets of CASP13, and 20 FM targets of CASP14. Results show that IPTDFold is significantly superior to the distance-assisted fragment assembly method Rosetta_D (Rosetta with distance). In particular, the prediction accuracy of IPTDFold does not decrease as the length of the protein increases. When using the same FastRelax protocol, the prediction accuracy of IPTDFold is significantly superior to that of trRosetta without orientation constraints, and is equivalent to that of the full version of trRosetta. Availability: The source code and executable are freely available at https://github.com/iobio-zjut/IPTDFold. Supplementary information: Supplementary data are available at Bioinformatics online.
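The residue-level distance deviation step described in this abstract can be illustrated with a minimal sketch. It is not the IPTDFold implementation: the function names, the use of one coordinate per residue, the mean absolute error as the deviation measure, and the proportional weighting are all assumptions made for illustration.

```python
import math
import random


def residue_deviations(coords, predicted):
    """Mean absolute distance error per residue (illustrative measure).

    coords: list of (x, y, z) tuples, one point per residue (assumed input).
    predicted: dict mapping (i, j) residue index pairs to a predicted distance.
    """
    totals = [0.0] * len(coords)
    counts = [0] * len(coords)
    for (i, j), d_pred in predicted.items():
        err = abs(math.dist(coords[i], coords[j]) - d_pred)
        # Charge the error of a pair to both residues involved.
        for k in (i, j):
            totals[k] += err
            counts[k] += 1
    return [t / c if c else 0.0 for t, c in zip(totals, counts)]


def selection_probabilities(deviations, floor=1e-6):
    """Turn deviations into sampling probabilities (assumed proportional rule).

    The small floor keeps every residue selectable, even with zero deviation.
    """
    weights = [d + floor for d in deviations]
    total = sum(weights)
    return [w / total for w in weights]


def pick_residue(deviations, rng=random):
    """Sample one residue index, biased toward large deviations."""
    probs = selection_probabilities(deviations)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

A residue chosen this way would then have its dihedral angles perturbed; how the perturbation is drawn and accepted is not specified in the abstract and is left out here.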


A de novo protein structure prediction by iterative partition sampling, topology adjustment, and residue-level distance deviation optimization

10.1101/2021.05.12.443769 ◽

2021 ◽

Author(s):

Jun Liu ◽

Kailong Zhao ◽

Guangxing He ◽

Liujing Wang ◽

Xiaogen Zhou ◽

...

Keyword(s):

Protein Structure ◽

Dihedral Angle ◽

Structure Prediction ◽

Prediction Accuracy ◽

Closed Loop ◽

De Novo ◽

Residue Level ◽

Fragment Assembly ◽

Assembly Method ◽

Distance Deviation

Motivation: With the great progress of deep learning-based inter-residue contact/distance prediction, the discrete space formed by fragment assembly cannot satisfy the distance constraint well. Thus, the optimal solution of the continuous space may not be achieved. Designing an effective closed-loop continuous dihedral angle optimization strategy that complements the discrete fragment assembly is crucial to improve the performance of the distance-assisted fragment assembly method. Results: In this article, we propose a de novo protein structure prediction method called IPTDFold based on closed-loop iterative partition sampling, topology adjustment and residue-level distance deviation optimization. First, local dihedral angle crossover and mutation operators are designed to explore the conformational space extensively and achieve information exchange between the conformations in the population. Then, a dihedral angle rotation model of the loop region with partial inter-residue distance constraints is constructed, and the rotation angle satisfying the constraints is obtained by a differential evolution algorithm, so as to adjust the spatial position relationship between the secondary structures. Lastly, the residue distance deviation is evaluated according to the difference between the conformation and the predicted distance, and the dihedral angle of the residue is optimized with biased probability. The final model is generated by iterating the above three steps. IPTDFold is tested on 462 benchmark proteins, 24 FM targets of CASP13, and 20 FM targets of CASP14. Results show that IPTDFold is significantly superior to the distance-assisted fragment assembly method Rosetta_D (Rosetta with distance). In particular, the prediction accuracy of IPTDFold does not decrease as the length of the protein increases. When using the same FastRelax protocol, the prediction accuracy of IPTDFold is significantly superior to that of trRosetta without orientation constraints, and is equivalent to that of the full version of trRosetta.
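The topology adjustment step relies on differential evolution to find rotation angles that satisfy distance constraints. The sketch below shows a generic DE/rand/1/bin loop on a toy problem; it is an assumption-laden illustration, not the authors' rotation model. The planar two-link arm, the objective `arm_deviation`, and all parameter values (`pop_size`, `f`, `cr`, `generations`) are hypothetical.

```python
import math
import random


def differential_evolution(objective, bounds, pop_size=20, f=0.5, cr=0.9,
                           generations=200, seed=0):
    """Minimize objective over box bounds with a basic DE/rand/1/bin scheme."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct donors, none equal to the target vector i.
            a, b, c = rng.sample([k for k in range(pop_size) if k != i], 3)
            forced = rng.randrange(dim)  # at least one gene comes from the mutant
            trial = []
            for d in range(dim):
                if rng.random() < cr or d == forced:
                    v = pop[a][d] + f * (pop[b][d] - pop[c][d])
                    lo, hi = bounds[d]
                    v = min(max(v, lo), hi)  # clamp to the search box
                else:
                    v = pop[i][d]
                trial.append(v)
            s = objective(trial)
            if s <= scores[i]:  # greedy one-to-one replacement
                pop[i], scores[i] = trial, s
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]


def arm_deviation(angles, target=1.0):
    """Toy stand-in for a distance constraint (hypothetical objective).

    Two unit-length links in the plane; the squared error is taken between
    the end-to-end distance and the target distance.
    """
    t1, t2 = angles
    x = math.cos(t1) + math.cos(t1 + t2)
    y = math.sin(t1) + math.sin(t1 + t2)
    return (math.hypot(x, y) - target) ** 2
```

Calling `differential_evolution(arm_deviation, [(-math.pi, math.pi)] * 2)` returns a pair of angles and the remaining squared deviation. In a real setting the objective would score loop dihedral angles against the predicted inter-residue distances.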


A reversible fragment assembly method for de novo protein structure prediction

The Journal of Chemical Physics ◽

10.1063/1.1597474 ◽

2003 ◽

Vol 119 (13) ◽

pp. 6895-6903 ◽

Cited By ~ 39

Author(s):

George Chikenji ◽

Yoshimi Fujitsuka ◽

Shoji Takada

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Fragment Assembly ◽

Assembly Method


The dual role of fragments in fragment-assembly methods for de novo protein structure prediction

Proteins Structure Function and Bioinformatics ◽

10.1002/prot.23215 ◽

2011 ◽

Vol 80 (2) ◽

pp. 490-504 ◽

Cited By ~ 32

Author(s):

Julia Handl ◽

Joshua Knowles ◽

Robert Vernon ◽

David Baker ◽

Simon C. Lovell

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Dual Role ◽

Fragment Assembly


De novo protein structure prediction by dynamic fragment assembly and conformational space annealing

Proteins Structure Function and Bioinformatics ◽

10.1002/prot.23059 ◽

2011 ◽

Vol 79 (8) ◽

pp. 2403-2417 ◽

Cited By ~ 42

Author(s):

Juyong Lee ◽

Jinhyuk Lee ◽

Takeshi N. Sasaki ◽

Masaki Sasai ◽

Chaok Seok ◽

...

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Conformational Space ◽

Fragment Assembly


Sequence Specific Dihedral Angle Distribution: Application in Protein Structure Prediction and Evaluation

Plant Tissue Culture and Biotechnology ◽

10.3329/ptcb.v19i2.5439 ◽

2009 ◽

Vol 19 (2) ◽

pp. 217-226

Author(s):

S. M. Minhaz Ud-Dean ◽

Mahdi Muhammad Moosa

Keyword(s):

Protein Structure ◽

Dihedral Angle ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Protein Structures ◽

Angle Distribution ◽

Ramachandran Plot ◽

Specific Data ◽

Specific Distribution ◽

Structure Evaluation

Protein structure prediction and evaluation is one of the major fields of computational biology. Estimation of dihedral angle can provide information about the acceptability of both theoretically predicted and experimentally determined structures. Here we report on the sequence specific dihedral angle distribution of high resolution protein structures available in PDB and have developed Sasichandran, a tool for sequence specific dihedral angle prediction and structure evaluation. This tool will allow evaluation of a protein structure in pdb format from the sequence specific distribution of Ramachandran angles. Additionally, it will allow retrieval of the most probable Ramachandran angles for a given sequence along with the sequence specific data.

Key words: Torsion angle, φ-ψ distribution, sequence specific ramachandran plot, Ramasekharan, protein structure appraisal

DOI: 10.3329/ptcb.v19i2.5439. Plant Tissue Cult. & Biotech. 19(2): 217-226, 2009 (December)
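To make the notion of a sequence-specific dihedral angle distribution concrete, the following sketch computes a dihedral angle from four points and tallies (φ, ψ) pairs per residue type on a coarse grid. It does not reproduce the tool described above; the function names, the 10-degree bin width, and the input layout are assumptions for illustration.

```python
import math
from collections import Counter


def _sub(u, v):
    return (u[0] - v[0], u[1] - v[1], u[2] - v[2])


def _dot(u, v):
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]


def _cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four points.

    Uses the atan2 form: 0 for a cis arrangement, 180 for trans.
    """
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def angle_bin(phi, psi, width=10):
    """Map a (phi, psi) pair in degrees to grid indices (assumed bin width)."""
    nbins = 360 // width
    return (int((phi + 180) // width) % nbins,
            int((psi + 180) // width) % nbins)


def tally(observations, width=10):
    """Count (residue, phi_bin, psi_bin) occurrences.

    observations: iterable of (residue_name, phi, psi) tuples (assumed layout).
    """
    counts = Counter()
    for residue, phi, psi in observations:
        counts[(residue, *angle_bin(phi, psi, width))] += 1
    return counts
```

For backbone angles, φ would be computed from the atoms C(i-1), N(i), CA(i), C(i) and ψ from N(i), CA(i), C(i), N(i+1); the most populated bin for a residue type then gives its most probable angle pair on this grid.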


Building a Better Fragment Library for De Novo Protein Structure Prediction

PLoS ONE ◽

10.1371/journal.pone.0123998 ◽

2015 ◽

Vol 10 (4) ◽

pp. e0123998 ◽

Cited By ~ 13

Author(s):

Saulo H. P. de Oliveira ◽

Jiye Shi ◽

Charlotte M. Deane

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Fragment Library


Generalized protein structure prediction based on combination of fold-recognition with de novo folding and evaluation of models

Proteins Structure Function and Bioinformatics ◽

10.1002/prot.20723 ◽

2005 ◽

Vol 61 (S7) ◽

pp. 84-90 ◽

Cited By ~ 72

Author(s):

Andrzej Koliński ◽

Janusz M. Bujnicki

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Fold Recognition ◽

De Novo Folding


De novo protein structure prediction using ultra-fast molecular dynamics simulation

PLoS ONE ◽

10.1371/journal.pone.0205819 ◽

2018 ◽

Vol 13 (11) ◽

pp. e0205819 ◽

Cited By ~ 6

Author(s):

Ngaam J. Cheung ◽

Wookyung Yu

Keyword(s):

Molecular Dynamics ◽

Protein Structure ◽

Molecular Dynamics Simulation ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Dynamics Simulation


Sampling Bottlenecks in De novo Protein Structure Prediction

Journal of Molecular Biology ◽

10.1016/j.jmb.2009.07.063 ◽

2009 ◽

Vol 393 (1) ◽

pp. 249-260 ◽

Cited By ~ 68

Author(s):

David E. Kim ◽

Ben Blum ◽

Philip Bradley ◽

David Baker

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo


Protein structure prediction and design in a biologically-realistic implicit membrane

10.1101/630715 ◽

2019 ◽

Author(s):

Rebecca F. Alford ◽

Patrick J. Fleming ◽

Karen G. Fleming ◽

Jeffrey J. Gray

Keyword(s):

Protein Structure ◽

Amino Acid ◽

Membrane Proteins ◽

Membrane Protein ◽

Protein Structure Prediction ◽

Protein Design ◽

Structure Prediction ◽

De Novo ◽

Computational Design ◽

Amino Acid Distribution

Abstract: Protein design is a powerful tool for elucidating mechanisms of function and engineering new therapeutics and nanotechnologies. While soluble protein design has advanced, membrane protein design remains challenging due to difficulties in modeling the lipid bilayer. In this work, we developed an implicit approach that captures the anisotropic structure, shape of water-filled pores, and nanoscale dimensions of membranes with different lipid compositions. The model improves performance in computational benchmarks against experimental targets including prediction of protein orientations in the bilayer, ΔΔG calculations, native structure discrimination, and native sequence recovery. When applied to de novo protein design, this approach designs sequences with an amino acid distribution near the native amino acid distribution in membrane proteins, overcoming a critical flaw in previous membrane models that were prone to generating leucine-rich designs. Further, the proteins designed in the new membrane model exhibit native-like features including interfacial aromatic side chains, hydrophobic lengths compatible with bilayer thickness, and polar pores. Our method advances high-resolution membrane protein structure prediction and design toward tackling key biological questions and engineering challenges.

Significance Statement: Membrane proteins participate in many life processes including transport, signaling, and catalysis. They constitute over 30% of all proteins and are targets for over 60% of pharmaceuticals. Computational design tools for membrane proteins will transform the interrogation of basic science questions such as membrane protein thermodynamics and the pipeline for engineering new therapeutics and nanotechnologies. Existing tools are either too expensive to compute or rely on manual design strategies. In this work, we developed a fast and accurate method for membrane protein design. The tool is available to the public and will accelerate the experimental design pipeline for membrane proteins.
