How Good Are Simplified Models for Protein Structure Prediction?

Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the complexity of the all-atomic details and the unknown nature of the energy function. Researchers have therefore used simplified energy models that consider interaction potentials only between the amino acid monomers in contact on discrete lattices. The restricted nature of the lattices and the energy models poses a twofold concern regarding the assessment of the models. Can a native or a very close structure be obtained when structures are mapped to lattices? Can the contact based energy models on discrete lattices guide the search towards the native structures? In this paper, we use the protein chain lattice fitting (PCLF) problem to address the first concern; we developed a constraint-based local search algorithm for the PCLF problem for cubic and face-centered cubic lattices and found very close lattice fits for the native structures. For the second concern, we use a number of techniques to sample the conformation space and find correlations between energy functions and root mean square deviation (RMSD) distance of the lattice-based structures with the native structures. Our analysis reveals weakness of several contact based energy models used that are popular in PSP.

Download Full-text

A Hybrid Evolutionary Algorithm for Protein Structure Prediction Using the Face-Centered Cubic Lattice Model

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-70087-8_65 ◽

2017 ◽

pp. 628-638 ◽

Cited By ~ 2

Author(s):

Daniel Varela ◽

José Santos

Keyword(s):

Protein Structure ◽

Evolutionary Algorithm ◽

Protein Structure Prediction ◽

Lattice Model ◽

Structure Prediction ◽

Face Centered Cubic ◽

Cubic Lattice ◽

Hybrid Evolutionary Algorithm ◽

The Face ◽

Face Centered

Download Full-text

CONSTRAINT-BASED HYDROPHOBIC CORE CONSTRUCTION FOR PROTEIN STRUCTURE PREDICTION IN THE FACE-CENTERED-CUBIC LATTICE

Biocomputing 2002 ◽

10.1142/9789812799623_0061 ◽

2001 ◽

Author(s):

SEBASTIAN WILL

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Face Centered Cubic ◽

Hydrophobic Core ◽

Cubic Lattice ◽

The Face ◽

Face Centered

Download Full-text

A Sequential Niche Multimodal Conformation Sampling Algorithm for Protein Structure Prediction

10.1101/2020.12.29.424663 ◽

2020 ◽

Author(s):

Yu-Hao Xia ◽

Chun-Xiang Peng ◽

Xiao-Gen Zhou ◽

Gui-Jun Zhang

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

High Energy ◽

Native Structure ◽

Energy Barriers ◽

Energy Functions ◽

Sampling Algorithm ◽

Energy Models ◽

Protein Energy

AbstractMotivationMassive local minima on the protein energy surface often causes traditional conformation sampling algorithms to be easily trapped in local basin regions, because they are difficult to stride over high-energy barriers. Also, the lowest energy conformation may not correspond to the native structure due to the inaccuracy of energy models. This study investigates whether these two problems can be alleviated by a sequential niche technique without loss of accuracy.ResultsA sequential niche multimodal conformation sampling algorithm for protein structure prediction (SNfold) is proposed in this study. In SNfold, a derating function is designed based on the knowledge learned from the previous sampling and used to construct a series of sampling-guided energy functions. These functions then help the sampling algorithm stride over high-energy barriers and avoid the re-sampling of the explored regions. In inaccurate protein energy models, the high- energy conformation that may correspond to the native structure can be sampled with successively updated sampling-guided energy functions. The proposed SNfold is tested on 300 benchmark proteins and 24 CASP13 FM targets. Results show that SNfold is comparable with Rosetta restrained by distance (Rosetta-dist) and C-QUARK. SNfold correctly folds (TM-score ≥ 0.5) 231 out of 300 proteins. In particular, compared with Rosetta-dist protocol, SNfold achieves higher average TM- score and improves the sampling efficiency by more than 100 times. On the 24 CASP13 FM targets, SNfold is also comparable with four state-of-the-art methods in the CASP13 server group. As a plugin conformation sampling algorithm, SNfold can be extended to other protein structure prediction methods.AvailabilityThe source code and executable versions are freely available at https://github.com/iobio-zjut/[email protected]

Download Full-text

Effective energy functions for protein structure prediction

Current Opinion in Structural Biology ◽

10.1016/s0959-440x(00)00063-4 ◽

2000 ◽

Vol 10 (2) ◽

pp. 139-145 ◽

Cited By ~ 290

Author(s):

T Lazaridis

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Effective Energy ◽

Energy Functions ◽

Effective Energy Functions

Download Full-text

Protein structure prediction with local adjust tabu search algorithm

BMC Bioinformatics ◽

10.1186/1471-2105-15-s15-s1 ◽

2014 ◽

Vol 15 (Suppl 15) ◽

pp. S1 ◽

Cited By ~ 16

Author(s):

Xiaoli Lin ◽

Xiaolong Zhang ◽

Fengli zhou

Keyword(s):

Protein Structure ◽

Tabu Search ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Search Algorithm ◽

Tabu Search Algorithm

Download Full-text

Comparing alternative energy functions for the HP model of protein structure prediction

2011 IEEE Congress of Evolutionary Computation (CEC) ◽

10.1109/cec.2011.5949902 ◽

2011 ◽

Cited By ~ 4

Author(s):

Mario Garza-Fabre ◽

Eduardo Rodriguez-Tello ◽

Gregorio Toscano-Pulido

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Alternative Energy ◽

Energy Functions ◽

Hp Model

Download Full-text

An Improved Harmony Search Algorithm for Protein Structure Prediction Using 3D Off-Lattice Model

Advances in Intelligent Systems and Computing - Harmony Search Algorithm ◽

10.1007/978-981-10-3728-3_30 ◽

2017 ◽

pp. 304-314 ◽

Cited By ~ 5

Author(s):

Nanda Dulal Jana ◽

Jaya Sil ◽

Swagatam Das

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Lattice Model ◽

Structure Prediction ◽

Search Algorithm ◽

Harmony Search ◽

Harmony Search Algorithm ◽

Improved Harmony Search ◽

Improved Harmony Search Algorithm

Download Full-text

A constraint solver for discrete lattices, its parallelization, and application to protein structure prediction

Software Practice and Experience ◽

10.1002/spe.810 ◽

2007 ◽

Vol 37 (13) ◽

pp. 1405-1449 ◽

Cited By ~ 14

Author(s):

Alessandro Dal Palù ◽

Agostino Dovier ◽

Enrico Pontelli

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Constraint Solver ◽

Discrete Lattices

Download Full-text

A Novel Framework for Ab Initio Coarse Protein Structure Prediction

Advances in Bioinformatics ◽

10.1155/2018/7607384 ◽

2018 ◽

Vol 2018 ◽

pp. 1-17 ◽

Cited By ~ 2

Author(s):

Sandhya Parasnath Dubey ◽

S. Balaji ◽

N. Gopalakrishna Kini ◽

M. Sathish Kumar

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Search Algorithm ◽

Search Space ◽

Population Based ◽

Hill Climbing ◽

Superior Performance ◽

Protein Database ◽

Fitness Value

Hydrophobic-Polar model is a simplified representation of Protein Structure Prediction (PSP) problem. However, even with the HP model, the PSP problem remains NP-complete. This work proposes a systematic and problem specific design for operators of the evolutionary program which hybrids with local search hill climbing, to efficiently explore the search space of PSP and thereby obtain an optimum conformation. The proposed algorithm achieves this by incorporating the following novel features: (i) new initialization method which generates only valid individuals with (rather than random) better fitness values; (ii) use of probability-based selection operators that limit the local convergence; (iii) use of secondary structure based mutation operator that makes the structure more closely to the laboratory determined structure; and (iv) incorporating all the above-mentioned features developed a complete two-tier framework. The developed framework builds the protein conformation on the square and triangular lattice. The test has been performed using benchmark sequences, and a comparative evaluation is done with various state-of-the-art algorithms. Moreover, in addition to hypothetical test sequences, we have tested protein sequences deposited in protein database repository. It has been observed that the proposed framework has shown superior performance regarding accuracy (fitness value) and speed (number of generations needed to attain the final conformation). The concepts used to enhance the performance are generic and can be used with any other population-based search algorithm such as genetic algorithm, ant colony optimization, and immune algorithm.

Download Full-text