scholarly journals A Sequential Niche Multimodal Conformation Sampling Algorithm for Protein Structure Prediction

2020 ◽  
Author(s):  
Yu-Hao Xia ◽  
Chun-Xiang Peng ◽  
Xiao-Gen Zhou ◽  
Gui-Jun Zhang

AbstractMotivationMassive local minima on the protein energy surface often causes traditional conformation sampling algorithms to be easily trapped in local basin regions, because they are difficult to stride over high-energy barriers. Also, the lowest energy conformation may not correspond to the native structure due to the inaccuracy of energy models. This study investigates whether these two problems can be alleviated by a sequential niche technique without loss of accuracy.ResultsA sequential niche multimodal conformation sampling algorithm for protein structure prediction (SNfold) is proposed in this study. In SNfold, a derating function is designed based on the knowledge learned from the previous sampling and used to construct a series of sampling-guided energy functions. These functions then help the sampling algorithm stride over high-energy barriers and avoid the re-sampling of the explored regions. In inaccurate protein energy models, the high- energy conformation that may correspond to the native structure can be sampled with successively updated sampling-guided energy functions. The proposed SNfold is tested on 300 benchmark proteins and 24 CASP13 FM targets. Results show that SNfold is comparable with Rosetta restrained by distance (Rosetta-dist) and C-QUARK. SNfold correctly folds (TM-score ≥ 0.5) 231 out of 300 proteins. In particular, compared with Rosetta-dist protocol, SNfold achieves higher average TM- score and improves the sampling efficiency by more than 100 times. On the 24 CASP13 FM targets, SNfold is also comparable with four state-of-the-art methods in the CASP13 server group. As a plugin conformation sampling algorithm, SNfold can be extended to other protein structure prediction methods.AvailabilityThe source code and executable versions are freely available at https://github.com/iobio-zjut/[email protected]

2014 ◽  
Vol 2014 ◽  
pp. 1-9 ◽  
Author(s):  
Swakkhar Shatabda ◽  
M. A. Hakim Newton ◽  
Mahmood A. Rashid ◽  
Duc Nghia Pham ◽  
Abdul Sattar

Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the complexity of the all-atomic details and the unknown nature of the energy function. Researchers have therefore used simplified energy models that consider interaction potentials only between the amino acid monomers in contact on discrete lattices. The restricted nature of the lattices and the energy models poses a twofold concern regarding the assessment of the models. Can a native or a very close structure be obtained when structures are mapped to lattices? Can the contact based energy models on discrete lattices guide the search towards the native structures? In this paper, we use the protein chain lattice fitting (PCLF) problem to address the first concern; we developed a constraint-based local search algorithm for the PCLF problem for cubic and face-centered cubic lattices and found very close lattice fits for the native structures. For the second concern, we use a number of techniques to sample the conformation space and find correlations between energy functions and root mean square deviation (RMSD) distance of the lattice-based structures with the native structures. Our analysis reveals weakness of several contact based energy models used that are popular in PSP.


2003 ◽  
Vol 53 (1) ◽  
pp. 76-87 ◽  
Author(s):  
Jerry Tsai ◽  
Richard Bonneau ◽  
Alexandre V. Morozov ◽  
Brian Kuhlman ◽  
Carol A. Rohl ◽  
...  

2014 ◽  
Vol 2014 ◽  
pp. 1-17
Author(s):  
Mahmood A. Rashid ◽  
Swakkhar Shatabda ◽  
M. A. Hakim Newton ◽  
Md Tamjidul Hoque ◽  
Abdul Sattar

Protein structure prediction is computationally a very challenging problem. A large number of existing search algorithms attempt to solve the problem by exploring possible structures and finding the one with the minimum free energy. However, these algorithms perform poorly on large sized proteins due to an astronomically wide search space. In this paper, we present a multipoint spiral search framework that uses parallel processing techniques to expedite exploration by starting from different points. In our approach, a set of random initial solutions are generated and distributed to different threads. We allow each thread to run for a predefined period of time. The improved solutions are stored threadwise. When the threads finish, the solutions are merged together and the duplicates are removed. A selected distinct set of solutions are then split to different threads again. In our ab initio protein structure prediction method, we use the three-dimensional face-centred-cubic lattice for structure-backbone mapping. We use both the low resolution hydrophobic-polar energy model and the high-resolution 20×20 energy model for search guiding. The experimental results show that our new parallel framework significantly improves the results obtained by the state-of-the-art single-point search approaches for both energy models on three-dimensional face-centred-cubic lattice. We also experimentally show the effectiveness of mixing energy models within parallel threads.


2010 ◽  
Vol 18 (2) ◽  
pp. 255-275 ◽  
Author(s):  
Milan Mijajlovic ◽  
Mark J. Biggs ◽  
Dusan P. Djurdjevic

Ab initio protein structure prediction involves determination of the three-dimensional (3D) conformation of proteins on the basis of their amino acid sequence, a potential energy (PE) model that captures the physics of the interatomic interactions, and a method to search for and identify the global minimum in the PE (or free energy) surface such as an evolutionary algorithm (EA). Many PE models have been proposed over the past three decades and more. There is currently no understanding of how the behavior of an EA is affected by the PE model used. The study reported here shows that the EA behavior can be profoundly affected: the EA performance obtained when using the ECEPP PE model is significantly worse than that obtained when using the Amber, OPLS, and CVFF PE models, and the optimal EA control parameter values for the ECEPP model also differ significantly from those associated with the other models.


Sign in / Sign up

Export Citation Format

Share Document