A single-model quality assessment method for poor quality protein structure

Abstract Quality assessment of protein tertiary structure prediction models, in which structures of the best quality are selected from decoys, is a major challenge in protein structure prediction, and is crucial to determine a model’s utility and potential applications. Estimating the quality of a single model predicts the model’s quality based on the single model itself. In general, the Pearson correlation value of the quality assessment method increases in tandem with an increase in the quality of the model pool. However, there is no consensus regarding the best method to select a few good models from the poor quality model pool. In this work, we introduce a novel single-model quality assessment method for poor quality models that uses simple linear combinations of six features. We perform weighted search and linear regression on a large dataset of models from the 12th Critical Assessment of Protein Structure Prediction (CASP12) and benchmark the results on CASP13 models. We demonstrate that our method achieves outstanding performance on poor quality models.

Download Full-text

A single-model quality assessment method for poor quality protein structure

10.21203/rs.3.rs-17080/v2 ◽

2020 ◽

Author(s):

Jianquan Ouyang ◽

Ningqiao Huang ◽

Yunqi Jiang

Keyword(s):

Protein Structure ◽

Quality Assessment ◽

Structure Prediction ◽

Assessment Method ◽

Poor Quality ◽

Single Model ◽

Model Quality ◽

Model Quality Assessment ◽

Quality Assessment Method

Abstract Background: Quality assessment of protein tertiary structure prediction models, in which structures of the best quality are selected from decoys, is a major challenge in protein structure prediction, and is crucial to determine a model’s utility and potential applications. Estimating the quality of a single model predicts the model’s quality based on the single model itself. In general, the Pearson correlation value of the quality assessment method increases in tandem with an increase in the quality of the model pool. However, there is no consensus regarding the best method to select a few good models from the poor quality model pool.Results: We introduce a novel single-model quality assessment method for poor quality models that uses simple linear combinations of six features. We perform weighted search and linear regression on a large dataset of models from the 12th Critical Assessment of Protein Structure Prediction (CASP12) and benchmark the results on CASP13 models. We demonstrate that our method achieves outstanding performance on poor quality models.Conclusions: According to results of poor protein structure assessment based on six features, contact prediction and relying on fewer prediction features can improve selection accuracy.

Download Full-text

Model Quality Assessment Method Based on Support Vector Machine

Pomiary Automatyka Robotyka ◽

10.14313/par_239/35 ◽

2021 ◽

Vol 25 (1) ◽

pp. 35-39

Author(s):

Łukasz Glodek ◽

Szymon Bysko ◽

Witold Nocoń

Keyword(s):

Support Vector Machine ◽

Quality Assessment ◽

Industry 4.0 ◽

Assessment Method ◽

Support Vector ◽

Model Quality ◽

Digital Twin ◽

Model Quality Assessment ◽

Virtual Commissioning ◽

Quality Assessment Method

This paper proposes a model quality assessment method based on Support Vector Machine, which can be used to develop a digital twin. This work is strongly connected with Industry 4.0, in which the main idea is to integrate machines, devices, systems, and IT. One of the goals of Industry 4.0 is to introduce flexible assortment changes. Virtual commissioning can be used to create a simulation model of a plant or conduct training for maintenance engineers. On a branch of virtual commissioning is a digital twin. The digital twin is a virtual representation of a plant or a device. Thanks to the digital twin, different scenarios can be analyzed to make the testing process less complicated and less time-consuming. The goal of this work is to propose a coefficient that will take into account expert knowledge and methods used for model quality assessment (such as Normalized Root Mean Square Error – NRMSE, Maximum Error – ME). NRMSE and ME methods are commonly used for this purpose, but they have not been used simultaneously so far. Each of them takes into consideration another aspect of a model. The coefficient allows deciding whether the model can be used for digital twin appliances. Such an attitude introduces the ability to test models automatically or in a semi-automatic way.

Download Full-text

2SCP-04 Model quality assessment method based on a residue-residue distance matrix prediction(Prediction and analysis of protein functions from structural bioinformatics,Symposium,The 52th Annual Meeting of the Biophysical Society of Japan(BSJ2014))

Seibutsu Butsuri ◽

10.2142/biophys.54.s132_6 ◽

2014 ◽

Vol 54 (supplement1-2) ◽

pp. S132

Author(s):

Mayuko Takeda-Shitaka

Keyword(s):

Quality Assessment ◽

Annual Meeting ◽

Distance Matrix ◽

Assessment Method ◽

Structural Bioinformatics ◽

Model Quality ◽

Model Quality Assessment ◽

Biophysical Society ◽

Protein Functions ◽

Quality Assessment Method

Download Full-text

QAcon: single model quality assessment using protein structural and contact information with machine learning techniques

Bioinformatics ◽

10.1093/bioinformatics/btw694 ◽

2016 ◽

pp. btw694 ◽

Cited By ~ 12

Author(s):

Renzhi Cao ◽

Badri Adhikari ◽

Debswapna Bhattacharya ◽

Miao Sun ◽

Jie Hou ◽

...

Keyword(s):

Machine Learning ◽

Quality Assessment ◽

Machine Learning Techniques ◽

Single Model ◽

Model Quality ◽

Model Quality Assessment ◽

Contact Information ◽

Learning Techniques

Download Full-text

PconsD: ultra rapid, accurate model quality assessment for protein structure prediction

Bioinformatics ◽

10.1093/bioinformatics/btt272 ◽

2013 ◽

Vol 29 (14) ◽

pp. 1817-1818 ◽

Cited By ~ 26

Author(s):

Marcin J. Skwark ◽

Arne Elofsson

Keyword(s):

Protein Structure ◽

Quality Assessment ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Model Quality ◽

Accurate Model ◽

Model Quality Assessment

Download Full-text

Protein model quality assessment using 3D oriented convolutional neural networks

Bioinformatics ◽

10.1093/bioinformatics/btz122 ◽

2019 ◽

Vol 35 (18) ◽

pp. 3313-3319 ◽

Cited By ~ 14

Author(s):

Guillaume Pagès ◽

Benoit Charmettant ◽

Sergei Grudinin

Keyword(s):

Neural Networks ◽

Quality Assessment ◽

Convolutional Neural Networks ◽

Single Model ◽

Model Quality ◽

Model Quality Assessment ◽

Density Maps ◽

Protein Model ◽

Protein Model Quality Assessment ◽

3D Cnn

Abstract Motivation Protein model quality assessment (QA) is a crucial and yet open problem in structural bioinformatics. The current best methods for single-model QA typically combine results from different approaches, each based on different input features constructed by experts in the field. Then, the prediction model is trained using a machine-learning algorithm. Recently, with the development of convolutional neural networks (CNN), the training paradigm has changed. In computer vision, the expert-developed features have been significantly overpassed by automatically trained convolutional filters. This motivated us to apply a three-dimensional (3D) CNN to the problem of protein model QA. Results We developed Ornate (Oriented Routed Neural network with Automatic Typing)—a novel method for single-model QA. Ornate is a residue-wise scoring function that takes as input 3D density maps. It predicts the local (residue-wise) and the global model quality through a deep 3D CNN. Specifically, Ornate aligns the input density map, corresponding to each residue and its neighborhood, with the backbone topology of this residue. This circumvents the problem of ambiguous orientations of the initial models. Also, Ornate includes automatic identification of atom types and dynamic routing of the data in the network. Established benchmarks (CASP 11 and CASP 12) demonstrate the state-of-the-art performance of our approach among single-model QA methods. Availability and implementation The method is available at https://team.inria.fr/nano-d/software/Ornate/. It consists of a C++ executable that transforms molecular structures into volumetric density maps, and a Python code based on the TensorFlow framework for applying the Ornate model to these maps. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

SVMQA: support–vector-machine-based protein single-model quality assessment

Bioinformatics ◽

10.1093/bioinformatics/btx222 ◽

2017 ◽

Vol 33 (16) ◽

pp. 2496-2503 ◽

Cited By ~ 102

Author(s):

Balachandran Manavalan ◽

Jooyoung Lee

Keyword(s):

Support Vector Machine ◽

Quality Assessment ◽

Support Vector ◽

Single Model ◽

Model Quality ◽

Model Quality Assessment

Download Full-text

P3CMQA: Single-Model Quality Assessment Using 3DCNN with Profile-Based Features

Bioengineering ◽

10.3390/bioengineering8030040 ◽

2021 ◽

Vol 8 (3) ◽

pp. 40

Author(s):

Yuma Takei ◽

Takashi Ishida

Keyword(s):

Quality Assessment ◽

Structure Prediction ◽

Tertiary Structure ◽

Protein Structures ◽

Three Dimensional ◽

Sequence Profile ◽

Single Model ◽

Model Quality ◽

Model Quality Assessment ◽

Assessment Performance

Model quality assessment (MQA), which selects near-native structures from structure models, is an important process in protein tertiary structure prediction. The three-dimensional convolution neural network (3DCNN) was applied to the task, but the performance was comparable to existing methods because it used only atom-type features as the input. Thus, we added sequence profile-based features, which are also used in other methods, to improve the performance. We developed a single-model MQA method for protein structures based on 3DCNN using sequence profile-based features, namely, P3CMQA. Performance evaluation using a CASP13 dataset showed that profile-based features improved the assessment performance, and the proposed method was better than currently available single-model MQA methods, including the previous 3DCNN-based method. We also implemented a web-interface of the method to make it more user-friendly.

Download Full-text

De novo protein structure prediction by incremental inter-residue geometries prediction and model quality assessment using deep learning

10.1101/2022.01.11.475831 ◽

2022 ◽

Author(s):

Jun Liu ◽

Guangxing He ◽

Kailong Zhao ◽

Guijun Zhang

Keyword(s):

Protein Structure ◽

Quality Assessment ◽

Protein Structure Prediction ◽

Structure Prediction ◽

De Novo ◽

Feedback Mechanism ◽

Geometric Constraints ◽

Model Quality ◽

Model Quality Assessment ◽

Loop Feedback

Motivation: The successful application of deep learning has promoted progress in protein model quality assessment. How to use model quality assessment to further improve the accuracy of protein structure prediction, especially not reliant on the existing templates, is helpful for unraveling the folding mechanism. Here, we investigate whether model quality assessment can be introduced into structure prediction to form a closed-loop feedback, and iteratively improve the accuracy of de novo protein structure prediction. Results: In this study, we propose a de novo protein structure prediction method called RocketX. In RocketX, a feedback mechanism is constructed through the geometric constraint prediction network GeomNet, the structural simulation module, and the model quality evaluation network EmaNet. In GeomNet, the co-evolutionary features extracted from MSA that search from the sequence databases are sent to an improved residual neural network to predict the inter-residue geometric constraints. The structure model is folded based on the predicted geometric constraints. In EmaNet, the 1D and 2D features are extracted from the folded model and sent to the deep residual neural network to estimate the inter-residue distance deviation and per-residue lDDT of the model, which will be fed back to GeomNet as dynamic features to correct the geometries prediction and progressively improve model accuracy. RocketX is tested on 483 benchmark proteins and 20 FM targets of CASP14. Experimental results show that the closed-loop feedback mechanism significantly contributes to the performance of RocketX, and the prediction accuracy of RocketX outperforms that of the state-of-the-art methods trRosetta (without templates) and RaptorX. In addition, the blind test results on CAMEO show that although no template is used, the prediction accuracy of RocketX on medium and hard targets is comparable to the advanced methods that integrate templates.

Download Full-text