protein contact maps Latest Research Papers

Protein structure prediction using AI and quantum computers

10.1101/2021.05.22.445242 ◽

2021 ◽

Author(s):

Ben Geoffrey A S

Keyword(s):

Machine Learning ◽

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Machine Learning Algorithms ◽

Quantum Computers ◽

Data Set ◽

Backbone Atoms ◽

Protein Contact Maps

This work seeks to combine the combined advantage of leveraging these emerging areas of Artificial Intelligence and quantum computing in applying it to solve the specific biological problem of protein structure prediction using Quantum Machine Learning algorithms. The CASP dataset from ProteinNet was downloaded which is a standardized data set for machine learning of protein structure. Its large and standardized dataset of PDB entries contains the coordinates of the backbone atoms, corresponding to the sequential chain of N, C_alpha, and C' atoms. This dataset was used to train a quantum-classical hybrid Keras deep neural network model to predict the structure of the proteins. To visually qualify the quality of the predicted versus the actual protein structure, protein contact maps were generated with the experimental and predicted protein structure data and qualified. Therefore this model is recommended for the use of protein structure prediction using AI leveraging the power of quantum computers. The code is provided in the following Github repository https://github.com/bengeof/Protein-structure-prediction-using-AI-and-quantum-computers.

Download Full-text

Protein contact map prediction using multiple sequence alignment dropout and consistency learning for sequences with less homologs

10.1101/2021.05.12.443740 ◽

2021 ◽

Author(s):

Xuyang Liu ◽

Lei Jin ◽

Shenghua Gao ◽

Suwen Zhao

Keyword(s):

Sequence Alignment ◽

Multiple Sequence Alignment ◽

Prediction Accuracy ◽

Contact Map ◽

Multiple Sequence ◽

Learning Network ◽

Contact Maps ◽

Protein Contact Maps

The prediction of protein contact map needs enough normalized number of effective sequence (Nf) in multiple sequence alignment (MSA). When Nf is small, the predicted contact maps are often not satisfactory. To solve this problem, we randomly selected a small part of sequence homologs for proteins with large Nf to generate MSAs with small Nf. From these MSAs, input features were generated and were passed through a consistency learning network, aiming to get the same results when using the features generated from the MSA with large Nf. The results showed that this method effectively improves the prediction accuracy of protein contact maps with small Nf.

Download Full-text

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks

PLoS Computational Biology ◽

10.1371/journal.pcbi.1008865 ◽

2021 ◽

Vol 17 (3) ◽

pp. e1008865

Author(s):

Yang Li ◽

Chengxin Zhang ◽

Eric W. Bell ◽

Wei Zheng ◽

Xiaogen Zhou ◽

...

Keyword(s):

Long Range ◽

3D Structure ◽

High Accuracy ◽

Large Set ◽

Contact Map ◽

Homologous Proteins ◽

Convolutional Networks ◽

Contact Maps ◽

Model Training ◽

Protein Contact Maps

The topology of protein folds can be specified by the inter-residue contact-maps and accurate contact-map prediction can help ab initio structure folding. We developed TripletRes to deduce protein contact-maps from discretized distance profiles by end-to-end training of deep residual neural-networks. Compared to previous approaches, the major advantage of TripletRes is in its ability to learn and directly fuse a triplet of coevolutionary matrices extracted from the whole-genome and metagenome databases and therefore minimize the information loss during the course of contact model training. TripletRes was tested on a large set of 245 non-homologous proteins from CASP 11&12 and CAMEO experiments and outperformed other top methods from CASP12 by at least 58.4% for the CASP 11&12 targets and 44.4% for the CAMEO targets in the top-L long-range contact precision. On the 31 FM targets from the latest CASP13 challenge, TripletRes achieved the highest precision (71.6%) for the top-L/5 long-range contact predictions. It was also shown that a simple re-training of the TripletRes model with more proteins can lead to further improvement with precisions comparable to state-of-the-art methods developed after CASP13. These results demonstrate a novel efficient approach to extend the power of deep convolutional networks for high-accuracy medium- and long-range protein contact-map predictions starting from primary sequences, which are critical for constructing 3D structure of proteins that lack homologous templates in the PDB library.

Download Full-text

ConPlot: web-based application for the visualization of protein contact maps integrated with other data

Bioinformatics ◽

10.1093/bioinformatics/btab049 ◽

2021 ◽

Author(s):

Filomeno Sánchez Rodríguez ◽

Shahram Mesdaghi ◽

Adam J Simpkin ◽

J Javier Burgos-Mármol ◽

David L Murphy ◽

...

Keyword(s):

Empty Space ◽

Structural Features ◽

Supplementary Information ◽

Contact Map ◽

Web Based ◽

Contact Maps ◽

File Formats ◽

Residue Contacts ◽

Contact Data ◽

Protein Contact Maps

Abstract Summary Covariance-based predictions of residue contacts and inter-residue distances are an increasingly popular data type in protein bioinformatics. Here we present ConPlot, a web-based application for convenient display and analysis of contact maps and distograms. Integration of predicted contact data with other predictions is often required to facilitate inference of structural features. ConPlot can therefore use the empty space near the contact map diagonal to display multiple coloured tracks representing other sequence-based predictions. Popular file formats are natively read and bespoke data can also be flexibly displayed. This novel visualization will enable easier interpretation of predicted contact maps. Availability and implementation available online at www.conplot.org, along with documentation and examples. Alternatively, ConPlot can be installed and used locally using the docker image from the project’s Docker Hub repository. ConPlot is licensed under the BSD 3-Clause. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks

10.1101/2020.10.05.326140 ◽

2020 ◽

Cited By ~ 2

Author(s):

Yang Li ◽

Chengxin Zhang ◽

Eric W. Bell ◽

Wei Zheng ◽

Xiaogen Zhou ◽

...

Keyword(s):

Ab Initio ◽

Long Range ◽

Structure Prediction ◽

High Accuracy ◽

Contact Map ◽

Convolutional Networks ◽

Contact Maps ◽

Model Training ◽

Folding Simulations ◽

Protein Contact Maps

AbstractThe topology of protein folds can be specified by the inter-residue contact-maps and accurate contact-map prediction can help ab initio structure folding. We developed TripletRes to deduce protein contact-maps from discretized distance profiles by end-to-end training of deep residual neural-networks. Compared to previous approaches, the major advantage of TripletRes is in its ability to learn and directly fuse a triplet of coevolutionary matrices extracted from the whole-genome and metagenome databases and therefore minimize the information loss during the course of contact model training. TripletRes was tested on a large set of 245 non-homologous proteins from CASP and CAMEO experiments, and outperformed other state-of-the-art methods by at least 58.4% for the CASP 11&12 and 44.4% for the CAMEO targets in the top-L long-range contact precision. On the 31 FM targets from the latest CASP13 challenge, TripletRes achieved the highest precision (71.6%) for the top-L/5 long-range contact predictions. These results demonstrate a novel efficient approach to extend the power of deep convolutional networks for high-accuracy medium- and long-range protein contact-map predictions starting from primary sequences, which are critical for constructing 3D structure of proteins that lack homologous templates in the PDB library.AvailabilityThe training and testing data, standalone package, and the online server for TripletRes are available at https://zhanglab.ccmb.med.umich.edu/TripletRes/.Author SummaryAb initio protein folding has been a major unsolved problem in computational biology for more than half a century. Recent community-wide Critical Assessment of Structure Prediction (CASP) experiments have witnessed exciting progress on ab initio structure prediction, which was mainly powered by the boosting of contact-map prediction as the latter can be used as constraints to guide ab initio folding simulations. In this work, we proposed a new open-source deep-learning architecture, TripletRes, built on the residual convolutional neural networks for high-accuracy contact prediction. The large-scale benchmark and blind test results demonstrate significant advancement of the proposed methods over other approaches in predicting medium- and long-range contact-maps that are critical for guiding protein folding simulations. Detailed data analyses showed that the major advantage of TripletRes lies in the unique protocol to fuse multiple evolutionary feature matrices which are directly extracted from whole-genome and metagenome databases and therefore minimize the information loss during the contact model training.

Download Full-text

CoCoNet: Boosting RNA contact prediction by convolutional neural networks

10.1101/2020.07.30.229484 ◽

2020 ◽

Author(s):

Mehari B Zerihun ◽

Fabrizio Pucci ◽

Alexander Schug

Keyword(s):

Neural Networks ◽

Structure Prediction ◽

Prediction Accuracy ◽

Deep Neural Networks ◽

Cross Validation ◽

Rna Structures ◽

Contact Prediction ◽

Contact Maps ◽

Direct Coupling Analysis ◽

Protein Contact Maps

Physics-based co-evolutionary models such as direct coupling analysis (DCA) in combination with machine learning (ML) techniques based on deep neural networks are able to predict protein contact maps with astonishing accuracy. Such contacts can be used as constraints in structure prediction and massively increase prediction accuracy. Unfortunately, the same ML methods cannot readily be applied to RNA as they rely on large structural datasets only available for proteins but not for RNAs. Here, we demonstrate how the small amount of data available for RNA can be used to significantly improve prediction of RNA contact maps. We introduce an algorithm called CoCoNet that is based on a combination of a Coevolutionary model and a shallow Convolutional Neural Network. Despite its simplicity and the small number of trained parameters, the method boosts the contact prediction accuracy by about 70% with respect to straightforward DCA as tested by cross-validation on a dataset of about sixty RNA structures. Both our extensive robustness tests and the limited number of parameters allow the generalization properties of our model. Finally, applications to other RNAs highlight the power of our approach. CoCoNet is freely available and can be found at https://github.com/KIT-MBS/coconet.

Download Full-text

Patch-DCA: Improved Protein Interface Prediction by utilizing Structural Information and Clustering DCA scores

10.1101/656074 ◽

2019 ◽

Cited By ~ 1

Author(s):

Amir Vajdi ◽

Kourosh Zarringhalam ◽

Nurit Haspel

Keyword(s):

Structural Information ◽

Protein Complexes ◽

Evolutionary Information ◽

Sequence Information ◽

Protein Interface ◽

Residue Contacts ◽

Interface Prediction ◽

Sequential Information ◽

The Individual ◽

Protein Contact Maps

AbstractOver the past decade there have been impressive advances in determining the 3D structures of protein complexes. However, there are still many complexes with unknown structures, even when the structures of the individual proteins are known. The advent of protein sequence information provides an opportunity to leverage evolutionary information to enhance the accuracy of protein-protein interface prediction. To this end, several statistical and machine learning methods have been proposed. In particular, direct coupling analysis has recently emerged as a promising approach for identification of protein contact maps from sequential information. However, the ability of these methods to detect protein-protein inter-residue contacts remains relatively limited.In this work, we propose a method to integrate sequential and co-evolution information with structural and functional information to increase the performance of protein-protein interface prediction. Further, we present a post-processing clustering method that improves the average relative F1 score by 70 % and 24 % and the precision by 80 % and 36 % in comparison with two state-of-the-art methods PSICOV and GREMLIN.

Download Full-text

Tree-based Secondary Structure Interactions Predictor Method for Protein Contact Maps

Computación y Sistemas ◽

10.13053/cys-22-4-2827 ◽

2018 ◽

Vol 22 (4) ◽

Author(s):

Julio César Quintana-Zaez ◽

Reynaldo Molina-Ruiz ◽

Cosme Ernesto Santiesteban-Toca

Keyword(s):

Secondary Structure ◽

Contact Maps ◽

Protein Contact Maps

Download Full-text

Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks

Bioinformatics ◽

10.1093/bioinformatics/bty481 ◽

2018 ◽

Cited By ~ 21

Author(s):

Jack Hanson ◽

Kuldip Paliwal ◽

Thomas Litfin ◽

Yuedong Yang ◽

Yaoqi Zhou

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Short Term Memory ◽

Accurate Prediction ◽

Two Dimensional ◽

Short Term ◽

Term Memory ◽

Contact Maps ◽

Long Short Term Memory ◽

Protein Contact Maps

Download Full-text

Predicting Protein Contact Maps Directly from Primary Sequence without the Need for Homologs

Biophysical Journal ◽

10.1016/j.bpj.2017.11.245 ◽

2018 ◽

Vol 114 (3) ◽

pp. 36a

Author(s):

Thrasyvoulos Karydis ◽

Joseph M. Jacobson

Keyword(s):

Primary Sequence ◽

Contact Maps ◽

Protein Contact Maps

Download Full-text

protein contact maps
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Protein structure prediction using AI and quantum computers

Protein contact map prediction using multiple sequence alignment dropout and consistency learning for sequences with less homologs

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks

ConPlot: web-based application for the visualization of protein contact maps integrated with other data

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks

CoCoNet: Boosting RNA contact prediction by convolutional neural networks

Patch-DCA: Improved Protein Interface Prediction by utilizing Structural Information and Clustering DCA scores

Tree-based Secondary Structure Interactions Predictor Method for Protein Contact Maps

Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks

Predicting Protein Contact Maps Directly from Primary Sequence without the Need for Homologs

Export Citation Format

protein contact mapsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Protein structure prediction using AI and quantum computers

Protein contact map prediction using multiple sequence alignment dropout and consistency learning for sequences with less homologs

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks

ConPlot: web-based application for the visualization of protein contact maps integrated with other data

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks

CoCoNet: Boosting RNA contact prediction by convolutional neural networks

Patch-DCA: Improved Protein Interface Prediction by utilizing Structural Information and Clustering DCA scores

Tree-based Secondary Structure Interactions Predictor Method for Protein Contact Maps

Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks

Predicting Protein Contact Maps Directly from Primary Sequence without the Need for Homologs

protein contact maps
Recently Published Documents