Applications of AlphaFold beyond Protein Structure Prediction

Solving the half-century-old protein structure prediction problem by DeepMind's AlphaFold is certainly one of the greatest breakthroughs in biology in the twenty-first century. This breakthrough paved the way for tackling some previously highly challenging or even infeasible problems in structural biology. In this study, we propose strategies to use AlphaFold to address several fundamental problems: (1) protein engineering by predicting the experimentally measured stability changes using the representations extracted from AlphaFold models; (2) estimating the designability of a given protein structure by combining a protein design method (e.g. ProDCoNN), sequential Monte Carlo, and AlphaFold. The designability of a protein structure is defined as the number of sequences that encode that protein structure.; (3) predicting protein stabilities using natural sequences and designed sequences as training data, and representations extracted from AlphaFold models as input features; and (4) understanding the sequence-structure relationship of proteins by computational mutagenesis and testing the foldability of the mutants by AlphaFold. We found the representations extracted from AlphaFold models can be used to predict the experimentally measured stability changes accurately. For the first time, we have estimated the designability for a few real proteins. For example, the designability of chain A of FLT3 ligand (PDB ID: 1ETE) with 134 residues was estimated as 3.12 ± 2.14E85.

Download Full-text

Improved protein structure prediction by deep learning irrespective of co-evolution information

10.1101/2020.10.12.336859 ◽

2020 ◽

Cited By ~ 1

Author(s):

Jinbo Xu ◽

Matthew Mcpartlon ◽

Jin Li

Keyword(s):

Deep Learning ◽

Protein Structure ◽

Protein Structure Prediction ◽

Protein Design ◽

Structure Prediction ◽

Model Building ◽

Evolutionary Information ◽

Designed Proteins ◽

Structure Relationship ◽

Over The Top

We describe our latest study of the deep convolutional residual neural networks (ResNet) for protein structure prediction, including deeper and wider ResNets, the efficacy of different input features, and improved 3D model building methods. Our ResNet can predict correct folds (TMscore>0.5) for 26 out of 32 CASP13 FM (template-free-modeling) targets and L/5 long-range contacts for these targets with precision over 80%, a significant improvement over the CASP13 results. Although co-evolution analysis plays an important role in the most successful structure prediction methods, we show that when co-evolution is not used, our ResNet can still predict correct folds for 18 of the 32 CASP13 FM targets including several large ones. This marks a significant improvement over the top co-evolution-based, non-deep learning methods at CASP13, and other non-coevolution-based deep learning models, such as the popular recurrent geometric network (RGN). With only primary sequence, our ResNet can also predict correct folds for all 21 human-designed proteins we tested. In contrast, RGN predicts correct folds for only 3 human-designed proteins and zero CASP13 FM target. In addition, we find that ResNet may fare better for the human-designed proteins when trained without co-evolution information than with co-evolution. These results suggest that ResNet does not simply denoise co-evolution signals, but instead is able to learn important sequence-structure relationship from experimental structures. This has important implications on protein design and engineering especially when evolutionary information is not available.

Download Full-text

Deep learning techniques have significantly impacted protein structure prediction and protein design

Current Opinion in Structural Biology ◽

10.1016/j.sbi.2021.01.007 ◽

2021 ◽

Vol 68 ◽

pp. 194-207

Author(s):

Robin Pearce ◽

Yang Zhang

Keyword(s):

Deep Learning ◽

Protein Structure ◽

Protein Structure Prediction ◽

Protein Design ◽

Structure Prediction ◽

Learning Techniques

Download Full-text

A Multi-objective Approach to the Protein Structure Prediction Problem using the Biased Random-Key Genetic Algorithm

2021 IEEE Congress on Evolutionary Computation (CEC) ◽

10.1109/cec45853.2021.9504745 ◽

2021 ◽

Author(s):

Felipe Marchi ◽

Rafael Stubs Parpinelli

Keyword(s):

Genetic Algorithm ◽

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Prediction Problem ◽

Multi Objective ◽

Protein Structure Prediction Problem

Download Full-text

Protein structure prediction and design in a biologically-realistic implicit membrane

10.1101/630715 ◽

2019 ◽

Author(s):

Rebecca F. Alford ◽

Patrick J. Fleming ◽

Karen G. Fleming ◽

Jeffrey J. Gray

Keyword(s):

Protein Structure ◽

Amino Acid ◽

Membrane Proteins ◽

Membrane Protein ◽

Protein Structure Prediction ◽

Protein Design ◽

Structure Prediction ◽

De Novo ◽

Computational Design ◽

Amino Acid Distribution

ABSTRACTProtein design is a powerful tool for elucidating mechanisms of function and engineering new therapeutics and nanotechnologies. While soluble protein design has advanced, membrane protein design remains challenging due to difficulties in modeling the lipid bilayer. In this work, we developed an implicit approach that captures the anisotropic structure, shape of water-filled pores, and nanoscale dimensions of membranes with different lipid compositions. The model improves performance in computational bench-marks against experimental targets including prediction of protein orientations in the bilayer, ΔΔG calculations, native structure dis-crimination, and native sequence recovery. When applied to de novo protein design, this approach designs sequences with an amino acid distribution near the native amino acid distribution in membrane proteins, overcoming a critical flaw in previous membrane models that were prone to generating leucine-rich designs. Further, the proteins designed in the new membrane model exhibit native-like features including interfacial aromatic side chains, hydrophobic lengths compatible with bilayer thickness, and polar pores. Our method advances high-resolution membrane protein structure prediction and design toward tackling key biological questions and engineering challenges.Significance StatementMembrane proteins participate in many life processes including transport, signaling, and catalysis. They constitute over 30% of all proteins and are targets for over 60% of pharmaceuticals. Computational design tools for membrane proteins will transform the interrogation of basic science questions such as membrane protein thermodynamics and the pipeline for engineering new therapeutics and nanotechnologies. Existing tools are either too expensive to compute or rely on manual design strategies. In this work, we developed a fast and accurate method for membrane protein design. The tool is available to the public and will accelerate the experimental design pipeline for membrane proteins.

Download Full-text

Particle swarm optimization with backtracking in protein structure prediction problem

2012 IEEE International Conference on Signal Processing, Communication and Computing (ICSPCC 2012) ◽

10.1109/icspcc.2012.6335672 ◽

2012 ◽

Cited By ~ 1

Author(s):

Nanda Dulal Jana ◽

Jaya Sil

Keyword(s):

Particle Swarm Optimization ◽

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Particle Swarm ◽

Prediction Problem ◽

Swarm Optimization ◽

Protein Structure Prediction Problem

Download Full-text

A multi-population memetic algorithm for the 3-D protein structure prediction problem

Swarm and Evolutionary Computation ◽

10.1016/j.swevo.2020.100677 ◽

2020 ◽

Vol 55 ◽

pp. 100677

Author(s):

Leonardo de Lima Corrêa ◽

Márcio Dorn

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Memetic Algorithm ◽

Prediction Problem ◽

Protein Structure Prediction Problem

Download Full-text

A Heterogeneous Parallel Ecologically-Inspired Approach Applied to the 3D-AB Off-Lattice Protein Structure Prediction Problem

2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence ◽

10.1109/brics-cci-cbic.2013.104 ◽

2013 ◽

Cited By ~ 6

Author(s):

Cesar M.V. Benitez ◽

Rafael Stubs Parpinelli ◽

Heitor Silverio Lopes

Keyword(s):

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Prediction Problem ◽

Protein Structure Prediction Problem

Download Full-text

Linkage-learning genetic algorithm application to the protein structure prediction problem

Proceedings of the 2001 ACM symposium on Applied computing - SAC '01 ◽

10.1145/372202.372357 ◽

2001 ◽

Cited By ~ 1

Author(s):

Karl R. Deerman ◽

Gary B. Lamont ◽

Ruth Pachter

Keyword(s):

Genetic Algorithm ◽

Protein Structure ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Prediction Problem ◽

Linkage Learning ◽

Protein Structure Prediction Problem

Download Full-text

AngularQA: Protein Model Quality Assessment with LSTM Networks

10.1101/560995 ◽

2019 ◽

Cited By ~ 1

Author(s):

Matthew Conover ◽

Max Staples ◽

Dong Si ◽

Miao Sun ◽

Renzhi Cao

Keyword(s):

Protein Structure ◽

Quality Assessment ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Model Quality ◽

Time Step ◽

Testing Dataset ◽

Protein Model Quality Assessment ◽

Lstm Network ◽

Protein Structure Prediction Problem

AbstractQuality Assessment (QA) plays an important role in protein structure prediction. Traditional protein QA methods suffer from searching databases or comparing with other models for making predictions, which usually fail. We propose a novel protein single-model QA method which is built on a new representation that converts raw atom information into a series of carbon-alpha (Cα) atoms with side-chain information, defined by their dihedral angles and bond lengths to the prior residue. An LSTM network is used to predict the quality by treating each amino acid as a time-step and consider the final value returned by the LSTM cells. To the best of our knowledge, this is the first time anyone has attempted to use an LSTM model on the QA problem; furthermore, we use a new representation which has not been studied for QA. In addition to angles, we make use of sequence properties like secondary structure at each time-step, without using any database. Our model achieves an overall correlation of 0.651 on the CASP12 testing dataset. Our experiment points out new directions for QA problem and our method could be widely used for protein structure prediction problem. The software is freely available at GitHub:https://github.com/caorenzhi/AngularQA

Download Full-text

Using AlphaFold for Rapid and Accurate Fixed Backbone Protein Design

10.1101/2021.08.24.457549 ◽

2021 ◽

Cited By ~ 1

Author(s):

Lewis Moffat ◽

Joe G. Greener ◽

David T. Jones

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Protein Design ◽

Structure Prediction ◽

Predictive Power ◽

Protein Sequences ◽

Supervised Methods ◽

New Generation ◽

Novel Protein

AbstractThe prediction of protein structure and the design of novel protein sequences and structures have long been intertwined. The recently released AlphaFold has heralded a new generation of accurate protein structure prediction, but the extent to which this affects protein design stands yet unexplored. Here we develop a rapid and effective approach for fixed backbone computational protein design, leveraging the predictive power of AlphaFold. For several designs we demonstrate that not only are the AlphaFold predicted structures in agreement with the desired backbones, but they are also supported by the structure predictions of other supervised methods as well as ab initio folding. These results suggest that AlphaFold, and methods like it, are able to facilitate the development of a new range of novel and accurate protein design methodologies.

Download Full-text