scholarly journals Frag’r’Us: knowledge-based sampling of protein backbone conformations for de novo structure-based protein design

2014 ◽  
Vol 30 (13) ◽  
pp. 1935-1936 ◽  
Author(s):  
Jaume Bonet ◽  
Joan Segura ◽  
Joan Planas-Iglesias ◽  
Baldomero Oliva ◽  
Narcis Fernandez-Fuentes
F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 298 ◽  
Author(s):  
Sari Sabban ◽  
Mikhail Markovsky

The ability to perform de novo protein design will allow researchers to expand the variety of available proteins. By designing synthetic structures computationally, they can utilise more structures than those available in the Protein Data Bank, design structures that are not found in nature, or direct the design of proteins to acquire a specific desired structure. While some researchers attempt to design proteins from first physical and thermodynamic principals, we decided to attempt to test whether it is possible to perform de novo helical protein design ofjust the backbone statistically using machine learning by building a model that uses a long short-term memory (LSTM) generative adversarial network (GAN) architecture. The LSTM-based GAN model used only theφandψangles of each residue from an augmented dataset of only helical protein structures. Though the network’s generated backbone structures were not perfect, they were idealised and evaluated post generation where the non-ideal structures were filtered out and the adequate structures kept. The results were successful in developing a logical, rigid, compact,helical protein backbone topology. This paper is a proof of concept that shows it is possible to generate a novel helical backbone topology using an LSTM-GAN architecture using only theφandψangles as features. The next step is to attempt to use these backbone topologies and sequence design them to form complete protein structures.


2019 ◽  
Author(s):  
Sari Sabban ◽  
Mikhail Markovsky

AbstractThe ability to perform de novo protein design will allow researchers to expand the variety of available proteins. By designing synthetic structures computationally, they can utilise more structures than those available in the Protein Data Bank, design structures that are not found in nature, or direct the design of proteins to acquire a specific desired structure. While some researchers attempt to design proteins from first physical and thermodynamic principals, we decided to attempt to test whether it is possible to perform de novo helical protein design of just the backbone statistically using machine learning by building a model that uses a long short-term memory (LSTM) architecture. The LSTM model used only the ϕ and ψ angles of each residue from an augmented dataset of only helical protein structures. Though the network’s generated backbone structures were not perfect, they were idealised and evaluated post generation where the non-ideal structures were filtered out and the adequate structures kept. The results were successful in developing a logical, rigid, compact, helical protein backbone topology. This paper is a proof of concept that shows it is possible to generate a novel helical backbone topology using an LSTM neural network architecture using only the ϕ and ψ angles as features. The next step is to attempt to use these backbone topologies and sequence design them to form complete protein structures.Author summaryThis research project stemmed from the desire to expand the pool of protein structures that can be used as scaffolds in computational vaccine development, since the number of structures available from the Protein Data Bank was not sufficient to allow for great diversity and increase the probability of grafting a target motif onto a protein scaffold. Since a protein structure’s backbone can be defined by the ϕ and ψ angles of each amino acid in the polypeptide and can effectively translate a protein’s 3D structure into a table of numbers, and since protein structures are not random, this numerical representation of protein structures can be used to train a neural network to mathematically generalise what a protein structure is, and therefore generate new a protein backbone. Instead of using all proteins in the Protein Data Bank a curated dataset was used encompassing protein structures with specific characteristics that will, theoretically, allow them to be evaluated computationally. This paper details how a trained neural network was able to successfully generate helical protein backbones.


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 298 ◽  
Author(s):  
Sari Sabban ◽  
Mikhail Markovsky

The ability to perform de novo protein design will allow researchers to expand the variety of available proteins. By designing synthetic structures computationally, they can utilise more structures than those available in the Protein Data Bank, design structures that are not found in nature, or direct the design of proteins to acquire a specific desired structure. While some researchers attempt to design proteins from first physical and thermodynamic principals, we decided to attempt to test whether it is possible to perform de novo helical protein design of just the backbone statistically using machine learning by building a model that uses a long short-term memory (LSTM) architecture. The LSTM model used only the φ and ψ angles of each residue from an augmented dataset of only helical protein structures. Though the network’s generated backbone structures were not perfect, they were idealised and evaluated post generation where the non-ideal structures were filtered out and the adequate structures kept. The results were successful in developing a logical, rigid, compact, helical protein backbone topology. This paper is a proof of concept that shows it is possible to generate a novel helical backbone topology using an LSTM neural network architecture using only the φ and ψ angles as features. The next step is to attempt to use these backbone topologies and sequence design them to form complete protein structures.


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 298 ◽  
Author(s):  
Sari Sabban ◽  
Mikhail Markovsky

The ability to perform de novo protein design will allow researchers to expand the variety of available proteins. By designing synthetic structures computationally, they can utilise more structures than those available in the Protein Data Bank, design structures that are not found in nature, or direct the design of proteins to acquire a specific desired structure. While some researchers attempt to design proteins from first physical and thermodynamic principals, we decided to attempt to test whether it is possible to perform de novo helical protein design of just the backbone statistically using machine learning by building a model that uses a long short-term memory (LSTM) architecture. The LSTM model used only the φ and ψ angles of each residue from an augmented dataset of only helical protein structures. Though the network’s generated backbone structures were not perfect, they were idealised and evaluated post generation where the non-ideal structures were filtered out and the adequate structures kept. The results were successful in developing a logical, rigid, compact, helical protein backbone topology. This paper is a proof of concept that shows it is possible to generate a novel helical backbone topology using an LSTM neural network architecture using only the φ and ψ angles as features. The next step is to attempt to use these backbone topologies and sequence design them to form complete protein structures.


2021 ◽  
Vol 18 (3) ◽  
pp. 233-233
Author(s):  
Arunima Singh

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Shin Irumagawa ◽  
Kaito Kobayashi ◽  
Yutaka Saito ◽  
Takeshi Miyata ◽  
Mitsuo Umetsu ◽  
...  

AbstractThe stability of proteins is an important factor for industrial and medical applications. Improving protein stability is one of the main subjects in protein engineering. In a previous study, we improved the stability of a four-helix bundle dimeric de novo protein (WA20) by five mutations. The stabilised mutant (H26L/G28S/N34L/V71L/E78L, SUWA) showed an extremely high denaturation midpoint temperature (Tm). Although SUWA is a remarkably hyperstable protein, in protein design and engineering, it is an attractive challenge to rationally explore more stable mutants. In this study, we predicted stabilising mutations of WA20 by in silico saturation mutagenesis and molecular dynamics simulation, and experimentally confirmed three stabilising mutations of WA20 (N22A, N22E, and H86K). The stability of a double mutant (N22A/H86K, rationally optimised WA20, ROWA) was greatly improved compared with WA20 (ΔTm = 10.6 °C). The model structures suggested that N22A enhances the stability of the α-helices and N22E and H86K contribute to salt-bridge formation for protein stabilisation. These mutations were also added to SUWA and improved its Tm. Remarkably, the most stable mutant of SUWA (N22E/H86K, rationally optimised SUWA, ROSA) showed the highest Tm (129.0 °C). These new thermostable mutants will be useful as a component of protein nanobuilding blocks to construct supramolecular protein complexes.


2006 ◽  
Vol 16 (4) ◽  
pp. 508-513 ◽  
Author(s):  
Alan M Poole ◽  
Rama Ranganathan

Sign in / Sign up

Export Citation Format

Share Document