Exploring Protein Fold Space

The model of protein folding proposed by Ptitsyn and colleagues involves the accretion of secondary structures around a nucleus. As developed by Efimov, this model also provides a useful way to view the relationships among structures. Although somewhat eclipsed by later databases based on the pairwise comparison of structures, Efimov’s approach provides a guide for the more automatic comparison of proteins based on an encoding of their topology as a string. Because it is restricted to layers of secondary structures based on beta sheets, this encoding also has limitations, which are partly overcome by moving to a more generalised secondary structure lattice that can encompass both open and closed (barrel) sheets as well as helical packing of the type encoded by Murzin and Finkelstein on small polyhedra. Regular (crystalline) lattices, such as hexagonal close packing, were found to be too limited, so pseudo-lattices were investigated, including those found in quasicrystals and the tetrahedron-based lattice that Bernal used to represent liquid water. The Bernal lattice was considered the best and was used to generate model protein structures. These were much more numerous than those seen in Nature, posing the open question of why this might be.

Predicting secondary structures, contact numbers, and residue-wise contact orders of native protein structures from amino acid sequences using critical random networks

BIOPHYSICS ◽

10.2142/biophysics.1.67 ◽

2005 ◽

Vol 1 ◽

pp. 67-74 ◽

Cited By ~ 14

Author(s):

Akira R. Kinjo ◽

Ken Nishikawa

Keyword(s):

Amino Acid ◽

Protein Structures ◽

Secondary Structures ◽

Amino Acid Sequences ◽

Random Networks ◽

Native Protein

Use of secondary structural information and Cα-Cα distance restraints to model protein structures with MODELLER

Journal of Biosciences ◽

10.1007/s12038-007-0093-1 ◽

2007 ◽

Vol 32 (S1) ◽

pp. 929-936 ◽

Cited By ~ 3

Author(s):

Boojala V. B. Reddy ◽

Yiannis N. Kaznessis

Keyword(s):

Structural Information ◽

Protein Structures ◽

Distance Restraints ◽

Model Protein

Protein Structure Idealization: How accurately is it possible to model protein structures with dihedral angles?

Algorithms for Molecular Biology ◽

10.1186/1748-7188-8-5 ◽

2013 ◽

Vol 8 (1) ◽

pp. 5 ◽

Cited By ~ 2

Author(s):

Xuefeng Cui ◽

Shuai Cheng Li ◽

Dongbo Bu ◽

Babak Alipanahi ◽

Ming Li

Keyword(s):

Protein Structure ◽

Protein Structures ◽

Dihedral Angles ◽

Model Protein

How root-mean-square distance (r.m.s.d.) values depend on the resolution of protein structures that are compared

Journal of Applied Crystallography ◽

10.1107/s0021889802020502 ◽

2003 ◽

Vol 36 (1) ◽

pp. 125-128 ◽

Cited By ~ 28

Author(s):

Oliviero Carugo

Keyword(s):

Protein Structure ◽

Root Mean Square ◽

Protein Pair ◽

Protein Structures ◽

Structural Similarity ◽

Protein Crystal ◽

Mean Square ◽

Root Mean Square Distance ◽

Open Question ◽

Mean Square Distance

The most popular estimator of structural similarity is the root-mean-square distance (r.m.s.d.) between equivalent atoms, computed after optimal superposition of the two structures that are compared. It is known that r.m.s.d. values do not depend only on conformational differences but also on other features, for example the dimensions of the structures that are compared. An open question is how they might depend on the accuracy of the experimentally determined protein structures. Given that the accuracy of the protein crystal structures is generally estimated through the crystallographic resolution, it is important to know the dependence of the r.m.s.d. on the crystallographic resolution of the two structures that are compared. A total of 14458 protein structure pairs of identical sequence were compared and the resulting r.m.s.d. values were normalized to 100-residue length to avoid the bias introduced by the dependence of the r.m.s.d. values on the protein-pair dimensions. On average, smaller r.m.s.d. values are associated with protein structure pairs at better resolution and the r.m.s.d. values tend to increase if the two proteins that are compared have been refined at different resolutions. For crystallographic resolutions ranging between 1.6 and 2.9 Å, both relationships appear to be linear: r.m.s.d. = −0.73 + 0.48 × resolution and delta_r.m.s.d. = 0.20 + 0.30 × delta_resolution ('delta' indicating difference). Although the linearity of these relationships is not expected to hold outside the 1.6–2.9 Å resolution range, they are useful in making the r.m.s.d. values more reliable.
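
The two linear fits quoted in this abstract can be evaluated directly. The following is a minimal sketch, not code from the paper: the function names are hypothetical, the abstract does not say how a single "resolution" value is assigned to a pair of structures (here it is simply an input), and values outside the 1.6–2.9 Å range are rejected because the abstract warns that linearity is not expected to hold there.

```python
# Coefficients are the ones quoted in the abstract; function names are
# hypothetical and chosen only for this illustration.
RES_MIN, RES_MAX = 1.6, 2.9  # resolution range (in Å) where the fits are stated to hold


def _check_resolution(resolution):
    """Reject resolutions outside the range covered by the linear fits."""
    if not RES_MIN <= resolution <= RES_MAX:
        raise ValueError(
            f"resolution {resolution} Å is outside the {RES_MIN}-{RES_MAX} Å range"
        )


def rmsd_from_fit(resolution):
    """Evaluate r.m.s.d. = -0.73 + 0.48 * resolution (normalized r.m.s.d., in Å)."""
    _check_resolution(resolution)
    return -0.73 + 0.48 * resolution


def delta_rmsd_from_fit(resolution_a, resolution_b):
    """Evaluate delta_r.m.s.d. = 0.20 + 0.30 * delta_resolution.

    delta_resolution is taken here as the absolute difference between the
    two resolutions, which is an assumption about the abstract's notation.
    """
    _check_resolution(resolution_a)
    _check_resolution(resolution_b)
    return 0.20 + 0.30 * abs(resolution_a - resolution_b)


if __name__ == "__main__":
    print(round(rmsd_from_fit(2.0), 2))             # 0.23
    print(round(delta_rmsd_from_fit(1.8, 2.8), 2))  # 0.5
```

Raising an error outside the fitted range, rather than extrapolating, follows the abstract's own caveat about the limits of the linear relationships.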

Characteristics of Protein Fold Space Exhibits Close Dependence on Domain Usage

Bioinformatics and Biomedical Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-030-17938-0_32 ◽

2019 ◽

pp. 356-369

Author(s):

Michael T. Zimmermann ◽

Fadi Towfic ◽

Robert L. Jernigan ◽

Andrzej Kloczkowski

Keyword(s):

Protein Fold ◽

Protein Fold Space

How are model protein structures distributed in sequence space?

Biophysical Journal ◽

10.1016/s0006-3495(97)78268-7 ◽

1997 ◽

Vol 73 (5) ◽

pp. 2393-2403 ◽

Cited By ~ 92

Author(s):

E. Bornberg-Bauer

Keyword(s):

Sequence Space ◽

Protein Structures ◽

Model Protein

Exploring protein fold space by secondary structure prediction using data distribution method on Grid platform

Bioinformatics ◽

10.1093/bioinformatics/bth435 ◽

2004 ◽

Vol 20 (18) ◽

pp. 3500-3507 ◽

Cited By ~ 8

Author(s):

S. Lee ◽

M.-K. Cho ◽

J.-W. Jung ◽

J.-H. Kim ◽

W. Lee

Keyword(s):

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Data Distribution ◽

Protein Fold ◽

Distribution Method ◽

Using Data ◽

Protein Fold Space ◽

Grid Platform

BRASERO: A Resource for Benchmarking RNA Secondary Structure Comparison Algorithms

Advances in Bioinformatics ◽

10.1155/2012/893048 ◽

2012 ◽

Vol 2012 ◽

pp. 1-5 ◽

Cited By ~ 6

Author(s):

Julien Allali ◽

Cédric Saule ◽

Cédric Chauve ◽

Yves d’Aubenton-Carafa ◽

Alain Denise ◽

...

Keyword(s):

Noncoding Rna ◽

Fundamental Problem ◽

Pairwise Comparison ◽

Secondary Structures ◽

Software Tools ◽

Structure Comparison ◽

Rna Secondary Structures ◽

Ordered Trees ◽

Synthetic Datasets ◽

Comparison Algorithms

The pairwise comparison of RNA secondary structures is a fundamental problem, with direct application in mining databases for annotating putative noncoding RNA candidates in newly sequenced genomes. An increasing number of software tools are available for comparing RNA secondary structures, based on different models (such as ordered trees or forests, arc annotated sequences, and multilevel trees) and computational principles (edit distance, alignment). We describe here the website BRASERO that offers tools for evaluating such software tools on real and synthetic datasets.

Revisiting Chameleon Sequences in the Protein Data Bank

Algorithms ◽

10.3390/a11080114 ◽

2018 ◽

Vol 11 (8) ◽

pp. 114 ◽

Cited By ~ 3

Author(s):

Mihaly Mezei

Keyword(s):

Protein Data Bank ◽

Protein Structures ◽

Data Bank ◽

Secondary Structures ◽

Steady Growth ◽

Periodic Repetition

The steady growth of the Protein Data Bank (PDB) suggests the periodic repetition of searches for sequences that form different secondary structures in different protein structures; these are called chameleon sequences. This paper presents a fast, O(n log n), algorithm for such searches and presents the results on all protein structures in the PDB. The longest such sequence found consists of 20 residues.
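
To make the idea of a chameleon search concrete, here is a minimal sort-based sketch. It is not the algorithm from the paper, whose details the abstract does not give. The function name `find_chameleons`, the input format (pairs of sequence and per-residue secondary structure strings), and the restriction to all-helix (`H`) versus all-strand (`E`) fragments are all assumptions made for illustration. Sorting the windows costs O(n log n) comparisons for n windows, each comparison taking up to k character steps.

```python
from itertools import groupby


def find_chameleons(chains, k):
    """Return k-residue fragments seen both as all-helix and as all-strand.

    chains: iterable of (sequence, secondary_structure) string pairs of
            equal length, using 'H' for helix and 'E' for strand
            (hypothetical input format).
    """
    windows = []
    for seq, ss in chains:
        if len(seq) != len(ss):
            raise ValueError("sequence and secondary structure lengths differ")
        for i in range(len(seq) - k + 1):
            state = ss[i:i + k]
            # Keep only windows that are uniformly helix or uniformly strand.
            if state == "H" * k or state == "E" * k:
                windows.append((seq[i:i + k], state[0]))

    # Sorting brings identical fragments together so they can be grouped.
    windows.sort()
    chameleons = []
    for fragment, group in groupby(windows, key=lambda w: w[0]):
        states = {state for _, state in group}
        if states == {"H", "E"}:
            chameleons.append(fragment)
    return chameleons


if __name__ == "__main__":
    # Toy data invented for this example, not taken from the PDB.
    toy_chains = [
        ("AAKLVEAAG", "CHHHHHHCC"),
        ("GTKLVEVTG", "CEEEEEECC"),
    ]
    print(find_chameleons(toy_chains, 4))  # ['KLVE']
```

A hash-based grouping would also work; sorting is used here only because it mirrors the n log n cost mentioned in the abstract.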

The Automatic Discovery of Structural Principles Describing Protein Fold Space

Journal of Molecular Biology ◽

10.1016/s0022-2836(03)00620-x ◽

2003 ◽

Vol 330 (4) ◽

pp. 839-850 ◽

Cited By ~ 19

Author(s):

Adrian P Cootes ◽

Stephen H Muggleton ◽

Michael J.E Sternberg

Keyword(s):

Protein Fold ◽

Automatic Discovery ◽

Structural Principles ◽

Protein Fold Space
