3MOTIF: visualizing conserved protein sequence motifs in the protein structure database

Four fundamentally novel, recent developments make a basis for the Theory of Early Molecular Evolution. The theory outlines the molecular events from the onset of the triplet code to the formation of the earliest sequence/structure/function modules of proteins. These developments are: (1) Reconstruction of the evolutionary chart of codons; (2) Discovery of omnipresent protein sequence motifs, apparently conserved since the last common ancestor; (3) Discovery of closed loops—standard structural modules of modern proteins; (4) Construction of protein sequence space of module size fragments, with far-reaching evolutionary implications. The theory generates numerous predictions, confirmed by massive nucleotide and protein sequence analyses, such as existence of two distinct classes of amino acids, and their periodical distribution along the sequences. The emerging picture of the earliest molecular evolutionary events is outlined: consecutive engagement of codons, formation of the earliest short peptides, and growth of the polypeptide chains to the size of loop closure, 25-30 residues.

Download Full-text

Protein Sequence Motifs Involved in Intracellular Trafficking

Intracellular Antibodies ◽

10.1007/978-3-662-07992-8_5 ◽

1997 ◽

pp. 59-83

Author(s):

Silvia Biocca ◽

Antonino Cattaneo

Keyword(s):

Protein Sequence ◽

Intracellular Trafficking ◽

Sequence Motifs ◽

Protein Sequence Motifs

Download Full-text

Exploring Structurally Similar Protein Sequence Motifs using Relative-Distance Measures

2006 Fourth International Conference on Intelligent Sensing and Information Processing ◽

10.1109/icisip.2006.4286077 ◽

2006 ◽

Author(s):

K. G Srinivasa ◽

M Jagadish ◽

S J Prashanth ◽

K R Venugopal ◽

L M Patnaik

Keyword(s):

Protein Sequence ◽

Distance Measures ◽

Relative Distance ◽

Sequence Motifs ◽

Similar Protein ◽

Protein Sequence Motifs

Download Full-text

Estimation and efficient computation of the true probability of recurrence of short linear protein sequence motifs in unrelated proteins

BMC Bioinformatics ◽

10.1186/1471-2105-11-14 ◽

2010 ◽

Vol 11 (1) ◽

Cited By ~ 17

Author(s):

Norman E Davey ◽

Richard J Edwards ◽

Denis C Shields

Keyword(s):

Protein Sequence ◽

Sequence Motifs ◽

Efficient Computation ◽

True Probability ◽

Protein Sequence Motifs

Download Full-text

Doubly-lipid-modified protein sequence motifs exhibit long-lived anchorage to lipid bilayer membranes

Biochemistry ◽

10.1021/bi00011a039 ◽

1995 ◽

Vol 34 (11) ◽

pp. 3813-3822 ◽

Cited By ~ 203

Author(s):

Serge Shahinian ◽

John R. Silvius

Keyword(s):

Lipid Bilayer ◽

Protein Sequence ◽

Sequence Motifs ◽

Lipid Bilayer Membranes ◽

Bilayer Membranes ◽

Protein Sequence Motifs

Download Full-text

Mining Protein Sequence Motifs Representing Common 3D Structures

2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05) ◽

10.1109/csbw.2005.93 ◽

2006 ◽

Cited By ~ 2

Author(s):

Wei Zhong ◽

G. Altum ◽

R. Harrison ◽

Phang C. Tai ◽

Yi Pan

Keyword(s):

Protein Sequence ◽

Sequence Motifs ◽

3D Structures ◽

Protein Sequence Motifs

Download Full-text

Motif3D: relating protein sequence motifs to 3D structure

Nucleic Acids Research ◽

10.1093/nar/gkg534 ◽

2003 ◽

Vol 31 (13) ◽

pp. 3333-3336 ◽

Cited By ~ 6

Author(s):

A. Gaulton

Keyword(s):

Protein Sequence ◽

3D Structure ◽

Sequence Motifs ◽

Protein Sequence Motifs

Download Full-text

Impact of VP1-Specific Protein Sequence Motifs on Adeno-Associated Virus Type 2 Intracellular Trafficking and Nuclear Entry

Journal of Virology ◽

10.1128/jvi.00282-12 ◽

2012 ◽

Vol 86 (17) ◽

pp. 9163-9174 ◽

Cited By ~ 44

Author(s):

R. Popa-Wagner ◽

M. Porwal ◽

M. Kann ◽

M. Reuss ◽

M. Weimer ◽

...

Keyword(s):

Virus Type ◽

Protein Sequence ◽

Intracellular Trafficking ◽

Specific Protein ◽

Sequence Motifs ◽

Nuclear Entry ◽

Adeno Associated Virus ◽

Protein Sequence Motifs

Download Full-text

Motif Discovery in Protein Structure Databases

Pattern Discovery in Biomolecular Data ◽

10.1093/oso/9780195119404.003.0011 ◽

1999 ◽

Author(s):

Janice Glasgow ◽

Evan Steeg

Keyword(s):

Protein Structure ◽

Protein Sequence ◽

Structure Prediction ◽

Sequence Motifs ◽

Computational Molecular Biology ◽

X Ray ◽

X Ray Crystallography ◽

Efficiency And Effectiveness ◽

Structure Prediction Program ◽

Automated Discovery

The field of knowledge discovery is concerned with the theory and processes involved in the representation and extraction of patterns or motifs from large databases. Discovered patterns can be used to group data into meaningful classes, to summarize data, or to reveal deviant entries. Motifs stored in a database can be brought to bear on difficult instances of structure prediction or determination from X-ray crystallography or nuclear magnetic resonance (NMR) experiments. Automated discovery techniques are central to understanding and analyzing the rapidly expanding repositories of protein sequence and structure data. This chapter deals with the discovery of protein structure motifs. A motif is an abstraction over a set of recurring patterns observed in a dataset; it captures the essential features shared by a set of similar or related objects. In many domains, such as computer vision and speech recognition, there exist special regularities that permit such motif abstraction. In the protein science domain, the regularities derive from evolutionary and biophysical constraints on amino acid sequences and structures. The identification of a known pattern in a new protein sequence or structure permits the immediate retrieval and application of knowledge obtained from the analysis of other proteins. The discovery and manipulation of motifs—in DNA, RNA, and protein sequences and structures—is thus an important component of computational molecular biology and genome informatics. In particular, identifying protein structure classifications at varying levels of abstraction allows us to organize and increase our understanding of the rapidly growing protein structure datasets. Discovered motifs are also useful for improving the efficiency and effectiveness of X-ray crystallographic studies of proteins, for drug design, for understanding protein evolution, and ultimately for predicting the structure of proteins from sequence data. Motifs may be designed by hand, based on expert knowledge. For example, the Chou-Fasman protein secondary structure prediction program (Chou and Fasman, 1978), which dominated the field for many years, depended on the recognition of predefined, user-encoded sequence motifs for α-helices and β-sheets. Several hundred sequence motifs have been cataloged in PROSITE (Bairoch, 1992); the identification of one of these motifs in a novel protein often allows for immediate function interpretation.

Download Full-text