scholarly journals BLASTing away preconceptions in crystallization trials

Author(s):  
Gabriel Jan Abrahams ◽  
Janet Newman

Crystallization is in many cases a critical step for solving the three-dimensional structure of a protein molecule. Determining which set of chemicals to use in the initial screen is typically agnostic of the protein under investigation; however, crystallization efficiency could potentially be improved if this were not the case. Previous work has assumed that sequence similarity may provide useful information about appropriate crystallization cocktails; however, the authors are not aware of any quantitative verification of this assumption. This research investigates whether, given current information, one can detect any correlation between sequence similarity and crystallization cocktails. BLAST was used to quantitate the similarity between protein sequences in the Protein Data Bank, and this was compared with three estimations of the chemical similarities of the respective crystallization cocktails. No correlation was detected between proteins of similar (but not identical) sequence and their crystallization cocktails, suggesting that methods of determining screens based on this assumption are unlikely to result in screens that are better than those currently in use.

2019 ◽  
Vol 52 (6) ◽  
pp. 1422-1426
Author(s):  
Rajendran Santhosh ◽  
Namrata Bankoti ◽  
Adgonda Malgonnavar Padmashri ◽  
Daliah Michael ◽  
Jeyaraman Jeyakanthan ◽  
...  

Missing regions in protein crystal structures are those regions that cannot be resolved, mainly owing to poor electron density (if the three-dimensional structure was solved using X-ray crystallography). These missing regions are known to have high B factors and could represent loops with a possibility of being part of an active site of the protein molecule. Thus, they are likely to provide valuable information and play a crucial role in the design of inhibitors and drugs and in protein structure analysis. In view of this, an online database, Missing Regions in Polypeptide Chains (MRPC), has been developed which provides information about the missing regions in protein structures available in the Protein Data Bank. In addition, the new database has an option for users to obtain the above data for non-homologous protein structures (25 and 90%). A user-friendly graphical interface with various options has been incorporated, with a provision to view the three-dimensional structure of the protein along with the missing regions using JSmol. The MRPC database is updated regularly (currently once every three months) and can be accessed freely at the URL http://cluster.physics.iisc.ac.in/mrpc.


2011 ◽  
Vol 44 (3) ◽  
pp. 647-650
Author(s):  
Venkatesh Babu ◽  
M. Uthayakumar ◽  
M. Kirti Vaishnavi ◽  
R. Senthilkumar ◽  
M. Shankar ◽  
...  

Repeats are two or more contiguous segments of amino acid residues that are believed to have arisen as a result of intragenic duplication, recombination and mutation events. These repeats can be utilized for protein structure prediction and can provide insights into the protein evolution and phylogenetic relationship. Therefore, to aid structural biologists and phylogeneticists in their research, a computing resource (a web server and a database), Repeats in Protein Sequences (RPS), has been created. Using RPS, users can obtain useful information regarding identical, similar and distant repeats (of varying lengths) in protein sequences. In addition, users can check the frequency of occurrence of the repeats in sequence databases such as the Genome Database, PIR and SWISS-PROT and among the protein sequences available in the Protein Data Bank archive. Furthermore, users can view the three-dimensional structure of the repeats using the Java visualization plug-inJmol. The proposed computing resource can be accessed over the World Wide Web at http://bioserver1.physics.iisc.ernet.in/rps/.


2020 ◽  
Vol 5 (7) ◽  
Author(s):  
Lucas Paul ◽  
Celestin N. Mudogo ◽  
Kelvin M. Mtei ◽  
Revocatus L. Machunda ◽  
Fidele Ntie-Kang

AbstractCassava is a strategic crop, especially for developing countries. However, the presence of cyanogenic compounds in cassava products limits the proper nutrients utilization. Due to the poor availability of structure discovery and elucidation in the Protein Data Bank is limiting the full understanding of the enzyme, how to inhibit it and applications in different fields. There is a need to solve the three-dimensional structure (3-D) of linamarase from cassava. The structural elucidation will allow the development of a competitive inhibitor and various industrial applications of the enzyme. The goal of this review is to summarize and present the available 3-D modeling structure of linamarase enzyme using different computational strategies. This approach could help in determining the structure of linamarase and later guide the structure elucidation in silico and experimentally.


2021 ◽  
Vol 7 ◽  
Author(s):  
Castrense Savojardo ◽  
Matteo Manfredi ◽  
Pier Luigi Martelli ◽  
Rita Casadio

Solvent accessibility (SASA) is a key feature of proteins for determining their folding and stability. SASA is computed from protein structures with different algorithms, and from protein sequences with machine-learning based approaches trained on solved structures. Here we ask the question as to which extent solvent exposure of residues can be associated to the pathogenicity of the variation. By this, SASA of the wild-type residue acquires a role in the context of functional annotation of protein single-residue variations (SRVs). By mapping variations on a curated database of human protein structures, we found that residues targeted by disease related SRVs are less accessible to solvent than residues involved in polymorphisms. The disease association is not evenly distributed among the different residue types: SRVs targeting glycine, tryptophan, tyrosine, and cysteine are more frequently disease associated than others. For all residues, the proportion of disease related SRVs largely increases when the wild-type residue is buried and decreases when it is exposed. The extent of the increase depends on the residue type. With the aid of an in house developed predictor, based on a deep learning procedure and performing at the state-of-the-art, we are able to confirm the above tendency by analyzing a large data set of residues subjected to variations and occurring in some 12,494 human protein sequences still lacking three-dimensional structure (derived from HUMSAVAR). Our data support the notion that surface accessible area is a distinguished property of residues that undergo variation and that pathogenicity is more frequently associated to the buried property than to the exposed one.


Author(s):  
Enrique E. Abola ◽  
Joel L. Sussman ◽  
Jaime Prilusky ◽  
Nancy O. Manning

2020 ◽  
Vol 49 (D1) ◽  
pp. D452-D457
Author(s):  
Lisanna Paladin ◽  
Martina Bevilacqua ◽  
Sara Errigo ◽  
Damiano Piovesan ◽  
Ivan Mičetić ◽  
...  

Abstract The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increasing the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and presents an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification combining top levels based solely on structural similarity (Class > Topology > Fold) with two new levels (Clan > Family) requiring sequence similarity and describing repeat motifs in collaboration with Pfam. Data growth has been addressed with improved mechanisms for browsing the classification hierarchy. A new UniProt-centric view unifies the increasingly frequent annotation of structures from identical or similar sequences. This update of RepeatsDB aligns with our commitment to develop a resource that extracts, organizes and distributes specialized information on tandem repeat protein structures.


Author(s):  
SANGEETA RANI ◽  
KAVITA GAHLOT ◽  
ARVIND KUMAR

Objective: The purpose of this study was to investigate the diabetic effect of phytocompounds isolated from Cressa cretica Linn. using spectroscopic analysis and molecular docking studies. Methods: Coarse powder of the whole plant of C. cretica was extracted with methanol, extracted part was subjected to silica column isolation, and two compounds: 2-Isopropyl-4-(1-methyl-dodeca-2,4-dienyloxy)-benzene-1,3,5-triol (Compound CN-01) and 11-Methyl-dodeca-2,4,6,8,10-pentenoic acid 2,3-dihydroxy-5-methyl-phenyl ester (Compound CN-02) were isolated in pure form. The three-dimensional structure of target protein was downloaded from PDB (www.rcsb.org) Protein Data Bank, Ligand file CN – 01 and CN – 02 were converted to MDL Molfile (V2000) format using ChemSketch 2017.2.1. These files could not be used directly in AutoDock 4.0 tools; thus, they were first converted to PDB files using an open babel tool. Results: Compounds were revealed through spectroscopic analysis and screened using AutoDock 4.0 tools. Docking study recommended that CN – 01 and CN – 02 an existing phytochemical from the plant of C. cretica had the highest fitness docking score and hence could be a potent antidiabetic drug. Conclusion: In this investigation, we docked the receptor (glycogen phosphorylase protein) holds a promising lead target formation against diabetes based on molecular docking analysis (minimum hydrogen bond length and maximum docked score). Thus, these compounds can be effectively used as drugs for treating diabetes which is predicted on the basis of docking scores.


2008 ◽  
Vol 41 (5) ◽  
pp. 952-954 ◽  
Author(s):  
S. Praveen ◽  
J. Ramesh ◽  
P. Sivasankari ◽  
G. Sowmiya ◽  
K. Sekar

By exploiting the fast-growing Internet technology, the interactive computing serverWater Analysis Package(WAP, version 2.0) has been updated with more flexible options to better understand the role of the water O atoms present in three-dimensional macromolecular (protein or nucleic acid) structures. The updated robust server facilitates the computation and visualization of water molecules from various hydration shells, interfacial water molecules and those water molecules that stabilize various secondary structural elements. It is also possible to detect the interactions of water molecules with various parts (polar atoms, nonpolar atoms, main-chain and side-chain atoms) of the protein molecule. Furthermore, a molecular graphics visualization program is interfaced to display the nature of the interactions of the water molecules. The Protein Data Bank archive interfaced with the server is updated every week; hence users get to analyse the latest structures. The computing server can be obtained from http://dicsoft2.physics.iisc.ernet.in/wap/.


Biologia ◽  
2014 ◽  
Vol 69 (3) ◽  
Author(s):  
Venkatesh Kumaresan ◽  
Prasanth Bhatt ◽  
Rajesh Palanisamy ◽  
Annie Gnanam ◽  
Mukesh Pasupuleti ◽  
...  

AbstractCathepsin L, a lysosomal endopeptidase, is a member of the peptidase C1 family (papain-like family) of cysteine proteinases that cleave peptide bonds of lysosomal proteins. In this study, we report a cathepsin L sequence identified from the constructed cDNA library of striped murrel Channa striatus (designated as CsCath L) using genome sequencing FLXTM technology. The full-length CsCath L contains three eukaryotic thiol protease domains at positions 134-145, 278-288 and 299-318. Phylogenetic analysis revealed that the CsCath L was clustered together with other cathepsin L from teleosts. The three-dimensional structure of CsCath L modelled by the I-Tasser program was compared with structures deposited in the Protein Data Bank to find out the structural similarity of CsCath L with experimentally identified structures. The results showed that the CsCath L exhibits maximum structural identity with pro-cathepsin L from human. The RNA fold structure of CsCath L was predicted along with its minimum free energy (−471.93 kcal/mol). The highest CsCath L gene expression was observed in liver, which was also significantly higher (P < 0.05) than that detected in other tissues taken for analysis. In order to investigate the mRNA transcription profile of CsCath L during infection, C. striatus were injected with fungus (Aphanomyces invadans) and bacteria (Aeromonas hydrophila) and its expression was up-regulated in liver at various time points. Similar to gene expression studies, the highest CsCath L enzyme activity was also observed in liver and its activity was up-regulated by fungal and bacterial infections.


Sign in / Sign up

Export Citation Format

Share Document