Prediction and Analysis of Protein Ubiquitin Sites in the Model Plant A. thaliana

2021 ◽  
Vol 2021 ◽  
pp. 1-7
Author(s):  
Shujun Shan ◽  
Yue Qi ◽  
Jihong Jiang ◽  
Song Guo

Ubiquitination is an important type of protein post-translational modification. Ubiquitin takes part in the regulation of many cellular and biological processes and plays key roles in enzymatic processes. To construct a new tool for classifying ubiquitination sites, we trained a random forest model on experimentally identified ubiquitinated protein sequences from A. thaliana. Specifically, we used the composition of k-spaced amino acid pairs (CKSAAP) encoding and binary encoding to represent the candidate protein segments. The proposed tool achieves a specificity (Sp) of 72.83%, a sensitivity (Sn) of 72.42%, an accuracy (Acc) of 72.63%, and a Matthews correlation coefficient (MCC) of 0.4525. With this performance, the tool gives usable results on the Arabidopsis dataset.
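The two encodings named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the 20-letter alphabet, the default `k_max=2`, the normalisation by the number of counted pairs, and the function names `cksaap` and `binary_encoding` are all assumptions made for the example.

```python
from itertools import product

# Assumption: the 20 standard amino acids, with no gap or padding symbol.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 ordered pairs


def cksaap(segment, k_max=2):
    """Composition of k-spaced amino acid pairs for k = 0..k_max.

    For each k, count residue pairs separated by exactly k positions and
    divide by the number of counted pairs. Returns 400 * (k_max + 1) values.
    """
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        total = 0
        for i in range(len(segment) - k - 1):
            pair = segment[i] + segment[i + k + 1]
            if pair in counts:  # skip pairs containing non-standard symbols
                counts[pair] += 1
                total += 1
        features.extend(counts[p] / total if total else 0.0 for p in PAIRS)
    return features


def binary_encoding(segment):
    """One-hot encode each residue as a 20-dimensional 0/1 vector."""
    vector = []
    for residue in segment:
        vector.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return vector
```

The concatenated vectors could then be passed to any classifier, for example a random forest as in the abstract.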


2020 ◽  
Vol 17 (1) ◽  
pp. 59-77
Author(s):  
Anand Kumar Nelapati ◽  
JagadeeshBabu PonnanEttiyappan

Background: Hyperuricemia and gout are conditions that result from the accumulation of uric acid in the blood and urine. Uric acid is the product of the purine metabolic pathway in humans. Uricase is a therapeutic enzyme that enzymatically reduces the concentration of uric acid in serum and urine by converting it into the more soluble allantoin. Uricases are widely available from several sources, including bacteria, fungi, yeast, plants and animals. Objective: The present study aims to elucidate the structure and physicochemical properties of uricase by in silico analysis. Methods: A total of sixty uricase amino acid sequences from different sources were obtained from NCBI, and analyses including multiple sequence alignment (MSA), homology search, phylogenetic relation, motif search, domain architecture and physicochemical properties, including pI, EC, Ai and Ii, were performed. Results: Multiple sequence alignment of all the selected protein sequences showed distinct differences between bacterial, fungal, plant and animal sources based on the position-specific presence of conserved amino acid residues. The maximum homology of all the selected protein sequences is between 51 and 388. Within individual categories, homology is between 16 and 337 for bacterial uricase, 14 and 339 for fungal uricase, 12 and 317 for plant uricase, and 37 and 361 for animal uricase. The phylogenetic tree constructed from the amino acid sequences revealed clusters corresponding to the different sources of uricase. The physicochemical features revealed that uricase comprises between 300 and 338 amino acid residues, with a molecular weight of 33-39 kDa and a theoretical pI ranging from 4.95 to 8.88. The amino acid composition results showed that valine has a high average frequency of 8.79% compared with the other amino acids in all analyzed species. Conclusion: In the field of bioinformatics, this work might be informative and a stepping stone for other researchers to get an idea of the physicochemical features, evolutionary history and structural motifs of uricase, which can be widely used in the biotechnological and pharmaceutical industries. Therefore, the proposed in silico analysis can be considered for protein engineering work, as well as for gout therapy.
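Two of the physicochemical quantities mentioned above, amino acid composition and the aliphatic index (Ai), can be computed directly from a sequence. The sketch below assumes the commonly used aliphatic index definition, Ai = X(Ala) + 2.9 X(Val) + 3.9 (X(Ile) + X(Leu)), with X in mole percent; the abstract does not say which tool or formula the authors used, and the function names are illustrative.

```python
from collections import Counter


def composition_percent(sequence):
    """Return the mole percent of each residue present in the sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    counts = Counter(sequence)
    length = len(sequence)
    return {aa: 100.0 * n / length for aa, n in counts.items()}


def aliphatic_index(sequence):
    """Aliphatic index from the mole percents of Ala, Val, Ile and Leu.

    Assumed formula: Ai = X(A) + 2.9 * X(V) + 3.9 * (X(I) + X(L)).
    """
    comp = composition_percent(sequence)

    def x(aa):
        return comp.get(aa, 0.0)

    return x("A") + 2.9 * x("V") + 3.9 * (x("I") + x("L"))
```

Averaging `composition_percent(seq)["V"]` over a set of sequences gives the kind of per-residue average frequency reported for valine.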



1998 ◽  
Vol 79 (02) ◽  
pp. 306-309 ◽  
Author(s):  
Dougald Monroe ◽  
Julie Oliver ◽  
Darla Liles ◽  
Harold Roberts ◽  
Jen-Yea Chang

Summary: Tissue factor pathway inhibitor (TFPI) acts to regulate the initiation of coagulation by first inhibiting factor Xa. The complex of factor Xa/TFPI then inhibits the factor VIIa/tissue factor complex. The cDNA sequences of TFPI from several different species have been previously reported. A high level of similarity is present among TFPIs at the molecular level (DNA and protein sequences) as well as in biochemical function (inhibition of factor Xa, VIIa/tissue factor). In this report, we used a PCR-based screening method to clone cDNA for full length TFPI from a mouse macrophage cDNA library. Both cDNA and predicted protein sequences show significant homology to the other reported TFPI sequences, especially to that of rat. Mouse TFPI has a signal peptide of 28 amino acid residues followed by the mature protein (in which the signal peptide is removed) which has 278 amino acid residues. Mouse TFPI, like that of other species, consists of three tandem Kunitz type domains. Recombinant mouse TFPI was expressed in the human kidney cell line 293 and purified for functional assays. When using human clotting factors to investigate the inhibition spectrum of mouse TFPI, it was shown that, in addition to human factor Xa, mouse TFPI inhibits human factors VIIa and IXa, as well as factor XIa. Cloning and expression of the mouse TFPI gene will offer useful information and material for coagulation studies performed in a mouse model system.





2018 ◽  
Vol 15 (2) ◽  
pp. 275-294
Author(s):  
Deepsikha Anand ◽  
Jeya Nasim ◽  
Sangeeta Yadav ◽  
Dinesh Yadav

Microbial xylanases represent an industrially important group of enzymes associated with the hydrolysis of xylan, a major hemicellulosic component of plant cell walls. A total of 122 protein sequences, comprising 58 fungal, 25 bacterial, 19 actinomycete and 20 yeast xylanases, were retrieved from the NCBI GenBank databases. These sequences were characterized in silico for homology, sequence alignment, phylogenetic tree construction, motif assessment and physicochemical attributes. The number of amino acid residues ranged from 188 to 362, molecular weights ranged from 20.3 to 39.7 kDa, and pI ranged from 3.93 to 9.69. The aliphatic index revealed comparatively low thermostability, and the negative GRAVY indicated that xylanases are hydrophilic irrespective of the source organism. Several conserved amino acid residues associated with the catalytic domain of the enzyme were observed, while different microbial sources also revealed a few conserved amino acid residues. The comprehensive phylogenetic tree indicated seven organism-specific, distinct major clusters, designated A, B, C, D, E, F and G. The MEME-based analysis of 10 motifs indicated a predominance of motifs specific to the GH11 family, and one motif, designated motif 3, with the sequence GTVTSDGGTYDIYTTTRTNAP, was found to be present in most of the xylanases irrespective of source. Sequence analysis of microbial xylanases provides an opportunity to develop strategies for molecular cloning and expression of xylanase genes and for identifying sites for genetic manipulation to develop novel xylanases with desired features as per industrial needs.
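The GRAVY (grand average of hydropathy) value referred to above is the mean hydropathy of the residues in a sequence; a negative value indicates an overall hydrophilic protein. The sketch below assumes the Kyte-Doolittle hydropathy scale, which is the usual basis for GRAVY, although the abstract does not name the scale or tool used. The function name `gravy` is illustrative.

```python
# Kyte-Doolittle hydropathy values (assumed scale for GRAVY).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Mean Kyte-Doolittle hydropathy over all residues of the sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Motif 3 from the abstract, used here only as a short example input.
motif_3 = "GTVTSDGGTYDIYTTTRTNAP"
print(f"GRAVY of motif 3: {gravy(motif_3):.3f}")
```

A short motif is not representative of a whole enzyme; the example only shows how the score is computed.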



2020 ◽  
Author(s):  
Kunchur Guruprasad

Mutations in orf1ab poly-protein sequences from human SARS-CoV-2 isolates representing six geographical locations were identified by comparing with the equivalent reference sequences from the Wuhan-Hu-1, China isolate, epicentre of the current COVID-19 pandemic disease. The orf1ab poly-proteins of sequence length 7096 amino acid residues representing 10,929 genomes from six geographical locations comprised a total of 27,895 mutations that corresponded to 2,095 distinct mutation sites. The percentage of mutations was significantly high for RdRp (33.47%), nsp2 (20.04%), helicase (15.95%) and nsp3 (12.61%) proteins, compared to the rest of the proteins, which ranged from 0.14% for nsp10 to 2.79% for nsp6. A total of 2715 mutations were observed for the unique mutation sites identified for each of the six geographical locations. The distribution of the mutations was: Africa (87), Asia (605), Europe (134), North America (1677), Oceania (200) and South America (12). The RdRp protein contained a significantly high mutation percentage (>31%) that varied among the different geographical locations. The nsp2 proteins from Asia, North America, Oceania and South America, the nsp3 proteins from Africa and Europe and the helicase proteins from North America showed a high mutation percentage next to the RdRp proteins. The P4715L mutation in RdRp, T265I in nsp2 and L3606F in nsp6 were observed in all the geographical locations, with the RdRp P4715L mutation being predominant among the orf1ab poly-proteins. In another dataset comprising 158 genomes in which the orf1ab poly-proteins comprised sequences of variable length between 7084 and 7095 amino acid residues, 88 additional distinct mutations were observed for the six geographical locations that included deletion mutations. The proteins containing deletion mutations were: leader protein, nsp2, nsp3, nsp4, nsp6, RdRp, 3'-to-5' exonuclease and endoRNAse.

In this work, all the mutations observed in 11,087 orf1ab poly-proteins of human SARS-CoV-2 comprising between 7084 and 7096 amino acid residues, with reference to the human SARS-CoV-2 orf1ab poly-protein sequences from Wuhan-Hu-1, China, and representing the six geographical locations (Africa, Asia, Europe, North America, Oceania and South America), are presented.
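Mutation labels such as P4715L combine the reference residue, the 1-based position in the poly-protein, and the residue found in the isolate. The comparison against a reference can be sketched as below for sequences of equal length. This is an illustration only: the function names and the toy sequences are hypothetical, and the variable-length sequences with deletions mentioned in the abstract would require an alignment step that is not shown.

```python
from collections import Counter


def substitutions(reference, isolate):
    """List substitutions as '<ref residue><1-based position><new residue>'.

    Assumes both sequences have the same length (no insertions or deletions).
    """
    if len(reference) != len(isolate):
        raise ValueError("sequences must have equal length; align them first")
    return [
        f"{ref}{pos}{alt}"
        for pos, (ref, alt) in enumerate(zip(reference, isolate), start=1)
        if ref != alt
    ]


def mutation_counts(reference, isolates):
    """Count how many isolates carry each substitution."""
    counts = Counter()
    for isolate in isolates:
        counts.update(substitutions(reference, isolate))
    return counts


# Toy sequences, not real SARS-CoV-2 data.
reference = "MPLTK"
isolates = ["MLLTK", "MLLTR", "MPLTK"]
counts = mutation_counts(reference, isolates)
print(counts.most_common())
print("distinct mutation sites:", len({m[1:-1] for m in counts}))
```

The total number of mutations is the sum of the counts, while the number of distinct mutation sites counts each position once, which is the distinction the abstract draws between 27,895 mutations and 2,095 sites.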



2020 ◽  
Vol 65 (6) ◽  
pp. 1065-1071
Author(s):  
А.Н. Некрасов ◽  
Ю.П. Козмин ◽  
С.В. Козырев ◽  
Н.Г. Есипова ◽  
...  

This research investigates 24,647 non-homologous protein sequences. The occurrence profile of pentapeptides was constructed for every sequence, and hierarchically organized elements of various sizes were revealed in each profile by a special mathematical method. The correlations between these hierarchical elements were analyzed, and it was shown that in the tested set of protein sequences there are 11 levels of protein organization, with elements ranging in length from 7 to 56 amino acid residues. It was suggested that the identified levels of organization correspond to elements of super-secondary structure with different topologies.



2005 ◽  
Vol 79 (2) ◽  
pp. 725-731 ◽  
Author(s):  
Duncan J. McGeoch ◽  
Derek Gatherer

ABSTRACT The phylogeny of reptilian herpesviruses (HVs) relative to mammalian and avian HVs was investigated by using available gene sequences and by alignment of encoded amino acid sequences and derivation of trees by maximum-likelihood and Bayesian methods. Phylogenetic loci were obtained for green turtle HV (GTHV) primarily on the basis of DNA polymerase (POL) and DNA binding protein sequences, and for lung-eye-trachea disease-associated HV (LETV) primarily from its glycoprotein B sequence; both have nodes on the branch leading to recognized species in the Alphaherpesvirinae subfamily and should be regarded as new members of that subfamily. A similar but less well defined locus was obtained for an iguanid HV based on a partial POL sequence. On the basis of short POL sequences (around 60 amino acid residues), it appeared likely that GTHV and LETV belong to a private clade and that three HVs of gerrhosaurs (plated lizards) are associated with the iguanid HV. Based on phylogenetic branching patterns for mammalian HV lineages that mirror those of host lineages, we estimated a date for the HV tree's root of around 400 million years ago. Estimated dates for branching events in the development of reptilian, avian, and mammalian Alphaherpesvirinae lineages could plausibly be accounted for in part but not completely by ancient coevolution of these virus lines with reptilian lineages and with the development of birds and mammals from reptilian progenitors.



2017 ◽  
Vol 15 (01) ◽  
pp. 1650025 ◽  
Author(s):  
Liqi Li ◽  
Qifa Luo ◽  
Weidong Xiao ◽  
Jinhui Li ◽  
Shiwen Zhou ◽  
...  

Palmitoylation is the covalent attachment of lipids to amino acid residues in proteins. As an important form of protein posttranslational modification, it increases the hydrophobicity of proteins, which contributes to protein transport, organelle localization, and function, and therefore plays an important role in a variety of cell biological processes. Identification of palmitoylation sites is necessary for understanding protein–protein interaction, protein stability, and activity. Since conventional experimental techniques to determine palmitoylation sites in proteins are both labor intensive and costly, a fast and accurate computational approach to predict palmitoylation sites from protein sequences is urgently needed. In this study, a support vector machine (SVM)-based method was proposed by integrating PSI-BLAST profile, physicochemical properties, [Formula: see text]-mer amino acid compositions (AACs), and [Formula: see text]-mer pseudo AACs into the principal feature vector. A recursive feature selection scheme was subsequently implemented to single out the most discriminative features. Finally, an SVM method was implemented to predict palmitoylation sites in proteins based on the optimal features. The proposed method achieved an accuracy of 99.41% and a Matthews correlation coefficient of 0.9773 on a benchmark dataset. The result indicates the efficiency and accuracy of our method in predicting palmitoylation sites from protein sequences.
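The accuracy and Matthews correlation coefficient quoted above are both derived from the four cells of a binary confusion matrix. The sketch below uses the standard definitions; the function name and the example counts are hypothetical and are not taken from the paper's benchmark.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity, accuracy and MCC from confusion-matrix counts."""
    sn = tp / (tp + fn) if (tp + fn) else 0.0
    sp = tn / (tn + fp) if (tn + fp) else 0.0
    acc = (tp + tn) / (tp + fp + tn + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is undefined when any marginal total is zero; report 0.0 in that case.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"Sn": sn, "Sp": sp, "Acc": acc, "MCC": mcc}


# Hypothetical counts for illustration only.
print(classification_metrics(tp=40, fp=20, tn=30, fn=10))
```

Unlike accuracy, MCC stays informative when the positive and negative classes are imbalanced, which is why site-prediction studies often report both.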



PLoS ONE ◽  
2012 ◽  
Vol 7 (8) ◽  
pp. e43847 ◽  
Author(s):  
Mingjun Wang ◽  
Xing-Ming Zhao ◽  
Kazuhiro Takemoto ◽  
Haisong Xu ◽  
Yuan Li ◽  
...  

