Predicting mucin-type O-glycosylation using enhancement value products from derived protein features

Mucin-type O-glycosylation is one of the most common post-translational modifications of proteins. This glycosylation is initiated in the Golgi by the addition of the sugar N-acetylgalactosamine (GalNAc) onto protein Ser and Thr residues by a family of polypeptide GalNAc transferases. In humans, there are 20 isoforms that are differentially expressed across tissues that serve multiple important biological roles. Using random peptide substrates, isoform specific amino acid preferences have been obtained in the form of enhancement values (EV). These EVs alone have previously been used to predict O-glycosylation sites via the web based ISOGlyP (Isoform Specific O-Glycosylation Prediction) tool. Here, we explore additional protein features to determine whether these can complement the random peptide derived enhancement values and increase the predictive power of ISOGlyP. The inclusion of additional protein substrate features (such as secondary structure and surface accessibility) was found to increase sensitivity with minimal loss of specificity, when tested with three different published in vivo O-glycoproteomics data sets, thus increasing the overall accuracy of the ISOGlyP predictions.

Download Full-text

Identification of Common and Unique Peptide Substrate Preferences for the UDP-GalNAc:Polypeptide α-N-acetylgalactosaminyltransferases T1 and T2 Derived from Oriented Random Peptide Substrates

Journal of Biological Chemistry ◽

10.1074/jbc.m605149200 ◽

2006 ◽

Vol 281 (43) ◽

pp. 32403-32416 ◽

Cited By ~ 69

Author(s):

Thomas A. Gerken ◽

Jayalakshmi Raman ◽

Timothy A. Fritz ◽

Oliver Jamison

Keyword(s):

Amino Acid ◽

Substrate Specificity ◽

Sequence Motif ◽

Peptide Substrate ◽

Lectin Affinity Chromatography ◽

Peptide Substrates ◽

Random Peptide ◽

Glycosylation Sites ◽

T1 And T2

A large family of UDP-GalNAc:polypeptide α-N-acetylgalactosaminyltransferases (ppGalNAc Ts) catalyzes the first step of mucin-type protein O-glycosylation by transferring GalNAc to serine and threonine residues of acceptor polypeptides. The acceptor peptide substrate specificity and specific protein targets of the individual ppGalNAc T family members remain poorly characterized and poorly understood, despite the fact that mutations in two individual isoforms are deleterious to man and the fly. In this work a series of oriented random peptide substrate libraries, based on the GAGAXXXTXXXAGAGK sequence motif (where X = randomized positions), have been used to obtain the first comprehensive determination of the peptide substrate specificities of the mammalian ppGalNAc T1 and T2 isoforms. ppGalNAc T-glycosylated random peptides were isolated by lectin affinity chromatography, and transferase amino acid preferences were determined by Edman amino acid sequencing. The results reveal common and unique position-sensitive features for both transferases, consistent with previous reports of the preferences of ppGalNAc T1 and T2. The random peptide substrates also reveal additional specific features that have never been described before that are consistent with the x-ray crystal structures of the two transferases and furthermore are reflected in a data base analysis of in vivo O-glycosylation sites. By using the transferase-specific preferences, optimum and selective acceptor peptide substrates have been generated for each transferase. This approach represents a relatively complete, facile, and reproducible method for obtaining ppGalNAc T peptide substrate specificity. Such information will be invaluable for identifying isoform-specific peptide acceptors, creating isoform-specific substrates, and predicting O-glycosylation sites.

Download Full-text

ISOGlyP: de novo prediction of isoform-specific mucin-type O-glycosylation

Glycobiology ◽

10.1093/glycob/cwaa067 ◽

2020 ◽

Author(s):

Jonathon E Mohl ◽

Thomas A Gerken ◽

Ming-Ying Leung

Keyword(s):

Amino Acid ◽

Posttranslational Modifications ◽

Prediction Accuracy ◽

Short Range ◽

De Novo ◽

Specific Amino Acid ◽

Glycosylation Sites ◽

Specific Enhancement ◽

A Site

Abstract Mucin-type O-glycosylation is one of the most common posttranslational modifications of proteins. The abnormal expression of various polypeptide GalNAc-transferases (GalNAc-Ts) which initiate and define sites of O-glycosylation are linked to many cancers and other diseases. Current O-glycosyation prediction programs utilize O-glycoproteomics data obtained without regard to the transferase isoform (s) responsible for the glycosylation. With 20 different GalNAc-Ts in humans, having an ability to predict and interpret O-glycosylation sites in terms of specific GalNAc-T isoforms is invaluable. To fill this gap, ISOGlyP (Isoform-Specific O-Glycosylation Prediction) has been developed. Using position-specific enhancement values generated based on GalNAc-T isoform-specific amino acid preferences, ISOGlyP predicts the propensity that a site would be glycosylated by a specific transferase. ISOGlyP gave an overall prediction accuracy of 70% against in vivo data, which is comparable to that of the NetOGlyc4.0 predictor. Additionally, ISOGlyP can identify the known effects of long- and short-range prior glycosylation and can generate potential peptide sequences selectively glycosylated by specific isoforms. ISOGlyP is freely available for use at ISOGlyP.utep.edu. The code is also available on GitHub (https://github.com/jonmohl/ISOGlyP).

Download Full-text

PTMphinder: an R package for PTM site localization and motif extraction from proteomic datasets

PeerJ ◽

10.7717/peerj.7046 ◽

2019 ◽

Vol 7 ◽

pp. e7046 ◽

Cited By ~ 3

Author(s):

Jacob M. Wozniak ◽

David J. Gonzalez

Keyword(s):

R Package ◽

Data Sets ◽

Amino Acid Residues ◽

Post Translational Modifications ◽

Specific Amino Acid ◽

Proteomic Data ◽

Wide Range ◽

Site Localization ◽

Programming Knowledge ◽

Beta Testing

Background Mass-spectrometry-based proteomics is a prominent field of study that allows for the unbiased quantification of thousands of proteins from a particular sample. A key advantage of these techniques is the ability to detect protein post-translational modifications (PTMs) and localize them to specific amino acid residues. These approaches have led to many significant findings in a wide range of biological disciplines, from developmental biology to cancer and infectious diseases. However, there is a current lack of tools available to connect raw PTM site information to biologically meaningful results in a high-throughput manner. Furthermore, many of the available tools require significant programming knowledge to implement. Results The R package PTMphinder was designed to enable researchers, particularly those with minimal programming background, to thoroughly analyze PTMs in proteomic data sets. The package contains three functions: parseDB, phindPTMs and extractBackground. Together, these functions allow users to reformat proteome databases for easier analysis, localize PTMs within full proteins, extract motifs surrounding the identified sites and create proteome-specific motif backgrounds for statistical purposes. Beta-testing of this R package has demonstrated its simplicity and ease of integration with existing tools. Conclusion PTMphinder empowers researchers to fully analyze and interpret PTMs derived from proteomic data. This package is simple enough for researchers with limited programming experience to understand and implement. The data produced from this package can inform subsequent research by itself and also be used in conjunction with other tools, such as motif-x, for further analysis.

Download Full-text

Predictive and Descriptive CoMFA Models: The Effect of Variable Selection

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207321666180212162028 ◽

2018 ◽

Vol 21 (2) ◽

pp. 117-124 ◽

Cited By ~ 4

Author(s):

Bakhtyar Sepehri ◽

Nematollah Omidikia ◽

Mohsen Kompany-Zareh ◽

Raouf Ghavami

Keyword(s):

Variable Selection ◽

Predictive Power ◽

Selection Method ◽

Data Sets ◽

Data Set ◽

Comfa Model ◽

Variable Selection Method

Aims & Scope: In this research, 8 variable selection approaches were used to investigate the effect of variable selection on the predictive power and stability of CoMFA models. Materials & Methods: Three data sets including 36 EPAC antagonists, 79 CD38 inhibitors and 57 ATAD2 bromodomain inhibitors were modelled by CoMFA. First of all, for all three data sets, CoMFA models with all CoMFA descriptors were created then by applying each variable selection method a new CoMFA model was developed so for each data set, 9 CoMFA models were built. Obtained results show noisy and uninformative variables affect CoMFA results. Based on created models, applying 5 variable selection approaches including FFD, SRD-FFD, IVE-PLS, SRD-UVEPLS and SPA-jackknife increases the predictive power and stability of CoMFA models significantly. Result & Conclusion: Among them, SPA-jackknife removes most of the variables while FFD retains most of them. FFD and IVE-PLS are time consuming process while SRD-FFD and SRD-UVE-PLS run need to few seconds. Also applying FFD, SRD-FFD, IVE-PLS, SRD-UVE-PLS protect CoMFA countor maps information for both fields.

Download Full-text

Aryl Hydrocarbon Receptor: Its Regulation and Roles in Transformation and Tumorigenesis

Current Drug Targets ◽

10.2174/1389450120666181109092225 ◽

2019 ◽

Vol 20 (6) ◽

pp. 625-634 ◽

Cited By ~ 1

Author(s):

Xun Che ◽

Wei Dai

Keyword(s):

Target Gene ◽

Tumor Development ◽

Environmental Response ◽

Carcinogenic Effect ◽

Post Translational Modifications ◽

Downstream Target ◽

Aryl Hydrocarbon ◽

Downstream Target Gene ◽

Major Mediator

AhR is an environmental response gene that mediates cellular responses to a variety of xenobiotic compounds that frequently function as AhR ligands. Many AhR ligands are classified as carcinogens or pro-carcinogens. Thus, AhR itself acts as a major mediator of the carcinogenic effect of many xenobiotics in vivo. In this concise review, mechanisms by which AhR trans-activates downstream target gene expression, modulates immune responses, and mediates malignant transformation and tumor development are discussed. Moreover, activation of AhR by post-translational modifications and crosstalk with other transcription factors or signaling pathways are also summarized.

Download Full-text

Development of a Web-Based Toolbox to Support Quantitative In-Vitro-to-In-Vivo Extrapolations (QIVIVE) within Nonanimal Testing Strategies

Chemical Research in Toxicology ◽

10.1021/acs.chemrestox.0c00307 ◽

2020 ◽

Author(s):

Ans Punt ◽

Nicole Pinckaers ◽

Ad Peijnenburg ◽

Jochem Louisse

Keyword(s):

Web Based ◽

Testing Strategies

Download Full-text

TMOD-24. GENERATION AND CHARACTERIZATION OF A NOVEL TRANSGENIC IDO REPORTER MOUSE FOR IDO POSTTRANSLATIONAL MODIFICATION ANALYSIS IN SITU

Neuro-Oncology ◽

10.1093/neuonc/noaa215.974 ◽

2020 ◽

Vol 22 (Supplement_2) ◽

pp. ii233-ii233

Author(s):

April Bell ◽

Lijie Zhai ◽

Erik Ladomersky ◽

Kristen Lauing ◽

Lakshmi Bollu ◽

...

Keyword(s):

Mouse Model ◽

Tumor Cell ◽

Central Nervous System Tumor ◽

Post Translational Modifications ◽

Reporter Mouse ◽

C Terminus ◽

Human Gbm

Abstract Glioblastoma (GBM) is the most common and aggressive primary central nervous system tumor in adults with a median survival of 14.6 months. GBM is a potently immunosuppressive cancer due in-part to the prolific expression of immunosuppressive indoleamine 2,3 dioxygenase 1 (IDO). Tumor cell IDO facilitates the intratumoral accumulation of regulatory T cells (Tregs; CD4+CD25+FoxP3+). Although immunosuppressive IDO activity is canonically characterized by the conversion of tryptophan into kynurenine, we have utilized transgenic and syngeneic mouse models and mutant glioma lines to demonstrate that tumor cell IDO increases Treg accumulation independent of tryptophan metabolism. Here, we address the gap in our understanding of IDO signaling activity in vivo. Subcutaneously-engrafted human GBM expressing human IDO-GFP cDNA was isolated from immunodeficient humanized NSG-SGM3 mice. The tumor was immunoprecipitated for the GFP tag using GFP-TRAP followed by mass spectrometry which revealed a novel methylation site on a lysine residue at amino acid 373 in the IDO C-terminus region. Western blot analysis of IDO protein also revealed the presence of tyrosine phosphorylation. Additionally, we recently created a new transgenic IDO reporter mouse model whereby endogenous IDO is fused to GFP via a T2A linker (IDO→GFP). This model allows for the isolation of IDO+ cells in real-time and without causing cell death, thereby creating the opportunity for downstream molecular analysis of in situ-isolated GFP+ cells. Collectively, our work suggests that IDO non-enzyme activity may involve the post-translational modifications we recently identified. As IDO activity may differ between in vitro and in vivo modeling systems, we will use the new IDO→GFP reporter mouse model for an improved mechanistic understanding of how immunosuppressive IDO facilitates Treg accumulation in vivo.

Download Full-text

Tumor Suppressor SMAR1 Activates and Stabilizes p53 through Its Arginine-Serine-rich Motif

Journal of Biological Chemistry ◽

10.1074/jbc.m413200200 ◽

2005 ◽

Vol 280 (16) ◽

pp. 16019-16029 ◽

Cited By ~ 28

Author(s):

Archana Jalota ◽

Kamini Singh ◽

Lakshminarasimhan Pavithra ◽

Ruchika Kaul-Ghanekar ◽

Shahid Jameel ◽

...

Keyword(s):

Cell Fate ◽

The Novel ◽

Nuclear Retention ◽

Post Translational Modifications ◽

Rich Domain ◽

Dna Damaging Agents ◽

Proliferation And Apoptosis ◽

Interfering Rna ◽

The Guardian

Various stresses and DNA-damaging agents trigger transcriptional activity of p53 by post-translational modifications, making it a global regulatory switch that controls cell proliferation and apoptosis. Earlier we have shown that the novel MAR-associated protein SMAR1 interacts with p53. Here we delineate the minimal domain of SMAR1 (the arginine-serine-rich domain) that is phosphorylated by protein kinase C family proteins and is responsible for p53 interaction, activation, and stabilization within the nucleus. SMAR1-mediated stabilization of p53 is brought about by inhibiting Mdm2-mediated degradation of p53. We also demonstrate that this arginine-serine (RS)-rich domain triggers the various cell cycle modulating proteins that decide cell fate. Furthermore, phenotypic knock-down experiments using small interfering RNA showed that SMAR1 is required for activation and nuclear retention of p53. The level of phosphorylated p53 was significantly increased in the thymus of SMAR1 transgenic mice, showingin vivosignificance of SMAR1 expression. This is the first report that demonstrates the mechanism of action of the MAR-binding protein SMAR1 in modulating the activity of p53, often referred to as the “guardian of the genome.”

Download Full-text

Internal exposure dynamics drive the Adverse Outcome Pathways of synthetic glucocorticoids in fish

Scientific Reports ◽

10.1038/srep21978 ◽

2016 ◽

Vol 6 (1) ◽

Cited By ~ 33

Author(s):

Luigi Margiotta-Casaluci ◽

Stewart F. Owen ◽

Belinda Huerta ◽

Sara Rodríguez-Mozaz ◽

Subramanian Kugathas ◽

...

Keyword(s):

Predictive Power ◽

Adverse Outcome ◽

Plasma Concentrations ◽

Internal Exposure ◽

Adverse Outcome Pathway ◽

Conceptual Tool ◽

Fish Model ◽

Species Specific

Abstract The Adverse Outcome Pathway (AOP) framework represents a valuable conceptual tool to systematically integrate existing toxicological knowledge from a mechanistic perspective to facilitate predictions of chemical-induced effects across species. However, its application for decision-making requires the transition from qualitative to quantitative AOP (qAOP). Here we used a fish model and the synthetic glucocorticoid beclomethasone dipropionate (BDP) to investigate the role of chemical-specific properties, pharmacokinetics, and internal exposure dynamics in the development of qAOPs. We generated a qAOP network based on drug plasma concentrations and focused on immunodepression, skin androgenisation, disruption of gluconeogenesis and reproductive performance. We showed that internal exposure dynamics and chemical-specific properties influence the development of qAOPs and their predictive power. Comparing the effects of two different glucocorticoids, we highlight how relatively similar in vitro hazard-based indicators can lead to different in vivo risk. This discrepancy can be predicted by their different uptake potential, pharmacokinetic (PK) and pharmacodynamic (PD) profiles. We recommend that the development phase of qAOPs should include the application of species-specific uptake and physiologically-based PK/PD models. This integration will significantly enhance the predictive power, enabling a more accurate assessment of the risk and the reliable transferability of qAOPs across chemicals.

Download Full-text

Identification and functional characterization of legumain in amphioxus Branchiostoma belcheri

Bioscience Reports ◽

10.1042/bsr20090049 ◽

2009 ◽

Vol 30 (3) ◽

pp. 177-186 ◽

Cited By ~ 6

Author(s):

Lei Teng ◽

Hiroshi Wada ◽

Shicui Zhang

Keyword(s):

Functional Characterization ◽

Specific Substrate ◽

Glycosylation Sites ◽

Branchiostoma Belcheri ◽

Hen’S Egg ◽

Hind Gut ◽

Catalytic Dyad ◽

Pepstatin A

Legumain has been reported from diverse sources such as plants, parasites (animals) and mammals, but little is known in the lower chordates. The present study reports the first characterization of legumain cDNA from the protochordate Branchiostoma belcheri. The deduced 435-amino-acid-long protein is structurally characterized by the presence of a putative N-terminal signal peptide, a peptidase_C13 superfamily domain with the conserved Lys123-Gly124-Asp125 motif and catalytic dyad His153 and Cys195 and two potential Asn-glycosylation sites at Asn85 and Asn270. Phylogenetic analysis demonstrates that B. belcheri legumain forms an independent cluster together with ascidian legumain, and is positioned at the base of vertebrate legumains, suggesting that B. belcheri legumain gene may represent the archetype of vertebrate legumain genes. Both recombinant legumain expressed in yeast and endogenous legumain are able to be converted into active protein of ~37 kDa via a C-terminal autocleavage at acid pH values. The recombinant legumain efficiently degrades the legumain-specific substrate Z-Ala-Ala-Asn-MCA (benzyloxycarbonyl-L-alanyl-L-alanyl-L-asparagine-4-methylcoumaryl-7-amide) at optimum pH 5.5; and the enzymatic activity is inhibited potently by iodoacetamide and N-ethylmaleimide, partially by hen's-egg white cystatin, but not by E-64 [trans-epoxysuccinyl-L-leucylamido-(4-guanidino)butane], PMSF and pepstatin A. In addition, legumain is expressed in vivo in a tissue-specific manner, with main expression in the hepatic caecum and hind-gut of B. belcheri. Altogether, these results suggest that B. belcheri legumain plays a role in the degradation of macromolecules in food.

Download Full-text