Insilico investigations of disordered regions in hypothetical sequences from human proteome as drug targets

Author(s):  
N. Rathankar ◽  
H.G. Nagendra ◽  
K.A. Nirmala
Author(s):  
Yujia Xiang ◽  
Quan Zou ◽  
Lilin Zhao

Abstract In viruses, posttranslational modifications (PTMs) are essential for their life cycle. Recognizing viral PTMs is very important for a better understanding of the mechanism of viral infections and finding potential drug targets. However, few studies have investigated the roles of viral PTMs in virus–human interactions using comprehensive viral PTM datasets. To fill this gap, we developed the first comprehensive viral posttranslational modification database (VPTMdb) for collecting systematic information of PTMs in human viruses and infected host cells. The VPTMdb contains 1240 unique viral PTM sites with 8 modification types from 43 viruses (818 experimentally verified PTM sites manually extracted from 150 publications and 422 PTMs extracted from SwissProt) as well as 13 650 infected cells’ PTMs extracted from seven global proteomics experiments in six human viruses. The investigation of viral PTM sequences motifs showed that most viral PTMs have the consensus motifs with human proteins in phosphorylation and five cellular kinase families phosphorylate more than 10 viral species. The analysis of protein disordered regions presented that more than 50% glycosylation sites of double-strand DNA viruses are in the disordered regions, whereas single-strand RNA and retroviruses prefer ordered regions. Domain–domain interaction analysis indicating potential roles of viral PTMs play in infections. The findings should make an important contribution to the field of virus–human interaction. Moreover, we created a novel sequence-based classifier named VPTMpre to help users predict viral protein phosphorylation sites. VPTMdb online web server (http://vptmdb.com:8787/VPTMdb/) was implemented for users to download viral PTM data and predict phosphorylation sites of interest.


2020 ◽  
Vol 1864 (8) ◽  
pp. 129618 ◽  
Author(s):  
Ilaria Genovese ◽  
Andrea Carotti ◽  
Andrea Ilari ◽  
Annarita Fiorillo ◽  
Theo Battista ◽  
...  

2019 ◽  
pp. 827-847 ◽  
Author(s):  
Chen Wang ◽  
Michal Brylinski ◽  
Lukasz Kurgan
Keyword(s):  

FEBS Journal ◽  
2017 ◽  
Vol 284 (3) ◽  
pp. 485-498 ◽  
Author(s):  
Norman E. Davey ◽  
Moon‐Hyeong Seo ◽  
Vikash Kumar Yadav ◽  
Jouhyun Jeon ◽  
Satra Nim ◽  
...  

Author(s):  
Lucas Prescott

AbstractA novel coronavirus (SARS-CoV-2) has devastated the globe as a pandemic that has killed more than 1,600,000 people. Widespread vaccination is still uncertain, so many scientific efforts have been directed toward discovering antiviral treatments. Many drugs are being investigated to inhibit the coronavirus main protease, 3CLpro, from cleaving its viral polyprotein, but few publications have addressed this protease’s interactions with the host proteome or their probable contribution to virulence. Too few host protein cleavages have been experimentally verified to fully understand 3CLpro’s global effects on relevant cellular pathways and tissues. Here, I set out to determine this protease’s targets and corresponding potential drug targets. Using a neural network trained on cleavages from 388 coronavirus proteomes with a Matthews correlation coefficient of 0.983, I predict that a large proportion of the human proteome is vulnerable to 3CLpro, with 4,460 out of approximately 20,000 human proteins containing at least one putative cleavage site. These cleavages are nonrandomly distributed and are enriched in the epithelium along the respiratory tract, brain, testis, plasma, and immune tissues and depleted in olfactory and gustatory receptors despite the prevalence of anosmia and ageusia in COVID-19 patients. Affected cellular pathways include cytoskeleton/motor/cell adhesion proteins, nuclear condensation and other epigenetics, host transcription and RNAi, ribosomal stoichiometry and nascent-chain detection and degradation, coagulation, pattern recognition receptors, growth factors, lipoproteins, redox, ubiquitination, and apoptosis. This whole proteome cleavage prediction demonstrates the importance of 3CLpro in expected and nontrivial pathways affecting virulence, lead me to propose more than a dozen potential therapeutic targets against coronaviruses, and should therefore be applied to all viral proteases and subsequently experimentally verified.


2021 ◽  
Author(s):  
Caroline Benz ◽  
Muhammad Ali ◽  
Izabella Krystkowiak ◽  
Leandro Simonetti ◽  
Ahmed Sayadi ◽  
...  

Specific protein-protein interactions are central to all processes that underlie cell physiology. Numerous studies using a wide range of experimental approaches have identified tens of thousands of human protein-protein interactions. However, many interactions remain to be discovered, and low affinity, conditional and cell type-specific interactions are likely to be disproportionately under-represented. Moreover, for most known protein-protein interactions the binding regions remain uncharacterized. We previously developed proteomic peptide phage display (ProP-PD), a method for simultaneous proteome-scale identification of short linear motif (SLiM)-mediated interactions and footprinting of the binding region with amino acid resolution. Here, we describe the second-generation human disorderome (HD2), an optimized ProP-PD library that tiles all disordered regions of the human proteome and allows the screening of ~1,000,000 overlapping peptides in a single binding assay. We define guidelines for how to process, filter and rank the results and provide PepTools, a toolkit for annotation and analysis of identified hits. We uncovered 2,161 interaction pairs for 35 known SLiM-binding domains and confirmed a subset of 38 interactions by biophysical or cell-based assays. Finally, we show how the amino acid resolution binding site information can be used to pinpoint functionally important disease mutations and phosphorylation events in intrinsically disordered regions of the human proteome. The HD2 ProP-PD library paired with PepTools represents a powerful pipeline for unbiased proteome-wide discovery of SLiM-based interactions.


2019 ◽  
Vol 17 (01) ◽  
pp. 1950010 ◽  
Author(s):  
Oxana V. Galzitskaya ◽  
Georgii S. Novikov ◽  
Nikita V. Dovidchenko ◽  
Mikhail Yu. Lobanov

We have analyzed codon usage for poly-Q stretches of different lengths for the human proteome. First, we have obtained that all long poly-Q stretches in Protein Data Bank (PDB) belong to the disordered regions. Second, we have found the bias for codon usage for glutamine homo-repeats in the human proteome. In the cases when the same codon is used for poly-Q stretches only CAG triplets are found. Similar results are obtained for human proteins with glutamine homo-repeats associated with diseases. Moreover, for proteins associated with diseases (from the HraDis database), the fraction of proteins for which the same codon is used for glutamine homo-repeats is less (22%) than for proteins from the human proteome (26%). We have demonstrated for poly-Q stretches in the human proteome that in some cases (28) the splicing sites correspond to the homo-repeats and in 11 cases, these sites appear at the [Formula: see text]-terminal part of the homo-repeats with statistical significance 10[Formula: see text].


2018 ◽  
Author(s):  
Edward Lau ◽  
Yu Han ◽  
Maggie P. Y. Lam

AbstractRNA sequencing has led to the discovery of many transcript isoforms created by alternative splicing, but the translational status and functional significance of most alternative splicing events remain unknown. Here we applied a splice junction-centric approach to survey the landscape of protein alternative isoform expression in the human proteome. We focused on alternative splice events where pairs of splice junctions corresponding to included and excluded exons with appreciable read counts are translated together into selective protein sequence databases. Using this approach, we constructed tissue-specific FASTA databases from ENCODE RNA sequencing data, then reanalyzed splice junction peptides in existing mass spectrometry datasets across 10 human tissues (heart, lung, liver, pancreas, ovary, testis, colon, prostate, adrenal gland, and esophagus). Our analysis reidentified 1,108 non-canonical isoforms annotated in SwissProt. We further found 253 novel splice junction peptides in 212 genes that are not documented in the comprehensive Uniprot TrEMBL or Ensembl RefSeq databases. On a proteome scale, non-canonical isoforms differ from canonical sequences preferentially at sequences with heightened protein disorder, suggesting a functional consequence of alternative splicing on the proteome is the regulation of intrinsically disordered regions. We further observed examples where isoform-specific regions intersect with important cardiac protein phosphorylation sites. Our results reveal previously unidentified protein isoforms and may avail efforts to elucidate the functions of splicing events and expand the pool of observable biomarkers in profiling studies.Acronyms and AbbreviationsA3SSalternative 3-prime splice site;A5SSalternative 5-prime splice site;FDRfalse discovery rate;IDRintrinsically disordered regions;MXEmutually exclusive exons;PSIpercent spliced in;PTCpremature termination codon;PTMpost-translational modifications;SEskipped exon;RIretained intron.


ASHA Leader ◽  
2013 ◽  
Vol 18 (3) ◽  
pp. 33-33

Discovery of Alzheimer's Molecular Pathway Reveals New Drug Targets


Sign in / Sign up

Export Citation Format

Share Document