Spatially Interacting Phosphorylation Sites and Mutations in Cancer

ABSTRACT: Advances in mass spectrometry have generated increasingly large-scale proteomics datasets containing tens of thousands of phosphorylation sites (phosphosites) that require prioritization. We develop a bioinformatics tool called HotPho and systematically discover 3D co-clustering of phosphosites and cancer mutations on protein structures. HotPho identifies 474 such hybrid clusters containing 1,255 co-clustering phosphosites, including RET p.S904/Y928, the conserved HRAS/KRAS p.Y96, and IDH1 p.Y139/IDH2 p.Y179 that are adjacent to recurrent mutations on protein structures not found by linear proximity approaches. Hybrid clusters, enriched in histone and kinase domains, frequently include expression-associated mutations experimentally shown as activating and conferring genetic dependency. Approximately 300 co-clustering phosphosites are verified in patient samples of 5 cancer types or previously implicated in cancer, including CTNNB1 p.S29/Y30, EGFR p.S720, MAPK1 p.S142, and PTPN12 p.S275. In summary, systematic 3D clustering analysis highlights nearly 3,000 likely functional mutations and over 1,000 cancer phosphosites for downstream investigation and evaluation of potential clinical relevance.
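To make the idea of 3D co-clustering concrete, the following is a minimal sketch and not HotPho's actual algorithm. It assumes each site is given as a hypothetical `(label, kind, xyz)` tuple, links any two sites whose Euclidean distance is within an arbitrary `cutoff` (in ångströms), and keeps only connected groups that contain both a mutation and a phosphosite. The function name, the cutoff value, and the toy coordinates are all illustrative assumptions.

```python
from itertools import combinations
from math import dist


def find_hybrid_clusters(sites, cutoff=10.0):
    """Group sites that are close in 3D and keep groups mixing both kinds.

    sites  : list of (label, kind, (x, y, z)); kind is "mutation" or "phosphosite"
    cutoff : assumed distance threshold for linking two sites (illustrative only)
    """
    parent = list(range(len(sites)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Link every pair of sites that lies within the cutoff distance.
    for i, j in combinations(range(len(sites)), 2):
        if dist(sites[i][2], sites[j][2]) <= cutoff:
            parent[find(i)] = find(j)

    # Collect the members of each connected group.
    groups = {}
    for i, (label, kind, _) in enumerate(sites):
        groups.setdefault(find(i), []).append((label, kind))

    # A "hybrid" cluster must contain at least one site of each kind.
    wanted = {"mutation", "phosphosite"}
    return [sorted(g) for g in groups.values() if {k for _, k in g} == wanted]


# Toy, made-up coordinates: M1 and P1 are neighbors in space, the rest are isolated.
toy_sites = [
    ("M1", "mutation", (0.0, 0.0, 0.0)),
    ("P1", "phosphosite", (6.0, 0.0, 0.0)),
    ("M2", "mutation", (40.0, 0.0, 0.0)),
    ("P2", "phosphosite", (80.0, 0.0, 0.0)),
]
print(find_hybrid_clusters(toy_sites))
```

Because linking depends only on spatial distance, two residues far apart in the sequence can still fall in one cluster, which is the contrast with linear proximity approaches that the abstract draws.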


Spatially interacting phosphorylation sites and mutations in cancer

Nature Communications | 10.1038/s41467-021-22481-w | 2021 | Vol 12 (1)

Author(s): Kuan-lin Huang, Adam D. Scott, Daniel Cui Zhou, Liang-Bo Wang, Amila Weerasinghe, ...

Keyword(s): Mass Spectrometry, Clustering Analysis, Clinical Relevance, Large Scale, Protein Structures, Phosphorylation Sites, Bioinformatics Tool, Recurrent Mutations, Cancer Mutations, Cancer Types



Multi-OMICs and Genome Editing Perspectives on Liver Cancer Signaling Networks

BioMed Research International | 10.1155/2016/6186281 | 2016 | Vol 2016 | pp. 1-14 | Cited By ~ 1

Author(s): Shengda Lin, Yi A. Yin, Xiaoqian Jiang, Nidhi Sahni, Song Yi

Keyword(s): Large Scale, Cancer Genomics, The Cancer Genome Atlas, The Past, Cancer Initiation, Cancer Mutations, Cancer Genome Atlas, Cancer Types, Generation Sequencing, Integrative Omics

The advent of the human genome sequence and the resulting ~20,000 genes provide a crucial framework for a transition from traditional biology to an integrative “OMICs” arena (Lander et al., 2001; Venter et al., 2001; Kitano, 2002). This has brought a revolution to cancer research, which now enters a big data era. In the past decade, facilitated by next-generation sequencing, there have been a huge number of large-scale sequencing efforts, such as The Cancer Genome Atlas (TCGA), the HapMap, and the 1000 Genomes Project. As a result, a deluge of genomic information has become available from patients stricken by a variety of cancer types. The list of cancer-associated genes is ever expanding. New discoveries are made on how frequent and highly penetrant mutations, such as those in the telomerase reverse transcriptase (TERT) and TP53, function in cancer initiation, progression, and metastasis. Most genes with relatively frequent but weakly penetrant cancer mutations still remain to be characterized. In addition, genes that harbor rare but highly penetrant cancer-associated mutations continue to emerge. Here, we review recent advances related to cancer genomics, proteomics, and systems biology and suggest new perspectives in targeted therapy and precision medicine.


Statistical Force-Field for Structural Modeling Using Chemical Cross-Linking/Mass Spectrometry Distance Constraints

10.26434/chemrxiv.6030563 | 2018

Author(s): Allan J. R. Ferrari, Fabio C. Gozzo, Leandro Martinez

Keyword(s): Mass Spectrometry, Amino Acid, Force Field, Protein Structures, Protein Structure Determination, Structural Modeling, Cross Linking, Amino Acid Residues, Distance Constraints, Chemical Cross Linking

Chemical cross-linking/mass spectrometry (XLMS) is an experimental method to obtain distance constraints between amino acid residues, which can be applied to structural modeling of tertiary and quaternary biomolecular structures. These constraints provide, in principle, only upper limits to the distance between amino acid residues along the surface of the biomolecule. In practice, attempts to use XLMS constraints for tertiary protein structure determination have not been widely successful. This indicates the need for specifically designed strategies for representing these constraints within modeling algorithms. Here, a force-field designed to represent XLMS-derived constraints is proposed. The potential energy functions are obtained by computing, in the database of known protein structures, the probability of satisfaction of a topological cross-linking distance as a function of the Euclidean distance between amino acid residues. The force-field can be easily incorporated into current modeling methods and software. In this work, the force-field was implemented within the Rosetta ab initio relax protocol. We show a significant improvement in the quality of the models obtained relative to current strategies for constraint representation. This force-field contributes to the long-desired goal of obtaining the tertiary structures of proteins using XLMS data. Force-field parameters and usage instructions are freely available at http://m3g.iqm.unicamp.br/topolink/xlff
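As a rough illustration of the statistic described above, the sketch below bins residue pairs by Euclidean distance and computes, per bin, the fraction whose topological (along-the-surface) distance fits within the linker. It is not the authors' implementation: the names `satisfaction_probability` and `pseudo_energy`, the `linker_reach` and `bin_width` parameters, the toy distances, and especially the conversion of probability to energy via a negative logarithm are assumptions made for illustration only.

```python
import math
from collections import defaultdict


def satisfaction_probability(pairs, linker_reach, bin_width=2.0):
    """Estimate P(cross-link is satisfiable | Euclidean distance bin).

    pairs        : iterable of (euclidean_distance, topological_distance)
    linker_reach : assumed maximum topological distance the linker can span
    bin_width    : assumed width of each Euclidean distance bin
    Returns a dict mapping the lower edge of each bin to the observed fraction.
    """
    counts = defaultdict(lambda: [0, 0])  # bin index -> [satisfied, total]
    for euclid, topo in pairs:
        b = int(euclid // bin_width)
        counts[b][1] += 1
        if topo <= linker_reach:
            counts[b][0] += 1
    return {b * bin_width: s / t for b, (s, t) in sorted(counts.items())}


def pseudo_energy(prob, floor=1e-3):
    """Turn probabilities into a penalty with -ln(p) (an assumed conversion).

    The floor avoids log(0) for bins where no pair was satisfiable.
    """
    return {d: -math.log(max(p, floor)) for d, p in prob.items()}


# Toy, made-up distances (in ångströms) standing in for a structure database.
toy_pairs = [(5.0, 6.0), (5.5, 30.0), (12.0, 40.0)]
prob = satisfaction_probability(toy_pairs, linker_reach=20.0)
energy = pseudo_energy(prob)
print(prob)
print(energy)
```

A modeling protocol could then look up the penalty for the Euclidean distance of each cross-linked pair in a candidate model, so that distances rarely compatible with a real cross-link are disfavored rather than treated as a hard upper bound.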


Algorithm of combining chromatography mass spectrometry-untargeted profiling and multivariate analysis for identification of marker-substances in samples of complex composition

Industrial laboratory. Diagnostics of materials | 10.26896/1028-6861-2020-86-7-12-19 | 2020 | Vol 86 (7) | pp. 12-19

Author(s): I. V. Plyushchenko, D. G. Shakhmatov, I. A. Rodin

Keyword(s): Mass Spectrometry, Multivariate Analysis, Large Scale, Complex Composition, Unified Protocol, Chromatography Mass Spectrometry, Marker Substances, Selection Testing, Untargeted Profiling

The rapid development of statistical data processing, computing capabilities, chromatography-mass spectrometry, and omics technologies (technologies based on the achievements of genomics, transcriptomics, proteomics, and metabolomics) in recent decades has not led to the formation of a unified protocol for untargeted profiling. Systematic errors reduce the reproducibility and reliability of the results and hinder the consolidation and analysis of data gained in large-scale multi-day experiments. We propose an algorithm for conducting omics profiling to identify potential markers in samples of complex composition and present a case study of urine samples obtained from different clinical groups of patients. Profiling was carried out by liquid chromatography-mass spectrometry. The markers were selected using methods of multivariate analysis, including machine learning and feature selection. The approach was tested on an independent dataset by clustering and projection onto principal components.


Phosphorylation sites of mouse gap junction protein 31 were determined by mass spectrometry

Modern Analytical Chemistry Research | 10.35534/macr.0101004c | 2019 | Vol 1 (1) | pp. 18-27

Author(s): Ma Junjie

Keyword(s): Mass Spectrometry, Gap Junction, Phosphorylation Sites, Junction Protein, Gap Junction Protein


Automated 16-Plex Plasma Proteomics with Real-Time Search and Ion Mobility Mass Spectrometry Enables Large-Scale Profiling in Naked Mole-Rats and Mice

Journal of Proteome Research | 10.1021/acs.jproteome.0c00681 | 2021 | Vol 20 (2) | pp. 1280-1295

Author(s): Aleksandr Gaun, Kaitlyn N. Lewis Hardell, Niclas Olsson, Jonathon J. O’Brien, Sudha Gollapudi, ...

Keyword(s): Mass Spectrometry, Real Time, Ion Mobility, Large Scale, Ion Mobility Mass Spectrometry, Plasma Proteomics, Rats And Mice


Effect of charge derivatization in the determination of phosphorylation sites in peptides by electrospray ionization collision-activated dissociation tandem mass spectrometry

Journal of Mass Spectrometry | 10.1002/(sici)1096-9888(199912)34:12<1279::aid-jms899>3.0.co;2-9 | 1999 | Vol 34 (12) | pp. 1279-1282 | Cited By ~ 5

Author(s): Nalini Sadagopan, Michael Malone, J. Throck Watson

Keyword(s): Mass Spectrometry, Tandem Mass Spectrometry, Electrospray Ionization, Tandem Mass, Phosphorylation Sites


Cross‐Linking/Mass Spectrometry for Studying Protein Structures and Protein–Protein Interactions: Where Are We Now and Where Should We Go from Here?

Angewandte Chemie International Edition | 10.1002/anie.201709559 | 2018 | Vol 57 (22) | pp. 6390-6396 | Cited By ~ 82

Author(s): Andrea Sinz

Keyword(s): Mass Spectrometry, Protein Interactions, Protein Structures, Cross Linking, Protein Protein Interactions


Development of an ion-pair liquid chromatography–high resolution mass spectrometry method for determination of organophosphate pesticide metabolites in large-scale biomonitoring studies

Journal of Chromatography A | 10.1016/j.chroma.2016.05.067 | 2016 | Vol 1454 | pp. 32-41 | Cited By ~ 15

Author(s): Enrique Cequier, Amrit Kaur Sakhi, Line Småstuen Haug, Cathrine Thomsen

Keyword(s): Mass Spectrometry, Liquid Chromatography, Large Scale, High Resolution Mass Spectrometry, Ion Pair, Organophosphate Pesticide, Spectrometry Method, Pesticide Metabolites, Resolution Mass


The proteomic future: where mass spectrometry should be taking us

Biochemical Journal | 10.1042/bj20110363 | 2012 | Vol 444 (2) | pp. 169-181 | Cited By ~ 52

Author(s): Jay J. Thelen, Ján A. Miernyk

Keyword(s): Mass Spectrometry, Gel Electrophoresis, Protein Structures, Research Area, Mass Accuracy, Two Dimensional, Two Dimensional Gel Electrophoresis, Post Translational Modifications, Peptide Separation, Research Areas

A newcomer to the -omics era, proteomics, is a broad instrument-intensive research area that has advanced rapidly since its inception less than 20 years ago. Although the ‘wet-bench’ aspects of proteomics have undergone a renaissance with the improvement in protein and peptide separation techniques, including various improvements in two-dimensional gel electrophoresis and gel-free or off-gel protein focusing, it has been the seminal advances in MS that have led to the ascension of this field. Recent improvements in sensitivity, mass accuracy and fragmentation have led to achievements previously only dreamed of, including whole-proteome identification, and quantification and extensive mapping of specific PTMs (post-translational modifications). With such capabilities at present, one might conclude that proteomics has already reached its zenith; however, ‘capability’ indicates that the envisioned goals have not yet been achieved. In the present review we focus on what we perceive as the areas requiring more attention to achieve the improvements in workflow and instrumentation that will bridge the gap between capability and achievement for at least most proteomes and PTMs. Additionally, it is essential that we extend our ability to understand protein structures, interactions and localizations. Towards these ends, we briefly focus on selected methods and research areas where we anticipate the next wave of proteomic advances.
