Understanding the limit of open search in the identification of peptides with post-translational modifications — A simulation-based study

Abstract. Motivation: Analyzing tandem mass spectrometry data to recognize peptides in a sample is the fundamental task in computational proteomics. Traditional peptide identification algorithms perform well when identifying unmodified peptides. However, when peptides have post-translational modifications (PTMs), these methods cannot provide satisfactory results. Recently, Chick et al. (2015) and Yu et al. (2016) proposed the spectrum-based and tag-based open search methods, respectively, to identify peptides with PTMs. While the performance of these two methods is promising, the identification results vary greatly with the quality of the tandem mass spectra and the number of PTMs in the peptides. This motivates us to systematically study the relationship between the performance of open search methods and the quality parameters of tandem mass spectrum data, as well as the number of PTMs in peptides. Results: Through large-scale simulations, we obtain the performance trend when simulated tandem mass spectra are of different quality. We propose an analytical model to describe the relationship between the probability of obtaining correct identifications and the spectrum quality as well as the number of PTMs. Based on the analytical model, we can quantitatively describe the necessary condition to effectively apply open search methods. Availability: Source code of the simulation is available at http://bioinformatics.ust.hk/[email protected] or [email protected]. Supplementary information: Supplementary data are available at Bioinformatics online.

CluMSID: an R package for similarity-based clustering of tandem mass spectra to aid feature annotation in metabolomics

Bioinformatics ◽

10.1093/bioinformatics/btz005 ◽

2019 ◽

Vol 35 (17) ◽

pp. 3196-3198 ◽

Cited By ~ 5

Author(s):

Tobias Depke ◽

Raimo Franke ◽

Mark Brönstrup

Keyword(s):

Mass Spectra ◽

Neutral Loss ◽

Metabolite Identification ◽

R Package ◽

Supplementary Information ◽

Tandem Mass ◽

Compound Identification ◽

Feature Identification ◽

Tandem Mass Spectra ◽

Interactive Visualizations

Abstract. Summary: Compound identification is one of the most eminent challenges in the untargeted analysis of complex mixtures of small molecules by mass spectrometry. Similarity of tandem mass spectra can provide valuable information on putative structural similarities between known and unknown analytes and hence aids feature identification in the bioanalytical sciences. We have developed CluMSID (Clustering of MS2 spectra for metabolite identification), an R package that enables researchers to make use of tandem mass spectra and neutral loss pattern similarities as a part of their metabolite annotation workflow. CluMSID offers functions for all analysis steps from import of raw data to data mining by unsupervised multivariate methods along with respective (interactive) visualizations. A detailed tutorial with example data is provided as supplementary information. Availability and implementation: CluMSID is available as an R package from https://github.com/tdepke/CluMSID/ and from https://bioconductor.org/packages/CluMSID/. Supplementary information: Supplementary data are available at Bioinformatics online.

Unrestricted identification of post translational modifications from tandem mass spectra datasets

2010 International Conference on Bioinformatics and Biomedical Technology ◽

10.1109/icbbt.2010.5478968 ◽

2010 ◽

Cited By ~ 1

Author(s):

Chiyong Kang ◽

Dong-Joo Kim ◽

Young-Rae Kim ◽

Gwan-Su Yi

Keyword(s):

Mass Spectra ◽

Tandem Mass ◽

Post Translational Modifications ◽

Tandem Mass Spectra

PTMSearch: A Greedy Tree Traversal Algorithm for Finding Protein Post-Translational Modifications in Tandem Mass Spectra

Machine Learning and Knowledge Discovery in Databases - Lecture Notes in Computer Science ◽

10.1007/978-3-642-23783-6_11 ◽

2011 ◽

pp. 162-176 ◽

Cited By ~ 1

Author(s):

Attila Kertész-Farkas ◽

Beáta Reiz ◽

Michael P. Myers ◽

Sándor Pongor

Keyword(s):

Mass Spectra ◽

Tandem Mass ◽

Post Translational Modifications ◽

Tandem Mass Spectra ◽

Tree Traversal ◽

Traversal Algorithm

Features-Based Deisotoping Method for Tandem Mass Spectra

Advances in Bioinformatics ◽

10.1155/2011/210805 ◽

2011 ◽

Vol 2011 ◽

pp. 1-12 ◽

Cited By ~ 7

Author(s):

Zheng Yuan ◽

Jinhong Shi ◽

Wenjun Lin ◽

Bolin Chen ◽

Fang-Xiang Wu

Keyword(s):

Mass Spectra ◽

Protein Identification ◽

Score Function ◽

Tandem Mass ◽

Isotopic Cluster ◽

Tandem Mass Spectra ◽

Fragment Ions ◽

The Relationship ◽

Cluster Graphs

For high-resolution tandem mass spectra, the determination of monoisotopic masses of fragment ions plays a key role in the subsequent peptide and protein identification. In this paper, we present a new algorithm for deisotoping bottom-up spectra. Isotopic-cluster graphs are constructed to describe the relationship between all possible isotopic clusters. Based on the relationship in isotopic-cluster graphs, each possible isotopic cluster is assessed with a score function, which is built by combining non-intensity and intensity features of fragment ions. The non-intensity features are used to prevent fragment ions with low intensity from being removed. Dynamic programming is adopted to find the highest-scoring path with the most reliable isotopic clusters. The experimental results show that the average Mascot scores and F-scores of peptides identified from spectra processed by our deisotoping method are greater than those obtained with the YADA and MS-Deconv software.

A suffix tree approach to the interpretation of tandem mass spectra: applications to peptides of non-specific digestion and post-translational modifications

Bioinformatics ◽

10.1093/bioinformatics/btg1068 ◽

2003 ◽

Vol 19 (Suppl 2) ◽

pp. ii113-ii121 ◽

Cited By ~ 12

Author(s):

B. Lu ◽

T. Chen

Keyword(s):

Mass Spectra ◽

Suffix Tree ◽

Tandem Mass ◽

Post Translational Modifications ◽

Tandem Mass Spectra ◽

Tree Approach

Peptide sequence tag generation for tandem mass spectra containing post-translational modifications

International Journal of Computational Biology and Drug Design ◽

10.1504/ijcbdd.2012.049209 ◽

2012 ◽

Vol 5 (3/4) ◽

pp. 325

Author(s):

Hui Li ◽

Chunmei Liu

Keyword(s):

Mass Spectra ◽

Peptide Sequence ◽

Tandem Mass ◽

Post Translational Modifications ◽

Tandem Mass Spectra

SILVER helps assign peptides to tandem mass spectra using intensity-based scoring

Journal of the American Society for Mass Spectrometry ◽

10.1016/s1044-0305(04)00167-9 ◽

2004 ◽

Vol 15 (6) ◽

pp. 910-912

Author(s):

F. Gibbons

Keyword(s):

Mass Spectra ◽

Tandem Mass ◽

Tandem Mass Spectra

Unrestrictive protein modification localization and quality control for open search of mass spectra

10.26434/chemrxiv.5797995 ◽

2018 ◽

Author(s):

Zhiwu An ◽

Fuzhou Gong ◽

Yan Fu

Keyword(s):

Mass Spectrometry ◽

Quality Control ◽

Tandem Mass Spectrometry ◽

Mass Spectra ◽

Protein Modification ◽

Software Tool ◽

Tandem Mass ◽

Simulation Data ◽

Post Translational Modifications

We have developed PTMiner, a first software tool for automated, confident filtering, localization and annotation of protein post-translational modifications identified by open (mass-tolerant) search of large tandem mass spectrometry datasets. The performance of the software was validated on carefully designed simulation data.
