scholarly journals Optimization of Spectral Library Size Improves DIA-MS Proteome Coverage

2020 ◽  
Author(s):  
Weigang Ge ◽  
Xiao Liang ◽  
Fangfei Zhang ◽  
Luang Xu ◽  
Nan Xiang ◽  
...  

AbstractEfficient peptide and protein identification from data-independent acquisition mass spectrometric (DIA-MS) data typically rely on an experiment-specific spectral library with a suitable size. Here, we report a computational strategy for optimizing the spectral library for a specific DIA dataset based on a comprehensive spectral library, which is accomplished by a priori analysis of the DIA dataset. This strategy achieved up to 44.7% increase in peptide identification and 38.1% increase in protein identification in the test dataset of six colorectal tumor samples compared with the comprehensive pan-human library strategy. We further applied this strategy to 389 carcinoma samples from 15 tumor datasets and observed up to 39.2% increase in peptide identification and 19.0% increase in protein identification. In summary, we present a computational strategy for spectral library size optimization to achieve deeper proteome coverage of DIA-MS data.

2020 ◽  
Author(s):  
Ronghui Lou ◽  
Pan Tang ◽  
Kang Ding ◽  
Shanshan Li ◽  
Cuiping Tian ◽  
...  

AbstractData-independent acquisition mass spectrometry (DIA-MS) is a rapidly evolving technique that enables relatively deep proteomic profiling with superior quantification reproducibility. DIA data mining predominantly relies on a spectral library of sufficient proteome coverage that, in most cases, is built on data-dependent acquisition-based analysis of the same sample. To expand the proteome coverage for a pre-determined protein family, we report herein on the construction of a hybrid spectral library that supplements a DIA experiment-derived library with a protein family-targeted virtual library predicted by deep learning. Leveraging this DIA hybrid library substantially deepens the coverage of three transmembrane protein families (G protein coupled receptors; ion channels; and transporters) in mouse brain tissues with increases in protein identification of 37-87%, and peptide identification of 58-161%. Moreover, of the 412 novel GPCR peptides exclusively identified with the DIA hybrid library strategy, 53.6% were validated as present in mouse brain tissues based on orthogonal experimental measurement.


2017 ◽  
Author(s):  
Jesse G. Meyer ◽  
Sushanth Mukkamalla ◽  
Alexandria K. D’Souza ◽  
Alexey I. Nesvizhskii ◽  
Bradford W. Gibson ◽  
...  

Label-free quantification using data-independent acquisition (DIA) is a robust method for deep and accurate proteome quantification1,2. However, when lacking a pre-existing spectral library, as is often the case with studies of novel post-translational modifications (PTMs), samples are typically analyzed several times: one or more data dependent acquisitions (DDA) are used to generate a spectral library followed by DIA for quantification. This type of multi-injection analysis results in significant cost with regard to sample consumption and instrument time for each new PTM study, and may not be possible when sample amount is limiting and/or studies require a large number of biological replicates. Recently developed software (e.g. DIA-Umpire) has enabled combined peptide identification and quantification from a data-independent acquisition without any pre-existing spectral library3,4. Still, these tools are designed for protein level quantification. Here we demonstrate a software tool and workflow that extends DIA-Umpire to allow automated identification and quantification of PTM peptides from DIA. We accomplish this using a custom, open-source graphical user interface DIA-Pipe (https://github.com/jgmeyerucsd/PIQEDia/releases/tag/v0.1.2) (figure 1a).


Author(s):  
Asad Ali Siyal ◽  
Eric Sheng-Wen Chen ◽  
Hsin-Ju Chan ◽  
Reta Birhanu Kitata ◽  
Jhih-Ci Yang ◽  
...  

2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Ying Hon Sze ◽  
Qian Zhao ◽  
Jimmy Ka Wai Cheung ◽  
King Kit Li ◽  
Dennis Yan Yin Tse ◽  
...  

AbstractThe retina is a key sensory tissue composed of multiple layers of cell populations that work coherently to process and decode visual information. Mass spectrometry-based proteomics approach has allowed high-throughput, untargeted protein identification, demonstrating the presence of these proteins in the retina and their involvement in biological signalling cascades. The comprehensive wild-type mouse retina proteome was prepared using a novel sample preparation approach, the suspension trapping (S-Trap) filter, and further fractionated with high-pH reversed phase chromatography involving a total of 28 injections. This data-dependent acquisition (DDA) approach using a Sciex TripleTOF 6600 mass spectrometer identified a total of 7,122 unique proteins (1% FDR), and generated a spectral library of 5,950 proteins in the normal C57BL/6 mouse retina. Data-independent acquisition (DIA) approach relies on a large and high-quality spectral library to analyse chromatograms, this spectral library would enable access to SWATH-MS acquisition to provide unbiased, multiplexed, and quantification of proteins in the mouse retina, acting as the most extensive reference library to investigate retinal diseases using the C57BL/6 mouse model.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Ronghui Lou ◽  
Weizhen Liu ◽  
Rongjie Li ◽  
Shanshan Li ◽  
Xuming He ◽  
...  

AbstractPhosphoproteomics integrating data-independent acquisition (DIA) enables deep phosphoproteome profiling with improved quantification reproducibility and accuracy compared to data-dependent acquisition (DDA)-based phosphoproteomics. DIA data mining heavily relies on a spectral library that in most cases is built on DDA analysis of the same sample. Construction of this project-specific DDA library impairs the analytical throughput, limits the proteome coverage, and increases the sample size for DIA phosphoproteomics. Herein we introduce a deep neural network, DeepPhospho, which conceptually differs from previous deep learning models to achieve accurate predictions of LC-MS/MS data for phosphopeptides. By leveraging in silico libraries generated by DeepPhospho, we establish a DIA workflow for phosphoproteome profiling which involves DIA data acquisition and data mining with DeepPhospho predicted libraries, thus circumventing the need of DDA library construction. Our DeepPhospho-empowered workflow substantially expands the phosphoproteome coverage while maintaining high quantification performance, which leads to the discovery of more signaling pathways and regulated kinases in an EGF signaling study than the DDA library-based approach. DeepPhospho is provided as a web server as well as an offline app to facilitate user access to model training, predictions and library generation.


2021 ◽  
Author(s):  
Wenqing Shui ◽  
Ronghui Lou ◽  
Weizhen Liu ◽  
Rongjie Li ◽  
Shanshan Li ◽  
...  

Abstract Phosphoproteomics integrating data-independent acquisition (DIA) has enabled deep phosphoproteome profiling with improved quantification reproducibility and accuracy compared to data-dependent acquisition (DDA)-based phosphoproteomics. DIA data mining heavily relies on a spectral library that in most cases is built on DDA analysis of the same sample. Construction of this project-specific DDA library impairs the analytical throughput, limits the proteome coverage, and increases the sample size for DIA phosphoproteomics. Herein we introduce a novel deep neural network, DeepPhospho, which conceptually differs from previous deep learning models to achieve accurate predictions of LC-MS/MS data for phosphopeptides. By leveraging in silico libraries generated by DeepPhospho, we established a new DIA workflow for phosphoproteome profiling which involves DIA data acquisition and data mining with DeepPhospho predicted libraries, thus circumventing the need of DDA library construction. Our DeepPhospho-empowered workflow substantially expanded the phosphoproteome coverage while maintaining high quantification performance, which led to the discovery of more signaling pathways and regulated kinases in an EGF signaling study than the DDA library-based approach. DeepPhospho is provided as a web server to facilitate user access to predictions and library generation.


2021 ◽  
Author(s):  
Jian Song ◽  
Fangfei Zhang ◽  
Changbin Yu

ABSTRACTMotivationIdentification of peptides in data-independent acquisition (DIA) mass spectrometry (MS) typically relies on the scoring for the peak groups upon extracted chromatograms of fragment ions. Expanding fragment scoring features closer to the genuine experimental spectra can improve DIA identification. Deep learning is able to predict fragment presence without understanding the fragmentation mechanism that can enrich the scoring features in DIA identification.ResultsIn this work, we developed a deep neural network-based model, Alpha-Frag, to predict the fragment ions that should be present for a given peptide by reporting their probabilities of existence. The prediction performance was evaluated in terms of intersection over union (IoU), and Alpha-Frag achieved an average of >0.7 and outperformed substantially the benchmarks across the validation datasets. Furthermore, qualitative scores based on Alpha-Frag were designed and incorporated into the peptide statistical validation tools as auxiliary scores. Our preliminary experiments show that the qualitative scores by Alpha-Frag are profitable for DIA identification, especially in the case of short gradient, and yielded an increase of 10.1%-29.3% improvements for the test dataset compared to the same scoring strategy but using Prosit.Availability and ImplementationSource code and the trained model are available at www.github.com/YuAirLab/Alpha-Frag.


PROTEOMICS ◽  
2007 ◽  
Vol 7 (5) ◽  
pp. 655-667 ◽  
Author(s):  
Henry Lam ◽  
Eric W. Deutsch ◽  
James S. Eddes ◽  
Jimmy K. Eng ◽  
Nichole King ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document