AbstractRecent advances in liquid chromatography/mass spectrometry (LC/MS) technology have notably improved the sensitivity, resolution, and speed of proteome analysis, resulting in increasing demand for more sophisticated algorithms to interpret complex mass spectrograms. Here, we propose a novel statistical method that we call proteomic mass spectrogram decomposition (ProtMSD) for joint identification and quantification of peptides and proteins. Given the proteomic mass spectrogram and the reference mass spectra of all possible peptide ions associated with proteins as a dictionary, our method directly estimates the temporal intensity curves of those peptide ions, i.e., the chromatograms, under a group sparsity constraint without using the conventional careful pre-processing (e.g., thresholding and peak picking). We show that the accuracy of protein identification was significantly improved by using the protein-peptide hierarchical relationships, the isotopic distribution profiles and predicted retention times of peptide ions and the pre-learned mass spectra of noise. In the analysis of E. coli cell lysate, our ProtMSD showed excellent agreement (3277 peptide ions (94.79%) and 493 proteins (98.21%)) with the conventional cascading approach to identification and quantification based on Mascot and Skyline. This is the first attempt to use a matrix decomposition technique as a tool for LC/MS-based joint proteome identification and quantification.