Comparing Top-Down Proteoform Identification: Deconvolution, PrSM Overlap, and PTM Detection
Journal Article
·
· Journal of Proteome Research
- Université Paris Cite (France). Institut Pasteur; Centre National de la Recherche Scientifique (CNRS) (France)
- Univ. of Tubingen (Germany)
- Univ. of Wisconsin, Madison, WI (United States)
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States). Environmental Molecular Sciences Laboratory (EMSL)
- Univ. of Idaho, Moscow, ID (United States)
Generating top-down tandem mass spectra (MS/MS) for complex mixtures of proteoforms has become possible through improvements in fractionation, on-line separation, dissociation, and mass analysis. The algorithms to match tandem mass spectra to sequences have undergone a parallel evolution, with both spectral alignment and peak matching being paired with diverse methods for scoring proteoform-spectral matches (PrSMs). This study assesses state-of-the-art algorithms for top-down identification through three distinct challenges. The first is identifying a large yield of PrSMs while controlling false discovery rate (FDR) in identifying thousands of proteoforms from complex cell lysates via four software workflows: ProSight Proteome Discoverer, TopPIC, Informed Proteomics, and pTop. The second is the deconvolution of data from both Thermo Orbitrap-class and Bruker maXis Q-TOF instruments to produce consistent precursor charge and mass determinations while generating fragment mass lists to optimize identification. The third attempts to detect diverse post-translational modifications (PTMs) in proteoforms from cow milk and human ovarian tissue. The data demonstrate that existing software suites produce admirable sensitivity, in some cases identifying a third of collected tandem mass spectra with FDR controlled below 2%; the overlap in these PrSMs, however, illustrates real value in searching data with multiple search engines. Differences among identification workflows seem to result from each search algorithm incorporating its own deconvolution algorithm. By transmitting deconvolution data from multiple deconvolution routes (Thermo Xtract, Bruker Auto MSn, Mascot Distiller, TopFD, and FLASHDeconv) to the downstream TopPIC search algorithm, we were able to detect common causes of deconvolution disagreement. The detection of PTMs was very inconsistent among search algorithms, with some workflows suggesting as little as 1% of PrSMs from cow’s milk were singly-phosphorylated while other workflows found that 18% of PrSMs were singly-phosphorylated. Taken together, these results make a strong argument for top-down researchers to adopt a standard practice of analyzing each MS/MS experiment with at least two different search engines.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States). Environmental Molecular Sciences Laboratory (EMSL)
- Sponsoring Organization:
- European Union (EU); National Institutes of Health (NIH); USDOE Office of Science (SC), Basic Energy Sciences (BES). Scientific User Facilities (SUF); USDOE Office of Science (SC), Biological and Environmental Research (BER)
- Grant/Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1988433
- Report Number(s):
- PNNL-SA-178802
- Journal Information:
- Journal of Proteome Research, Journal Name: Journal of Proteome Research Journal Issue: 7 Vol. 22; ISSN 1535-3893
- Publisher:
- American Chemical Society (ACS)Copyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore
Profiling of Histone Post-Translational Modifications in Mouse Brain with High-Resolution Top-Down Mass Spectrometry
Journal Article
·
Fri Aug 05 00:00:00 EDT 2016
· Journal of Proteome Research
·
OSTI ID:1324898
Profiling of Histone Post-Translational Modifications in Mouse Brain with High-Resolution Top-Down Mass Spectrometry
Journal Article
·
Tue Dec 20 23:00:00 EST 2016
· Journal of Proteome Research
·
OSTI ID:1349168