Informed-Proteomics: open-source software package for top-down proteomics
Abstract
Top-down proteomics involves the analysis of intact proteins. This approach is very attractive as it allows for analyzing proteins in their endogenous form without proteolysis, preserving valuable information about post-translational modifications, isoforms, proteolytic processing, or their combinations, collectively called proteoforms. Moreover, the quality of top-down LC-MS/MS datasets is rapidly increasing due to advances in liquid chromatography and mass spectrometry instrumentation and in sample processing protocols. However, top-down mass spectra are substantially more complex compared to the more conventional bottom-up data. To take full advantage of the increasing quality of top-down LC-MS/MS datasets, there is an urgent need to develop algorithms and software tools for confident proteoform identification and quantification. In this study we present a new open-source software suite for top-down proteomics analysis consisting of an LC-MS feature finding algorithm, a database search algorithm, and an interactive results viewer. The presented tool, along with several other popular tools, was evaluated using human-in-mouse xenograft luminal and basal breast tumor samples that are known to have significant differences in protein abundance based on bottom-up analysis.
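- Authors:
- Park, Jungkap; Piehowski, Paul D.; Wilkins, Christopher; Zhou, Mowei; Mendoza, Joshua; Fujimoto, Grant M.; Gibbons, Bryson C.; Shaw, Jared B.; Shen, Yufeng; Shukla, Anil K.; Moore, Ronald J.; Liu, Tao; Petyuk, Vladislav A.; Tolić, Nikola; Paša-Tolić, Ljiljana; Smith, Richard D.; Payne, Samuel H.; Kim, Sangtae
- Publication Date:
- 2017-08-07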
- Research Org.:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States). Environmental Molecular Sciences Lab. (EMSL)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1398206
- Report Number(s):
- PNNL-SA-120171
Journal ID: ISSN 1548-7091; 48135; 49670; 47418; 48680; 453040220
- DOE Contract Number:
- AC05-76RL01830
- Resource Type:
- Journal Article
- Journal Name:
- Nature Methods
- Additional Journal Information:
- Journal Volume: 14; Journal Issue: 9; Journal ID: ISSN 1548-7091
- Publisher:
- Nature Publishing Group
- Country of Publication:
- United States
- Language:
- English
- Subject:
- Environmental Molecular Sciences Laboratory
Citation Formats
Park, Jungkap, Piehowski, Paul D., Wilkins, Christopher, Zhou, Mowei, Mendoza, Joshua, Fujimoto, Grant M., Gibbons, Bryson C., Shaw, Jared B., Shen, Yufeng, Shukla, Anil K., Moore, Ronald J., Liu, Tao, Petyuk, Vladislav A., Tolić, Nikola, Paša-Tolić, Ljiljana, Smith, Richard D., Payne, Samuel H., and Kim, Sangtae. Informed-Proteomics: open-source software package for top-down proteomics. United States: N. p., 2017.
Web. doi:10.1038/nmeth.4388.
Park, Jungkap, Piehowski, Paul D., Wilkins, Christopher, Zhou, Mowei, Mendoza, Joshua, Fujimoto, Grant M., Gibbons, Bryson C., Shaw, Jared B., Shen, Yufeng, Shukla, Anil K., Moore, Ronald J., Liu, Tao, Petyuk, Vladislav A., Tolić, Nikola, Paša-Tolić, Ljiljana, Smith, Richard D., Payne, Samuel H., & Kim, Sangtae. Informed-Proteomics: open-source software package for top-down proteomics. United States. https://doi.org/10.1038/nmeth.4388
Park, Jungkap, Piehowski, Paul D., Wilkins, Christopher, Zhou, Mowei, Mendoza, Joshua, Fujimoto, Grant M., Gibbons, Bryson C., Shaw, Jared B., Shen, Yufeng, Shukla, Anil K., Moore, Ronald J., Liu, Tao, Petyuk, Vladislav A., Tolić, Nikola, Paša-Tolić, Ljiljana, Smith, Richard D., Payne, Samuel H., and Kim, Sangtae. 2017.
"Informed-Proteomics: open-source software package for top-down proteomics". United States. https://doi.org/10.1038/nmeth.4388.
@article{osti_1398206,
title = {Informed-Proteomics: open-source software package for top-down proteomics},
author = {Park, Jungkap and Piehowski, Paul D. and Wilkins, Christopher and Zhou, Mowei and Mendoza, Joshua and Fujimoto, Grant M. and Gibbons, Bryson C. and Shaw, Jared B. and Shen, Yufeng and Shukla, Anil K. and Moore, Ronald J. and Liu, Tao and Petyuk, Vladislav A. and Tolić, Nikola and Paša-Tolić, Ljiljana and Smith, Richard D. and Payne, Samuel H. and Kim, Sangtae},
abstractNote = {Top-down proteomics involves the analysis of intact proteins. This approach is very attractive as it allows for analyzing proteins in their endogenous form without proteolysis, preserving valuable information about post-translational modifications, isoforms, proteolytic processing, or their combinations, collectively called proteoforms. Moreover, the quality of top-down LC-MS/MS datasets is rapidly increasing due to advances in liquid chromatography and mass spectrometry instrumentation and in sample processing protocols. However, top-down mass spectra are substantially more complex compared to the more conventional bottom-up data. To take full advantage of the increasing quality of top-down LC-MS/MS datasets, there is an urgent need to develop algorithms and software tools for confident proteoform identification and quantification. In this study we present a new open-source software suite for top-down proteomics analysis consisting of an LC-MS feature finding algorithm, a database search algorithm, and an interactive results viewer. The presented tool, along with several other popular tools, was evaluated using human-in-mouse xenograft luminal and basal breast tumor samples that are known to have significant differences in protein abundance based on bottom-up analysis.},
doi = {10.1038/nmeth.4388},
url = {https://www.osti.gov/biblio/1398206},
journal = {Nature Methods},
issn = {1548-7091},
number = 9,
volume = 14,
place = {United States},
year = {2017},
month = {8}
}