skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: TopPICR: A Companion R Package for Top-Down Proteomics Data Analysis

Journal Article · · Journal of Proteome Research

Top-down proteomics is the analysis of proteins in their intact form without proteolysis, thus preserving valuable information about post-translational modifications, isoforms, and proteolytic processing. However, it is still a developing field due to limitations in the instrumentation, difficulties with interpretation of complex mass spectra, and a lack of well-established quantification approaches. TopPIC is one of the popular tools for proteoform identification. Here we extended its capabilities into label-free proteoform quantification by developing a companion R package (TopPICR). Key steps in the TopPICR pipeline include filtering identifications, inferring a minimal set of protein accessions explaining the observed sequences, aligning retention times, recalibrating measured masses, clustering features across datasets, and finally compiling feature intensities using the match-between-runs approach. The output of the pipeline is an MSnSet object which makes downstream data analysis seamlessly compatible with packages from the Bioconductor project. It also provides the capability for visualizing proteoforms within the context of the parent protein sequence. The functionality of TopPICR is demonstrated on top-down LC-MS/MS datasets of 10 human-in-mouse xenografts of luminal and basal breast tumor samples.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE; National Institutes of Health (NIH)
Grant/Contract Number:
AC05-76RL01830; U01 AG061356
OSTI ID:
1995421
Report Number(s):
PNNL-SA-177756
Journal Information:
Journal of Proteome Research, Vol. 22, Issue 2; ISSN 1535-3893
Publisher:
American Chemical Society (ACS)Copyright Statement
Country of Publication:
United States
Language:
English

References (66)

Protamine Characterization by Top-Down Proteomics: Boosting Proteoform Identification with DBSCAN journal April 2021
TopPIC: a software tool for top-down mass spectrometry-based proteoform identification and characterization journal July 2016
Best practices and benchmarks for intact protein analysis for top-down mass spectrometry journal June 2019
Studies on the Mechanism of Oxidative Modification of Human Glyceraldehyde-3-phosphate Dehydrogenase by Glutathione: Catalysis by Glutaredoxin journal June 1998
dbscan : Fast Density-Based Clustering with R journal January 2019
ProForma: A Standard Proteoform Notation journal February 2018
Orchestrating high-throughput genomic analysis with Bioconductor journal January 2015
Protection of Extraribosomal RPL13a by GAPDH and Dysregulation by S-Nitrosylation journal August 2012
DAnTE: a statistical tool for quantitative analysis of -omics data journal May 2008
Multidimensional Top-Down Proteomics of Brain-Region-Specific Mouse Brain Proteoforms Responsive to Cocaine and Estradiol journal September 2019
Integrated Bottom-Up and Top-Down Proteomics of Patient-Derived Breast Tumor Xenografts journal October 2015
MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis journal January 2013
PEPPI-MS: Polyacrylamide-Gel-Based Prefractionation for Analysis of Intact Proteoforms and Protein Complexes by Mass Spectrometry journal June 2020
A five-level classification system for proteoform identifications journal August 2019
Large-Scale Qualitative and Quantitative Top-Down Proteomics Using Capillary Zone Electrophoresis-Electrospray Ionization-Tandem Mass Spectrometry with Nanograms of Proteome Samples journal April 2019
S-glutathionylation of glyceraldehyde-3-phosphate dehydrogenase induces formation of C150-C154 intrasubunit disulfide bond in the active site of the enzyme journal December 2017
MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry–based proteomics journal April 2017
Inactivation of Glyceraldehyde-3-Phosphate Dehydrogenase by Fumarate in Diabetes journal January 2008
High-Throughput Proteomics journal June 2014
LC-MS alignment in theory and practice: a comprehensive algorithmic review journal November 2013
The abc's (and xyz's) of peptide sequencing journal September 2004
Clustering by Passing Messages Between Data Points journal February 2007
DtaRefinery, a Software Tool for Elimination of Systematic Errors from Parent Ion Mass Measurements in Tandem Mass Spectra Data Sets journal December 2009
A robust two-dimensional separation for top-down tandem mass spectrometry of the low-mass proteome journal December 2009
iPQF: a new peptide-to-protein summarization method using peptide spectra characteristics to improve protein quantification journal November 2015
ProSight PTM: an integrated environment for protein identification and characterization by top-down mass spectrometry journal July 2004
Endocrine-Therapy-Resistant ESR1 Variants Revealed by Genomic Characterization of Breast-Cancer-Derived Xenografts journal September 2013
S-glutathionylation of human glyceraldehyde-3-phosphate dehydrogenase and possible role of Cys152-Cys156 disulfide bridge in the active site of the protein journal June 2020
The C-Score: A Bayesian Framework to Sharply Improve Proteoform Scoring in High-Throughput Top Down Proteomics journal June 2014
Increasing the Separation Capacity of Intact Histone Proteoforms Chromatography Coupling Online Weak Cation Exchange-HILIC to Reversed Phase LC UVPD-HRMS journal September 2018
A Top-Down Proteomics Platform Coupling Serial Size Exclusion Chromatography and Fourier Transform Ion Cyclotron Resonance Mass Spectrometry journal February 2019
Enhancing Top-Down Proteomics of Brain Tissue with FAIMS journal April 2021
MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification journal November 2008
Proteoform: a single term describing protein complexity journal February 2013
Informed-Proteomics: open-source software package for top-down proteomics journal August 2017
Elimination of Systematic Mass Measurement Errors in Liquid Chromatography−Mass Spectrometry Based Proteomics Using Regression Models and a Priori Partial Knowledge of the Sample Content journal December 2007
The Blood Proteoform Atlas: A reference map of proteoforms in human hematopoietic cells journal January 2022
Reproducibility of Differential Proteomic Technologies in CPTAC Fractionated Xenografts journal December 2015
Target-Selective Protein S-Nitrosylation by Sequence Motif Recognition journal October 2014
Sensitive Top-Down Proteomics Analysis of a Low Number of Mammalian Cells Using a Nanodroplet Sample Processing Platform journal May 2020
Reviews on Determining the Number of Clusters journal July 2016
Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics? journal February 2022
MSnbase-an R/Bioconductor package for isobaric tagged mass spectrometry data visualization, processing and quantitation journal November 2011
Capillary zone electrophoresis-mass spectrometry for top-down proteomics journal November 2019
Top-Down Proteomics: Ready for Prime Time? journal November 2017
Improved Identification of Proteoforms in Top-Down Proteomics Using FAIMS with Internal CV Stepping journal February 2022
Proteomic Parsimony through Bipartite Graph Analysis Improves Accuracy and Transparency journal September 2007
Robustness and Ruggedness of Isoelectric Focusing and Superficially Porous Liquid Chromatography with Fourier Transform Mass Spectrometry journal December 2020
Interlaboratory Study for Characterizing Monoclonal Antibodies by Top-Down and Middle-Down Mass Spectrometry journal July 2020
Top-Down Proteomics Enables Comparative Analysis of Brain Proteoforms Between Mouse Strains journal February 2018
Integrated Post-Experiment Monoisotopic Mass Refinement: An Integrated Approach to Accurately Assign Monoisotopic Precursor Masses to Tandem Mass Spectrometric Data journal September 2010
Target-Decoy Search Strategy for Mass Spectrometry-Based Proteomics book December 2009
The Human Proteoform Project: Defining the human proteome journal November 2021
Protein Identification Using Top-Down Spectra journal October 2011
Label-Free Protein Quantification for Plant Golgi Protein Localization and Abundance journal August 2014
Using size exclusion chromatography-RPLC and RPLC-CIEF as two-dimensional separation strategies for protein profiling journal July 2006
Beyond mass spectrometry, the next step in proteomics journal January 2020
Succination of Proteins by Fumarate journal April 2008
Proteomic Profiling of Intact Proteins Using WAX-RPLC 2-D Separations and FTICR Mass Spectrometry journal February 2007
APCluster: an R package for affinity propagation clustering journal July 2011
Deep-Learning-Derived Evaluation Metrics Enable Effective Benchmarking of Computational Tools for Phosphopeptide Identification journal January 2021
The MaxQuant computational platform for mass spectrometry-based shotgun proteomics journal October 2016
Interpretation of Shotgun Proteomic Data: The Protein Inference Problem journal July 2005
Novel Strategies to Address the Challenges in Top-Down Proteomics journal May 2021
MSnbase, Efficient and Elegant R-Based Processing and Visualization of Raw Mass Spectrometry Data journal September 2020
Deeper Protein Identification Using Field Asymmetric Ion Mobility Spectrometry in Top-Down Proteomics journal April 2021