skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Reinspection of a Clinical Proteomics Tumor Analysis Consortium (CPTAC) Dataset with Cloud Computing Reveals Abundant Post-Translational Modifications and Protein Sequence Variants

Journal Article · · Cancers (Basel)
 [1];  [2];  [1];  [3]; ORCiD logo [4];  [5];  [6];  [7];  [7];  [8]; ORCiD logo [9]; ORCiD logo [9]; ORCiD logo [10]; ORCiD logo [10];  [11];  [12]; ORCiD logo [12]; ORCiD logo [13]; ORCiD logo [14];  [14] more »; ORCiD logo [14];  [15]; ORCiD logo [15];  [16];  [7] « less
  1. Optys Tech Corporation, Shrewsbury, MA (United States)
  2. McGill University Health Center, Montreal (Canada)
  3. Aptagen LLC, Philadelphia, PA (United States)
  4. Leiden University Medical Center (The Netherlands)
  5. Northwestern University, Chicago, IL (United States)
  6. Thermo Fisher Scientific, Grimes, IA (United States)
  7. Johns Hopkins University, Baltimore, MD (United States)
  8. Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
  9. Monash University, Melbourne (Australia)
  10. Indian Institute of Technology, Mumbai (India)
  11. University of Texas Southwestern Medical Center, Dallas, TX (United States); Eastern Virginia Medical School, Norfolk, VA (United States)
  12. Eastern Virginia Medical School, Norfolk, VA (United States)
  13. University of Toronto (Canada)
  14. Mayo Clinic, Rochester, MN (United States)
  15. University of Virginia, Charlottesville, VA (United States)
  16. Deurion LLC, Ellicott City, MD (United States)

The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has provided some of the most in-depth analyses of the phenotypes of human tumors ever constructed. Today, the majority of proteomic data analysis is still performed using software housed on desktop computers which limits the number of sequence variants and post-translational modifications that can be considered. The original CPTAC studies limited the search for PTMs to only samples that were chemically enriched for those modified peptides. Similarly, the only sequence variants considered were those with strong evidence at the exon or transcript level. In this multi-institutional collaborative reanalysis, we utilized unbiased protein databases containing millions of human sequence variants in conjunction with hundreds of common post-translational modifications. Using these tools, we identified tens of thousands of high-confidence PTMs and sequence variants. We identified 4132 phosphorylated peptides in nonenriched samples, 93% of which were confirmed in the samples which were chemically enriched for phosphopeptides. In addition, our results also cover 90% of the high-confidence variants reported by the original proteogenomics study, without the need for sample specific next-generation sequencing. Finally, we report fivefold more somatic and germline variants that have an independent evidence at the peptide level, including mutations in ERRB2 and BCAS1. In this reanalysis of CPTAC proteomic data with cloud computing, we present an openly available and searchable web resource of the highest-coverage proteomic profiling of human tumors described to date.

Research Organization:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
National Cancer Institute (NCI)
Grant/Contract Number:
89233218CNA000001
OSTI ID:
1829657
Report Number(s):
LA-UR-21-25931
Journal Information:
Cancers (Basel), Vol. 13, Issue 20; ISSN 2072-6694
Publisher:
MDPICopyright Statement
Country of Publication:
United States
Language:
English

References (34)

Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing journal September 2021
Proteogenomic Analysis of Human Colon Cancer Reveals New Therapeutic Opportunities journal May 2019
Comparative Analysis of Proteome and Transcriptome Variation in Mouse journal June 2011
Bolt: a New Age Peptide Search Engine for Comprehensive MS/MS Sequencing Through Vast Protein Databases in Minutes journal August 2019
An Optimized Shotgun Strategy for the Rapid Generation of Comprehensive Human Proteomes journal June 2017
Proteogenomics connects somatic mutations to signalling in breast cancer journal May 2016
State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing journal December 2020
Proteomic maps of breast cancer subtypes journal January 2016
High-throughput mass spectrometry and bioinformatics analysis of breast cancer proteomic data journal August 2019
Integrated proteotranscriptomics of breast cancer reveals globally increased protein-mRNA concordance associated with subtypes and survival journal December 2018
The Proteomic Landscape of Triple-Negative Breast Cancer journal April 2015
The Art of Validating Quantitative Proteomics Data journal November 2018
A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer journal May 2005
Online Parallel Accumulation–Serial Fragmentation (PASEF) with a Novel Trapped Ion Mobility Mass Spectrometer journal November 2018
Gene expression profiling of 49 human tumor xenografts from in vitro culture through multiple in vivo passages - strategies for data mining in support of therapeutic studies journal January 2014
Global Proteome Analysis of the NCI-60 Cell Line Panel journal August 2013
Small Molecule KRAS Inhibitors: The Future for Targeted Pancreatic Cancer Therapy? journal May 2020
The premise of personalized immunotherapy for cancer dormancy journal April 2020
Detection and verification of 2.3 million cancer mutations in NCI60 cancer cell lines with a cloud search engine journal October 2019
Proteogenomic Landscape of Breast Cancer Tumorigenesis and Targeted Therapy journal November 2020
Evolution of Orbitrap Mass Spectrometry Instrumentation journal July 2015
PANOPLY: a cloud-based platform for automated and reproducible proteogenomic data analysis journal May 2021
Large-Scale Interlaboratory Study to Develop, Analytically Validate and Apply Highly Multiplexed, Quantitative Peptide Assays to Measure Cancer-Relevant Proteins in Plasma journal February 2015
A Compact Quadrupole-Orbitrap Mass Spectrometer with FAIMS Interface Improves Proteome Coverage in Short LC Gradients journal February 2020
Interactive Peptide Spectral Annotator: A Versatile Web-based Tool for Proteomic Applications journal August 2019
Proteogenomic integration reveals therapeutic targets in breast cancer xenografts journal March 2017
Single-cell proteomic and transcriptomic analysis of macrophage heterogeneity using SCoPE2 journal January 2021
Plasma Proteome Profiling to Assess Human Health and Disease journal March 2016
XMAn v2—a database of Homo sapiens mutated peptides journal September 2019
Rapid genotype imputation from sequence with reference panels journal June 2021
Rapid non-uniform adaptation to conformation-specific KRAS(G12C) inhibition journal January 2020
Targeting the MAPK Pathway in KRAS-Driven Tumors journal April 2020
A Novel LC System Embeds Analytes in Pre-formed Gradients for Rapid, Ultra-robust Proteomics journal November 2018
Characterization of peptides bound to the class I MHC molecule HLA-A2.1 by mass spectrometry journal March 1992