Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Target–decoy false discovery rate estimation using Crema

Journal Article · · Proteomics
 [1];  [2];  [3];  [4];  [2]
  1. Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
  2. Univ. of Washington, Seattle, WA (United States)
  3. Talus Bioscience, Seattle, WA (United States)
  4. Univ. of Sydney, NSW (Australia)
Assigning statistical confidence estimates to discoveries produced by a tandem mass spectrometry proteomics experiment is critical to enabling principled interpretation of the results and assessing the cost/benefit ratio of experimental follow-up. The most common technique for computing such estimates is to use target-decoy competition (TDC), in which observed spectra are searched against a database of real (target) peptides and a database of shuffled or reversed (decoy) peptides. TDC procedures for estimating the false discovery rate (FDR) at a given score threshold have been developed for application at the level of spectra, peptides, or proteins. Although these techniques are relatively straightforward to implement, it is common in the literature to skip over the implementation details or even to make mistakes in how the TDC procedures are applied in practice. Here we present Crema, an open-source Python tool that implements several TDC methods of spectrum-, peptide- and protein-level FDR estimation. Crema is compatible with a variety of existing database search tools and provides a straightforward way to obtain robust FDR estimates.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
2370132
Report Number(s):
PNNL-SA--186220
Journal Information:
Proteomics, Journal Name: Proteomics Journal Issue: 8 Vol. 24; ISSN 1615-9853
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English

References (45)

Comet: An open-source MS/MS sequence database search tool journal December 2012
Target-Decoy Search Strategy for Mass Spectrometry-Based Proteomics book December 2009
Pyteomics—a Python Framework for Exploratory Data Analysis and Rapid Software Prototyping in Proteomics journal January 2013
A Deeper Look into Comet—Implementation and Features journal June 2015
Determining the calibration of confidence estimation procedures for unique peptides in shotgun proteomics journal March 2013
Reanalysis of ProteomicsDB Using an Accurate, Sensitive, and Scalable False Discovery Rate Estimation Approach for Protein Groups journal December 2022
Effect of cleavage enzyme, search algorithm and decoy database on mass spectrometric identification of wheat gluten proteins journal July 2011
Controlling the false discovery rate via competition: Is the +1 needed? journal June 2023
Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database Search journal October 2002
A Statistical Model for Identifying Proteins by Tandem Mass Spectrometry journal September 2003
Decoy Methods for Assessing False Positives and False Discovery Rates in Shotgun Proteomics journal December 2008
Beyond Target–Decoy Competition: Stable Validation of Peptide and Protein Identifications in Mass Spectrometry-Based Discovery Proteomics journal September 2020
mokapot: Fast and Flexible Semisupervised Learning for Peptide Detection journal February 2021
Improving Peptide-Level Mass Spectrometry Analysis via Double Competition journal September 2022
Quality Control for the Target Decoy Approach for Peptide Identification journal January 2023
Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach journal December 2016
Combining High-Resolution and Exact Calibration To Boost Statistical Power: A Well-Calibrated Score Function for High-Resolution MS2 Data journal September 2018
Pyteomics 4.0: Five Years of Development of a Python Proteomics Framework journal December 2018
Bias in False Discovery Rate Estimation in Mass-Spectrometry-Based Peptide Identification journal April 2019
ThermoRawFileParser: Modular, Scalable, and Cross-Platform RAW File Conversion journal November 2019
Repeat-Preserving Decoy Database for False Discovery Rate Estimation in Peptide Identification journal February 2020
Tailor: A Nonparametric and Rapid Score Calibration Method for Database Search-Based Peptide Identification in Shotgun Proteomics journal March 2020
Faster SEQUEST Searching for Peptide Identification from Tandem Mass Spectra journal September 2011
On Using Samples of Known Protein Content to Assess the Statistical Calibration of Scores Assigned to Peptide-Spectrum Matches in Shotgun Proteomics journal March 2011
MS Amanda, a Universal Identification Algorithm Optimized for High Accuracy Tandem Mass Spectra journal June 2014
Rapid and Accurate Peptide Identification from Tandem Mass Spectra journal May 2008
A Fast SEQUEST Cross Correlation Algorithm journal October 2008
Mass-spectrometry-based draft of the human proteome journal May 2014
MS-GF+ makes progress towards a universal database search tool for proteomics journal October 2014
Mass spectrometrists should search only for peptides they care about journal June 2015
OpenMS: a flexible open-source software platform for mass spectrometry data analysis journal August 2016
MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry–based proteomics journal April 2017
Mass spectrometrists should search for all peptides, but assess only the ones they care about journal July 2017
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry journal February 2007
Semi-supervised learning for peptide identification from shotgun proteomics datasets journal October 2007
Philosopher: a versatile toolkit for shotgun proteomics data analysis journal July 2020
Computing Exact p-values for a Cross-correlation Shotgun Proteomics Score Function journal September 2014
New mixture models for decoy-free false discovery rate estimation in mass spectrometry proteomics journal December 2020
Group-walk: a rigorous approach to group-wise false discovery rate analysis by target-decoy competition journal September 2022
UniProt: a hub for protein information journal October 2014
UniProt: a worldwide hub of protein knowledge November 2018
The PRIDE database and related tools and resources in 2019: improving support for quantification data journal November 2018
False discovery rates in spectral identification journal January 2012
Controlling the false discovery rate via knockoffs journal October 2015
Simultaneous high-probability bounds on the false discovery proportion in structured, regression and online settings journal December 2020

Similar Records

Noble-Lab/crema
Software · Sun May 21 20:00:00 EDT 2023 · OSTI ID:code-107336

Improving Peptide-Level Mass Spectrometry Analysis via Double Competition
Journal Article · Mon Sep 26 20:00:00 EDT 2022 · Journal of Proteome Research · OSTI ID:1895576

Proteome-wide identification of proteins and their modifications with decreased ambiguities and improved false discovery rates using unique sequence tags
Journal Article · Sat Mar 15 00:00:00 EDT 2008 · Analytical Chemistry, 80(6):1871-82 · OSTI ID:926932