Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Flying Blind, or Just Flying Under the Radar? The Underappreciated Power of De Novo Methods of Mass Spectrometric Peptide Identification

Journal Article · · Protein Science
DOI:https://doi.org/10.1002/pro.3919· OSTI ID:1668255

Mass spectrometry-based proteomics is a popular and powerful method for precise and highly multiplexed protein identification. The most common method of analyzing untargeted proteomics data is called database searching, where the database is simply a collection of protein sequences from the target organism, derived from genome sequencing. Experimental peptide tandem mass spectra are compared to simplified models of theoretical spectra calculated from the translated genomic sequences. However, in several interesting application areas, such as forensics, archaeology, venomics and others, a genome sequence may not be available, or the correct genome sequence to use is not known. In these cases, de novo peptide identification can play an important role. De novo peptide identification infers peptide sequence directly from the tandem mass spectrum without reference to a sequence database, usually using graph-based or machine learning algorithms. In this review, we provide a basic overview of de novo peptide identification methods and applications, briefly covering de novo algorithms and tools, and focusing in more depth on recent applications from venomics, metaproteomics, forensics, and characterization of antibody drugs.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1668255
Report Number(s):
PNNL-SA-153810
Journal Information:
Protein Science, Journal Name: Protein Science Journal Issue: 9 Vol. 29
Country of Publication:
United States
Language:
English

References (71)

Anatomy and evolution of database search engines—a central component of mass spectrometry based proteomic workflows journal May 2020
Mass-spectrometric exploration of proteome structure and function journal September 2016
Mass spectrometry-based proteomics journal March 2003
UVnovo: A de Novo Sequencing Algorithm Using Single Series of Fragment Ions via Chromophore Tagging and 351 nm Ultraviolet Photodissociation Mass Spectrometry journal March 2016
Spectral library searching in proteomics journal February 2016
A multi-protease, multi-dissociation, bottom-up-to-top-down proteomic view of the Loxosceles intermedia venom journal July 2017
Sequencing-Grade De novo Analysis of MS/MS Triplets (CID/HCD/ETD) From Overlapping Peptides journal May 2013
pNovo+: De Novo Peptide Sequencing Using Complementary HCD and ETD Tandem Mass Spectra journal December 2012
GutenTag:  High-Throughput Sequence Tagging via an Empirically Derived Fragmentation Model journal December 2003
DeNovoGUI: An Open Source Graphical User Interface for de Novo Sequencing of Tandem Mass Spectra journal December 2013
Future perspectives of therapeutic monoclonal antibodies journal February 2019
Proteomics in Forensic Sciences: Identification of the Nature of the Last Meal at Autopsy journal May 2018
Challenges and perspectives of metaproteomic data analysis journal November 2017
Automated Antibody De Novo Sequencing and Its Utility in Biopharmaceutical Discovery journal January 2017
Protein mass spectrometry detects multiple bloodmeals for enhanced Chagas disease vector ecology journal October 2019
De Novo Peptide Sequencing and Identification with Precision Mass Spectrometry journal January 2007
Precision De Novo Peptide Sequencing Using Mirror Proteases of Ac-LysargiNase and Trypsin for Large-scale Proteomics journal January 2019
Seeing beyond the tip of the iceberg: A deep analysis of the venome of the Brazilian Rattlesnake, Crotalus durissus terrificus journal September 2015
Error-Tolerant Identification of Peptides in Sequence Databases by Peptide Sequence Tags journal December 1994
Applications and challenges of forensic proteomics journal April 2019
Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination journal March 2017
Cancer Proteomics and the Elusive Diagnostic Biomarkers journal August 2019
pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework journal July 2019
An Alignment-Free “Metapeptide” Strategy for Metaproteomic Characterization of Microbiome Samples Using Shotgun Metagenomic Sequencing journal July 2016
De novo peptide sequencing by deep learning journal July 2017
Probabilistic Limit of Detection for Ricin Identification Using a Shotgun Proteomics Assay journal September 2019
Novor: Real-Time Peptide de Novo Sequencing Software journal June 2015
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database journal November 1994
Comprehensive de Novo Peptide Sequencing from MS/MS Pairs Generated through Complementary Collision Induced Dissociation and 351 nm Ultraviolet Photodissociation journal March 2017
PEAKS: powerful software for peptidede novo sequencing by tandem mass spectrometry journal January 2003
A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics journal October 2010
Target-Decoy Search Strategy for Mass Spectrometry-Based Proteomics book December 2009
Charting the Proteomes of Organisms with Unsequenced Genomes by MALDI-Quadrupole Time-of-Flight Mass Spectrometry and BLAST Homology Searching journal May 2001
PepExplorer: A Similarity-driven Tool for Analyzing de Novo Sequencing Results journal May 2014
2018 YPIC Challenge: A Case Study in Characterizing an Unknown Protein Sample journal September 2019
De novo peptide sequencing by tandem MS using complementary CID and electron transfer dissociation journal November 2009
TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets journal April 2019
Automated de novo protein sequencing of monoclonal antibodies journal December 2008
Complete De Novo Assembly of Monoclonal Antibody Sequences journal August 2016
A Potential Golden Age to Come-Current Tools, Recent Use Cases, and Future Avenues for De Novo Sequencing in Proteomics journal August 2018
Navigating through metaproteomics data: A logbook of database searching journal April 2015
pDeep: Predicting MS/MS Spectra of Peptides with Deep Learning journal November 2017
PepNovo:  De Novo Peptide Sequencing via Probabilistic Network Modeling journal February 2005
Fragmentation pathways of protonated peptides journal January 2005
Typing and Characterization of Bacteria Using Bottom-up Tandem Mass Spectrometry Proteomics journal April 2017
A Face in the Crowd: Recognizing Peptides Through Database Search journal August 2011
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry journal February 2007
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification? journal March 2017
De Novo Sequencing and Homology Searching journal November 2011
Quantitative affinity purification mass spectrometry: a versatile technology to study protein–protein interactions journal July 2015
Lessons in de novo peptide sequencing by tandem mass spectrometry: LESSONS IN DE NOVO PEPTIDE SEQUENCING journal October 2013
Application of de Novo Sequencing to Large-Scale Complex Proteomics Data Sets journal January 2016
Assessing Protein Sequence Database Suitability Using De Novo Sequencing journal November 2019
Proteomics Goes to Court: A Statistical Foundation for Forensic Toxin/Organism Identification Using Bottom-Up Proteomics journal July 2018
PEAKS DB: De Novo Sequencing Assisted Database Search for Sensitive and Accurate Peptide Identification journal December 2011
Probability-based protein identification by searching sequence databases using mass spectrometry data journal December 1999
Semi-supervised learning for peptide identification from shotgun proteomics datasets journal October 2007
High-throughput metaproteomics data analysis with Unipept: A tutorial journal January 2018
MetaPro-IQ: a universal metaproteomic approach to studying human and mouse gut microbiota journal June 2016
Multi-Enzymatic Limited Digestion: The Next-Generation Sequencing for Proteomics? journal April 2019
Unipept: Tryptic Peptide-Based Biodiversity Analysis of Metaproteome Samples journal October 2012
An effective algorithm for peptide de novo sequencing from MS/MS spectra journal May 2005
DirecTag: Accurate Sequence Tags from Peptide MS/MS through Statistical Scoring journal September 2008
PeptideMapper: efficient and versatile amino acid sequence and tag mapping journal March 2017
Uncovering Thousands of New Peptides with Sequence-Mask-Search Hybrid De Novo Peptide Sequencing Framework journal October 2019
Postnovo: Postprocessing Enables Accurate and FDR-Controlled de Novo Peptide Sequencing journal September 2018
A two-step database search method improves sensitivity in peptide sequence matches for metaproteomics and proteogenomics studies journal March 2013
Proteomics evidence for kefir dairy in Early Bronze Age China journal May 2014
De Novo Peptide Sequencing via Tandem Mass Spectrometry journal October 1999
Mass spectrometry-based proteomics for the forensic identification of vomit traces journal October 2019
Ricin-like proteins from the castor plant do not influence liquid chromatography-mass spectrometry detection of ricin in forensically relevant samples journal December 2017

Similar Records

Exhaustive Database Searching for Amino Acid Mutations in Proteomes
Journal Article · Sat Dec 31 23:00:00 EST 2011 · Bioinformatics · OSTI ID:1048717

A high-throughput de novo sequencing approach for shotgun proteomics using high-resolution tandem mass spectrometry
Journal Article · Thu Dec 31 23:00:00 EST 2009 · BMC Bioinformatics · OSTI ID:1024242