U.S. Department of Energy
Office of Scientific and Technical Information

Metabolite discovery through global annotation of untargeted metabolomics data

Journal Article · · Nature Methods
 [1];  [2];  [2];  [2];  [3];  [2];  [2];  [2];  [2];  [2];  [2];  [2];  [4];  [4];  [2];  [5]
  1. Fudan University, Shanghai (China); Princeton University, NJ (United States); University of Illinois
  2. Princeton University, NJ (United States)
  3. Fudan University, Shanghai (China); Princeton University, NJ (United States)
  4. University of Tennessee, Knoxville, TN (United States)
  5. Princeton University, NJ (United States); Ludwig Institute for Cancer Research, Princeton, NJ (United States)
Liquid chromatography–high-resolution mass spectrometry (LC-MS)-based metabolomics aims to identify and quantify all metabolites, but most LC-MS peaks remain unidentified. Here we present a global network optimization approach, NetID, to annotate untargeted LC-MS metabolomics data. The approach aims to generate, for all experimentally observed ion peaks, annotations that match the measured masses, retention times and (when available) tandem mass spectrometry fragmentation patterns. Peaks are connected based on mass differences reflecting adduction, fragmentation, isotopes, or feasible biochemical transformations. Global optimization generates a single network linking most observed ion peaks, enhances peak assignment accuracy, and produces chemically informative peak–peak relationships, including for peaks lacking tandem mass spectrometry spectra. Applying this approach to yeast and mouse data, we identified five previously unrecognized metabolites (thiamine derivatives and N-glucosyl-taurine). Isotope tracer studies indicate active flux through these metabolites. Thus, NetID applies existing metabolomic knowledge and global optimization to substantially improve annotation coverage and accuracy in untargeted metabolomics datasets, facilitating metabolite discovery.
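The peak-connection step described in the abstract can be illustrated with a minimal sketch. This is not the NetID implementation: the `Peak` class, the `connect_peaks` function, the short `MASS_DIFFS` table, and the tolerance defaults are all hypothetical choices made for illustration. The sketch only proposes candidate edges between peaks whose m/z difference matches a known mass difference within a ppm tolerance, and assumes that isotope and adduct partners must co-elute while biochemical transformations need not. It does not attempt the global optimization step.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Peak:
    """Hypothetical container for one observed ion peak."""
    peak_id: int
    mz: float   # measured mass-to-charge ratio
    rt: float   # retention time in minutes


# Illustrative table only: (label, mass difference in Da, requires co-elution).
# Monoisotopic differences; a real rule table would be far larger.
MASS_DIFFS = [
    ("13C isotope", 1.003355, True),
    ("Na-H adduct exchange", 21.981944, True),
    ("H2O gain/loss", 18.010565, False),
    ("phosphorylation (HPO3)", 79.966331, False),
]


def connect_peaks(peaks, ppm=10.0, rt_window=0.1):
    """Return candidate edges (lighter_id, heavier_id, label).

    ppm and rt_window are assumed defaults, not values from the paper.
    """
    edges = []
    ordered = sorted(peaks, key=lambda p: p.mz)
    for a, b in combinations(ordered, 2):
        delta = b.mz - a.mz
        tol = b.mz * ppm * 1e-6  # absolute tolerance at the heavier peak
        for label, diff, needs_coelution in MASS_DIFFS:
            if abs(delta - diff) > tol:
                continue
            if needs_coelution and abs(a.rt - b.rt) > rt_window:
                continue
            edges.append((a.peak_id, b.peak_id, label))
    return edges


# Toy peaks with made-up retention times.
peaks = [
    Peak(1, 181.0707, 5.00),
    Peak(2, 182.0741, 5.02),   # about +1.003355 from peak 1, co-eluting
    Peak(3, 203.0527, 5.01),   # about +21.981944 from peak 1, co-eluting
    Peak(4, 261.0370, 8.00),   # about +79.966331 from peak 1, later elution
]
for edge in connect_peaks(peaks):
    print(edge)
```

In a fuller pipeline, each candidate edge would carry a score, and an optimization over the whole network would choose one consistent annotation per peak; that part is deliberately left out here.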
Research Organization:
CABBI, Urbana, IL (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
SC0018420
OSTI ID:
1855984
Journal Information:
Nature Methods, Vol. 18, Issue 11; ISSN 1548-7091
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English

