U.S. Department of Energy
Office of Scientific and Technical Information

Cognitive analysis of metabolomics data for systems biology

Journal Article · Nature Protocols
 [1];  [1];  [1];  [2];  [1];  [1];  [1];  [1];  [1];  [2];  [3];  [1]
  1. The Scripps Research Inst., La Jolla, CA (United States). Center for Mass Spectrometry and Metabolomics
  2. IBM Watson Health, Cambridge, MA (United States)
  3. Waters Corp., Milford, MA (United States)
Cognitive computing is revolutionizing the way big data are processed and integrated, with artificial intelligence (AI)-based natural language processing (NLP) platforms helping researchers to efficiently search and digest the vast scientific literature. Most available platforms have been developed for biomedical researchers, but new NLP tools are emerging for biologists in other fields, and an important example is metabolomics. NLP provides literature-based contextualization of metabolic features that decreases the time and expert-level subject knowledge required during the prioritization, identification and interpretation steps in the metabolomics data analysis pipeline. Here, we describe and demonstrate four workflows that combine metabolomics data with NLP-based literature searches of scientific databases to aid in the analysis of metabolomics data and their biological interpretation. The four procedures can be used in isolation or consecutively, depending on the research questions. The first, used for initial metabolite annotation and prioritization, creates a list of metabolites that would be interesting for follow-up. The second finds literature evidence of the activity of metabolites and metabolic pathways in governing the biological condition on a systems biology level. The third is used to identify candidate biomarkers, and the fourth looks for metabolic conditions or drug-repurposing targets that two diseases have in common. The protocol can take 1–4 h or more to complete, depending on the processing time of the software used.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
National Institutes of Health (NIH); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1774918
Journal Information:
Journal Name: Nature Protocols; Journal Volume: 16; Journal Issue: 3; ISSN 1754-2189
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
