DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Challenges in Bioinformatics Workflows for Processing Microbiome Omics Data at Scale

Journal Article · Frontiers in Bioinformatics

The nascent field of microbiome science is transitioning from a descriptive approach of cataloging taxa and functions present in an environment to applying multi-omics methods to investigate microbiome dynamics and function. A large number of new tools and algorithms have been designed and used for very specific purposes on samples collected by individual investigators or groups. While these developments have been quite instructive, the ability to compare microbiome data generated by many groups of researchers is impeded by the lack of standardized application of bioinformatics methods. Additionally, there are few examples of broad bioinformatics workflows that can process metagenome, metatranscriptome, metaproteome and metabolomic data at scale, and no central hub that allows processing, or provides varied omics data that are findable, accessible, interoperable and reusable (FAIR). Here, we review some of the challenges that exist in analyzing omics data within the microbiome research sphere, and provide context on how the National Microbiome Data Collaborative has adopted a standardized and open access approach to address such challenges.

Sponsoring Organization: USDOE
OSTI ID: 1840628
Journal Information: Frontiers in Bioinformatics, Vol. 1; ISSN 2673-7647
Publisher: Frontiers Media SA
Country of Publication: Switzerland
Language: English
