A RESTful API for accessing microbial community data for MG-RAST
Abstract
Metagenomic sequencing has produced significant amounts of data in recent years. For example, as of summer 2013, MG-RAST has been used to annotate over 110,000 data sets totaling over 43 Terabases. With metagenomic sequencing finding even wider adoption in the scientific community, the existing web-based analysis tools and infrastructure in MG-RAST provide limited capability for data retrieval and analysis, such as comparative analysis between multiple data sets. Moreover, although the system provides many analysis tools, it is not comprehensive. By opening MG-RAST up via a web services API (application programmers interface) we have greatly expanded access to MG-RAST data, as well as provided a mechanism for the use of third-party analysis tools with MG-RAST data. This RESTful API makes all data and data objects created by the MG-RAST pipeline accessible as JSON objects. As part of the DOE Systems Biology Knowledgebase project (KBase, http://kbase.us) we have implemented a web services API for MG-RAST. This API complements the existing MG-RAST web interface and constitutes the basis of KBase's microbial community capabilities. In addition, the API exposes a comprehensive collection of data to programmers. This API, which uses a RESTful (Representational State Transfer) implementation, is compatible with most programming environments and shouldmore »
- Authors:
-
- Argonne National Lab. (ANL), Lement, IL (United States). Mathematics and Computer Science Division; Univ. of Chicago, Chicago, IL (United States). Computation Institute.
- Argonne National Lab. (ANL), Lement, IL (United States). Mathematics and Computer Science Division.
- Univ. of Canterbury (New Zealand)
- Publication Date:
- Research Org.:
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- OSTI Identifier:
- 1212400
- Alternate Identifier(s):
- OSTI ID: 1395022
- Grant/Contract Number:
- AC02-06CH11357
- Resource Type:
- Journal Article: Accepted Manuscript
- Journal Name:
- PLoS Computational Biology (Online)
- Additional Journal Information:
- Journal Volume: 11; Journal Issue: 1; Journal ID: ISSN 1553-7358
- Publisher:
- Public Library of Science
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; 96 KNOWLEDGE MANAGEMENT AND PRESERVATION; sequence databases; information retrieval; metagenomics; web-based applications; proteases; DNA sequence analysis; database searching; quality control
Citation Formats
Wilke, Andreas, Bischof, Jared, Harrison, Travis, Brettin, Tom, D'Souza, Mark, Gerlach, Wolfgang, Matthews, Hunter, Paczian, Tobias, Wilkening, Jared, Glass, Elizabeth M., Desai, Narayan, Meyer, Folker, and Gardner, Paul P. A RESTful API for accessing microbial community data for MG-RAST. United States: N. p., 2015.
Web. doi:10.1371/journal.pcbi.1004008.
Wilke, Andreas, Bischof, Jared, Harrison, Travis, Brettin, Tom, D'Souza, Mark, Gerlach, Wolfgang, Matthews, Hunter, Paczian, Tobias, Wilkening, Jared, Glass, Elizabeth M., Desai, Narayan, Meyer, Folker, & Gardner, Paul P. A RESTful API for accessing microbial community data for MG-RAST. United States. https://doi.org/10.1371/journal.pcbi.1004008
Wilke, Andreas, Bischof, Jared, Harrison, Travis, Brettin, Tom, D'Souza, Mark, Gerlach, Wolfgang, Matthews, Hunter, Paczian, Tobias, Wilkening, Jared, Glass, Elizabeth M., Desai, Narayan, Meyer, Folker, and Gardner, Paul P. Thu .
"A RESTful API for accessing microbial community data for MG-RAST". United States. https://doi.org/10.1371/journal.pcbi.1004008. https://www.osti.gov/servlets/purl/1212400.
@article{osti_1212400,
title = {A RESTful API for accessing microbial community data for MG-RAST},
author = {Wilke, Andreas and Bischof, Jared and Harrison, Travis and Brettin, Tom and D'Souza, Mark and Gerlach, Wolfgang and Matthews, Hunter and Paczian, Tobias and Wilkening, Jared and Glass, Elizabeth M. and Desai, Narayan and Meyer, Folker and Gardner, Paul P.},
abstractNote = {Metagenomic sequencing has produced significant amounts of data in recent years. For example, as of summer 2013, MG-RAST has been used to annotate over 110,000 data sets totaling over 43 Terabases. With metagenomic sequencing finding even wider adoption in the scientific community, the existing web-based analysis tools and infrastructure in MG-RAST provide limited capability for data retrieval and analysis, such as comparative analysis between multiple data sets. Moreover, although the system provides many analysis tools, it is not comprehensive. By opening MG-RAST up via a web services API (application programmers interface) we have greatly expanded access to MG-RAST data, as well as provided a mechanism for the use of third-party analysis tools with MG-RAST data. This RESTful API makes all data and data objects created by the MG-RAST pipeline accessible as JSON objects. As part of the DOE Systems Biology Knowledgebase project (KBase, http://kbase.us) we have implemented a web services API for MG-RAST. This API complements the existing MG-RAST web interface and constitutes the basis of KBase's microbial community capabilities. In addition, the API exposes a comprehensive collection of data to programmers. This API, which uses a RESTful (Representational State Transfer) implementation, is compatible with most programming environments and should be easy to use for end users and third parties. It provides comprehensive access to sequence data, quality control results, annotations, and many other data types. Where feasible, we have used standards to expose data and metadata. Code examples are provided in a number of languages both to show the versatility of the API and to provide a starting point for users. We present an API that exposes the data in MG-RAST for consumption by our users, greatly enhancing the utility of the MG-RAST service.},
doi = {10.1371/journal.pcbi.1004008},
url = {https://www.osti.gov/biblio/1212400},
journal = {PLoS Computational Biology (Online)},
issn = {1553-7358},
number = 1,
volume = 11,
place = {United States},
year = {2015},
month = {1}
}
Web of Science
Works referenced in this record:
The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes
journal, September 2008
- Meyer, F.; Paarmann, D.; D'Souza, M.
- BMC Bioinformatics, Vol. 9, Issue 1
The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification
journal, May 2013
- Afrasiabi, Cyrus; Samad, Bushra; Dineen, David
- Nucleic Acids Research, Vol. 41, Issue W1
A Platform-Independent Method for Detecting Errors in Metagenomic Sequencing Data: DRISEE
journal, June 2012
- Keegan, Kevin P.; Trimble, William L.; Wilkening, Jared
- PLoS Computational Biology, Vol. 8, Issue 6
The M5nr: a novel non-redundant database containing protein sequences and annotations from multiple sources and associated tools
journal, January 2012
- Wilke, Andreas; Harrison, Travis; Wilkening, Jared
- BMC Bioinformatics, Vol. 13, Issue 1
Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications
journal, May 2011
- Yilmaz, Pelin; Kottmann, Renzo; Field, Dawn
- Nature Biotechnology, Vol. 29, Issue 5
InterPro in 2011: new developments in the family and domain prediction database
journal, November 2011
- Hunter, S.; Jones, P.; Mitchell, A.
- Nucleic Acids Research, Vol. 40, Issue D1
Using clouds for metagenomics: A case study
conference, August 2009
- Wilkening, Jared; Wilke, Andreas; Desai, Narayan
- 2009 IEEE International Conference on Cluster Computing and Workshops
Identifying Protein Domains with the Pfam Database
journal, September 2008
- Coggill, Penny; Finn, Robert D.; Bateman, Alex
- Current Protocols in Bioinformatics, Vol. 23, Issue 1
Accessing the SEED Genome Databases via Web Services API: Tools for Programmers
journal, January 2010
- Disz, Terry; Akhter, Sajia; Cuevas, Daniel
- BMC Bioinformatics, Vol. 11, Issue 1
The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome
journal, July 2012
- McDonald, Daniel; Clemente, Jose C.; Kuczynski, Justin
- GigaScience, Vol. 1, Issue 1
The 'rare biosphere': a reality check
journal, September 2009
- Reeder, Jens; Knight, Rob
- Nature Methods, Vol. 6, Issue 9
Works referencing / citing this record:
Towards Solving The Metagenomics Reproducibility Crisis With Cwl And Ro
other, October 2018
- Meyer, Folker
- Zenodo, 1 File (4.6 MB)
Towards Solving The Metagenomics Reproducibility Crisis With Cwl And Ro
other, October 2018
- Meyer, Folker
- Zenodo, 1 File (4.6 MB)
SAMSA: a comprehensive metatranscriptome analysis pipeline
journal, September 2016
- Westreich, Samuel T.; Korf, Ian; Mills, David A.
- BMC Bioinformatics, Vol. 17, Issue 1
MG-RAST version 4—lessons learned from a decade of low-budget ultra-high-throughput metagenome analysis
journal, September 2017
- Meyer, Folker; Bagchi, Saurabh; Chaterji, Somali
- Briefings in Bioinformatics, Vol. 20, Issue 4
MG-RAST version 4—lessons learned from a decade of low-budget ultra-high-throughput metagenome analysis
journal, September 2017
- Meyer, Folker; Bagchi, Saurabh; Chaterji, Somali
- Briefings in Bioinformatics, Vol. 20, Issue 4
SAMSA: a comprehensive metatranscriptome analysis pipeline
journal, September 2016
- Westreich, Samuel T.; Korf, Ian; Mills, David A.
- BMC Bioinformatics, Vol. 17, Issue 1
A novel and wide substrate specific polyhydroxyalkanoate (PHA) synthase from unculturable bacteria found in mangrove soil
journal, December 2017
- Foong, Choon Pin; Lakshmanan, Manoj; Abe, Hideki
- Journal of Polymer Research, Vol. 25, Issue 1
IgA regulates the composition and metabolic function of gut microbiota by promoting symbiosis between bacteria
journal, July 2018
- Nakajima, Akira; Vogelzang, Alexis; Maruya, Mikako
- Journal of Experimental Medicine, Vol. 215, Issue 8
Functional sequencing read annotation for high precision microbiome analysis
journal, November 2017
- Zhu, Chengsheng; Miller, Maximilian; Marpaka, Srinayani
- Nucleic Acids Research, Vol. 46, Issue 4
Exploring bacterial pathogen community dynamics in freshwater beach sediments: A tale of two lakes
journal, November 2019
- VanMensel, Danielle; Chaganti, Subba Rao; Droppo, Ian G.
- Environmental Microbiology, Vol. 22, Issue 2
Metagenomic evidence for the presence of phototrophic Gemmatimonadetes bacteria in diverse environments: Phototrophic Gemmatimonadetes in diverse environments
journal, January 2016
- Zeng, Yonghui; Baumbach, Jan; Barbosa, Eudes Guilherme Vieira
- Environmental Microbiology Reports, Vol. 8, Issue 1
Ancient plant DNA in lake sediments
journal, April 2017
- Parducci, Laura; Bennett, Keith D.; Ficetola, Gentile Francesco
- New Phytologist, Vol. 214, Issue 3
Complete Genome Sequence of Escherichia coli Phage vB_EcoS Sa179lw, Isolated from Surface Water in a Produce-Growing Area in Northern California
journal, July 2018
- Liao, Yen-Te; Liu, Fang; Sun, Xincheng
- Genome Announcements, Vol. 6, Issue 27
What Is the Role of Archaea in Plants? New Insights from the Vegetation of Alpine Bogs
journal, May 2018
- Taffner, Julian; Erlacher, Armin; Bragina, Anastasia
- mSphere, Vol. 3, Issue 3
Antibiotic Resistance Gene Diversity and Virulence Gene Diversity Are Correlated in Human Gut and Environmental Microbiomes
journal, May 2019
- Escudeiro, Pedro; Pothier, Joël; Dionisio, Francisco
- mSphere, Vol. 4, Issue 3
Genomics of the Uncultivated, Periodontitis-Associated Bacterium Tannerella sp. BU045 (Oral Taxon 808)
journal, June 2018
- Beall, Clifford J.; Campbell, Alisha G.; Griffen, Ann L.
- mSystems, Vol. 3, Issue 3
Taxon-Function Decoupling as an Adaptive Signature of Lake Microbial Metacommunities Under a Chronic Polymetallic Pollution Gradient
journal, May 2018
- Cheaib, Bachar; Le Boulch, Malo; Mercier, Pierre-Luc
- Frontiers in Microbiology, Vol. 9
Microscale Biosignatures and Abiotic Mineral Authigenesis in Little Hot Creek, California
journal, May 2018
- Kraus, Emily A.; Beeler, Scott R.; Mors, R. Agustin
- Frontiers in Microbiology, Vol. 9