A RESTful API for accessing microbial community data for MG-RAST
Journal Article
·
· PLoS Computational Biology (Online)
- Argonne National Lab. (ANL), Lement, IL (United States). Mathematics and Computer Science Division; Univ. of Chicago, Chicago, IL (United States). Computation Institute.
- Argonne National Lab. (ANL), Lement, IL (United States). Mathematics and Computer Science Division.
- Univ. of Canterbury (New Zealand)
Metagenomic sequencing has produced significant amounts of data in recent years. For example, as of summer 2013, MGRAST has been used to annotate over 110,000 data sets totaling over 43 Terabases. With metagenomic sequencing finding even wider adoption in the scientific community, the existing web-based analysis tools and infrastructure in MG-RAST provide limited capability for data retrieval and analysis, such as comparative analysis between multiple data sets. Moreover, although the system provides many analysis tools, it is not comprehensive. By opening MG-RAST up via a web services API (application programmers interface) we have greatly expanded access to MG-RAST data, as well as provided a mechanism for the use of third-party analysis tools with MG-RAST data. This RESTful API makes all data and data objects created by the MG-RAST pipeline accessible as JSON objects. As part of the DOE Systems Biology Knowledgebase project (KBase, http:// kbase.us) we have implemented a web services API for MG-RAST. This API complements the existing MG-RAST web interface and constitutes the basis of KBase’s microbial community capabilities. In addition, the API exposes a comprehensive collection of data to programmers. This API, which uses a RESTful (Representational State Transfer) implementation, is compatible with most programming environments and should be easy to use for end users and third parties. It provides comprehensive access to sequence data, quality control results, annotations, and many other data types. Where feasible, we have used standards to expose data and metadata. Code examples are provided in a number of languages both to show the versatility of the API and to provide a starting point for users. We present an API that exposes the data in MG-RAST for consumption by our users, greatly enhancing the utility of the MG-RAST service.
- Research Organization:
- Argonne National Lab. (ANL), Argonne, IL (United States); Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21); USDOE Office of Science (SC), Biological and Environmental Research (BER) (SC-23)
- Grant/Contract Number:
- AC02-06CH11357
- OSTI ID:
- 1212400
- Alternate ID(s):
- OSTI ID: 1395022
- Journal Information:
- PLoS Computational Biology (Online), Journal Name: PLoS Computational Biology (Online) Journal Issue: 1 Vol. 11; ISSN 1553-7358
- Publisher:
- Public Library of ScienceCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Accessing the SEED Genome Databases via Web Services API: Tools for Programmers
The MG-RAST API explorer: an on-ramp for RESTful query composition
Journal Article
·
Sun Jun 13 20:00:00 EDT 2010
· BMC Bioinformatics
·
OSTI ID:1626269
The MG-RAST API explorer: an on-ramp for RESTful query composition
Journal Article
·
Thu Nov 07 19:00:00 EST 2019
· BMC Bioinformatics
·
OSTI ID:1609142