iMicrobe: Tools and data-driven discovery platform for the microbiome sciences
Abstract
Background: Scientists have amassed a wealth of microbiome datasets, making it possible to study microbes in biotic and abiotic systems on a population or planetary scale; however, this potential has not been fully realized given that the tools, datasets, and computation are available in diverse repositories and locations. To address this challenge, we developed iMicrobe.us, a community-driven microbiome data marketplace and tool exchange for users to integrate their own data and tools with those from the broader community. Findings: The iMicrobe platform brings together analysis tools and microbiome datasets by leveraging National Science Foundation–supported cyberinfrastructure and computing resources from CyVerse, Agave, and XSEDE. The primary purpose of iMicrobe is to provide users with a freely available, web-based platform to (1) maintain and share project data, metadata, and analysis products, (2) search for related public datasets, and (3) use and publish bioinformatics tools that run on highly scalable computing resources. Analysis tools are implemented in containers that encapsulate complex software dependencies and run on freely available XSEDE resources via the Agave API, which can retrieve datasets from the CyVerse Data Store or any web-accessible location (e.g., FTP, HTTP). Conclusions: iMicrobe promotes data integration, sharing, and community-driven tool development by making openmore »
- Authors:
-
- Univ. of Arizona, Tucson, AZ (United States). Dept. of Biosystems Engineering
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Genomics and Systems Biology Division
- Univ. of Arizona, Tucson, AZ (United States). Dept. of Computer Science
- Univ. of Arizona, Tucson, AZ (United States). Dept. of Biosystems Engineering; Univ. of Arizona, Tucson, AZ (United States). BIO5 Inst.
- Publication Date:
- Research Org.:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
- OSTI Identifier:
- 1625348
- Grant/Contract Number:
- AC02-05CH11231
- Resource Type:
- Accepted Manuscript
- Journal Name:
- GigaScience
- Additional Journal Information:
- Journal Volume: 8; Journal Issue: 7; Journal ID: ISSN 2047-217X
- Publisher:
- BioMed Central
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING; Life Sciences & Biomedicine - Other Topics; Science & Technology - Other Topics; cyberinfrastructure; cloud computing; bioinformatics; metagenomics
Citation Formats
Youens-Clark, Ken, Bomhoff, Matt, Ponsero, Alise J., Wood-Charlson, Elisha M., Lynch, Joshua, Choi, Illyoung, Hartman, John H., and Hurwitz, Bonnie L. iMicrobe: Tools and data-driven discovery platform for the microbiome sciences. United States: N. p., 2019.
Web. doi:10.1093/gigascience/giz083.
Youens-Clark, Ken, Bomhoff, Matt, Ponsero, Alise J., Wood-Charlson, Elisha M., Lynch, Joshua, Choi, Illyoung, Hartman, John H., & Hurwitz, Bonnie L. iMicrobe: Tools and data-driven discovery platform for the microbiome sciences. United States. https://doi.org/10.1093/gigascience/giz083
Youens-Clark, Ken, Bomhoff, Matt, Ponsero, Alise J., Wood-Charlson, Elisha M., Lynch, Joshua, Choi, Illyoung, Hartman, John H., and Hurwitz, Bonnie L. Tue .
"iMicrobe: Tools and data-driven discovery platform for the microbiome sciences". United States. https://doi.org/10.1093/gigascience/giz083. https://www.osti.gov/servlets/purl/1625348.
@article{osti_1625348,
title = {iMicrobe: Tools and data-driven discovery platform for the microbiome sciences},
author = {Youens-Clark, Ken and Bomhoff, Matt and Ponsero, Alise J. and Wood-Charlson, Elisha M. and Lynch, Joshua and Choi, Illyoung and Hartman, John H. and Hurwitz, Bonnie L.},
abstractNote = {Background: Scientists have amassed a wealth of microbiome datasets, making it possible to study microbes in biotic and abiotic systems on a population or planetary scale; however, this potential has not been fully realized given that the tools, datasets, and computation are available in diverse repositories and locations. To address this challenge, we developed iMicrobe.us, a community-driven microbiome data marketplace and tool exchange for users to integrate their own data and tools with those from the broader community. Findings: The iMicrobe platform brings together analysis tools and microbiome datasets by leveraging National Science Foundation–supported cyberinfrastructure and computing resources from CyVerse, Agave, and XSEDE. The primary purpose of iMicrobe is to provide users with a freely available, web-based platform to (1) maintain and share project data, metadata, and analysis products, (2) search for related public datasets, and (3) use and publish bioinformatics tools that run on highly scalable computing resources. Analysis tools are implemented in containers that encapsulate complex software dependencies and run on freely available XSEDE resources via the Agave API, which can retrieve datasets from the CyVerse Data Store or any web-accessible location (e.g., FTP, HTTP). Conclusions: iMicrobe promotes data integration, sharing, and community-driven tool development by making open source data and tools accessible to the research community in a web-based platform.},
doi = {10.1093/gigascience/giz083},
journal = {GigaScience},
number = 7,
volume = 8,
place = {United States},
year = {Tue Jul 09 00:00:00 EDT 2019},
month = {Tue Jul 09 00:00:00 EDT 2019}
}
Figures / Tables:
Works referenced in this record:
GraftM: a tool for scalable, phylogenetically informed classification of genes within metagenomes
journal, March 2018
- Boyd, Joel A.; Woodcroft, Ben J.; Tyson, Gene W.
- Nucleic Acids Research, Vol. 46, Issue 10
GraftM: a tool for scalable, phylogenetically informed classification of genes within metagenomes
journal, March 2018
- Boyd, Joel A.; Woodcroft, Ben J.; Tyson, Gene W.
- Nucleic Acids Research, Vol. 46, Issue 10
Environmental drivers of a microbial genomic transition zone in the ocean’s interior
journal, August 2017
- Mende, Daniel R.; Bryant, Jessica A.; Aylward, Frank O.
- Nature Microbiology, Vol. 2, Issue 10
Supporting data for "iMicrobe: Tools and data-driven discovery platform for the microbiome sciences"
dataset, January 2019
- Alise, Ponsero; Bonnie, Hurwitz L.; Elisha, Wood-Charlson M.
- GigaScience Database
Supporting data for "iMicrobe: Tools and data-driven discovery platform for the microbiome sciences"
dataset, January 2019
- Alise, Ponsero; Bonnie, Hurwitz L.; Elisha, Wood-Charlson M.
- GigaScience Database
Trimmomatic: a flexible trimmer for Illumina sequence data
journal, April 2014
- Bolger, Anthony M.; Lohse, Marc; Usadel, Bjoern
- Bioinformatics, Vol. 30, Issue 15
Singularity: Scientific containers for mobility of compute
journal, May 2017
- Kurtzer, Gregory M.; Sochat, Vanessa; Bauer, Michael W.
- PLOS ONE, Vol. 12, Issue 5
The iPlant Collaborative: Cyberinfrastructure for Enabling Data to Discovery for the Life Sciences
journal, January 2016
- Merchant, Nirav; Lyons, Eric; Goff, Stephen
- PLOS Biology, Vol. 14, Issue 1
The iPlant Collaborative: Cyberinfrastructure for Enabling Data to Discovery for the Life Sciences
journal, January 2016
- Merchant, Nirav; Lyons, Eric; Goff, Stephen
- PLOS Biology, Vol. 14, Issue 1
The MG-RAST metagenomics database and portal in 2015
journal, December 2015
- Wilke, Andreas; Bischof, Jared; Gerlach, Wolfgang
- Nucleic Acids Research, Vol. 44, Issue D1
The MG-RAST metagenomics database and portal in 2015
journal, December 2015
- Wilke, Andreas; Bischof, Jared; Gerlach, Wolfgang
- Nucleic Acids Research, Vol. 44, Issue D1
Polinton-like viruses are abundant in aquatic ecosystems
journal, January 2021
- Bellas, Christopher M.; Sommaruga, Ruben
- Microbiome, Vol. 9, Issue 1
Draft Genome Sequence of Comamonas aquatilis Strain LK (= CSUR P6418 = CECT 9772), Isolated from the Planarian Schmidtea mediterranea
journal, February 2021
- Kangale, Luis Johnson; Levasseur, Anthony; Raoult, Didier
- Microbiology Resource Announcements, Vol. 10, Issue 5
Singularity: Scientific containers for mobility of compute
journal, May 2017
- Kurtzer, Gregory M.; Sochat, Vanessa; Bauer, Michael W.
- PLOS ONE, Vol. 12, Issue 5
Trimmomatic: a flexible trimmer for Illumina sequence data
journal, April 2014
- Bolger, Anthony M.; Lohse, Marc; Usadel, Bjoern
- Bioinformatics, Vol. 30, Issue 15
Fast and sensitive protein alignment using DIAMOND
journal, November 2014
- Buchfink, Benjamin; Xie, Chao; Huson, Daniel H.
- Nature Methods, Vol. 12, Issue 1
MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph
journal, January 2015
- Li, Dinghua; Liu, Chi-Man; Luo, Ruibang
- Bioinformatics, Vol. 31, Issue 10
Centrifuge: rapid and sensitive classification of metagenomic sequences
journal, October 2016
- Kim, Daehwan; Song, Li; Breitwieser, Florian P.
- Genome Research, Vol. 26, Issue 12
Putative archaeal viruses from the mesopelagic ocean
journal, January 2017
- Vik, Dean R.; Roux, Simon; Brum, Jennifer R.
- PeerJ, Vol. 5
XSEDE: Accelerating Scientific Discovery
journal, September 2014
- Towns, John; Cockerill, Timothy; Dahan, Maytal
- Computing in Science & Engineering, Vol. 16, Issue 5
MetaGeneAnnotator: Detecting Species-Specific Patterns of Ribosomal Binding Site for Precise Gene Prediction in Anonymous Prokaryotic and Phage Genomes
journal, October 2008
- Noguchi, H.; Taniguchi, T.; Itoh, T.
- DNA Research, Vol. 15, Issue 6
UProC: tools for ultra-fast protein domain classification
journal, December 2014
- Meinicke, Peter
- Bioinformatics, Vol. 31, Issue 9
MetaGeneAnnotator: Detecting Species-Specific Patterns of Ribosomal Binding Site for Precise Gene Prediction in Anonymous Prokaryotic and Phage Genomes
journal, October 2008
- Noguchi, H.; Taniguchi, T.; Itoh, T.
- DNA Research, Vol. 15, Issue 6
Prodigal: prokaryotic gene recognition and translation initiation site identification
journal, March 2010
- Hyatt, Doug; Chen, Gwo-Liang; LoCascio, Philip F.
- BMC Bioinformatics, Vol. 11, Issue 1
Characterization and functional analysis of phytoene synthase gene family in tobacco
journal, January 2021
- Wang, Zhaojun; Zhang, Lin; Dong, Chen
- BMC Plant Biology, Vol. 21, Issue 1
IMG 4 version of the integrated microbial genomes comparative analysis system
journal, October 2013
- Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna
- Nucleic Acids Research, Vol. 42, Issue D1
Protocols.io: Virtual Communities for Protocol Development and Discussion
journal, August 2016
- Teytelman, Leonid; Stoliartchouk, Alexei; Kindler, Lori
- PLOS Biology, Vol. 14, Issue 8
Microbiota Assessments for the Identification and Confirmation of Slit Defect-Causing Bacteria in Milk and Cheddar Cheese
journal, February 2021
- Xue, Zhengyao; Brooks, Jason T.; Quart, Zachary
- mSystems, Vol. 6, Issue 1
Fast and sensitive protein alignment using DIAMOND
journal, November 2014
- Buchfink, Benjamin; Xie, Chao; Huson, Daniel H.
- Nature Methods, Vol. 12, Issue 1
The FAIR Guiding Principles for scientific data management and stewardship.
other, January 2016
- Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
- UNESCO/IOC
The FAIR Guiding Principles for scientific data management and stewardship
journal, March 2016
- Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
- Scientific Data, Vol. 3, Issue 1
Mash: fast genome and metagenome distance estimation using MinHash
journal, June 2016
- Ondov, Brian D.; Treangen, Todd J.; Melsted, Páll
- Genome Biology, Vol. 17, Issue 1
How to capture developmental brain dynamics: gaps and solutions
journal, May 2021
- van Atteveldt, Nienke; Vandermosten, Maaike; Weeda, Wouter
- npj Science of Learning, Vol. 6, Issue 1
SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads
journal, February 2014
- Xie, Y.; Wu, G.; Tang, J.
- Bioinformatics, Vol. 30, Issue 12
SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads
journal, February 2014
- Xie, Y.; Wu, G.; Tang, J.
- Bioinformatics, Vol. 30, Issue 12
Prokka: rapid prokaryotic genome annotation
journal, March 2014
- Seemann, T.
- Bioinformatics, Vol. 30, Issue 14
KBase: The United States Department of Energy Systems Biology Knowledgebase
journal, July 2018
- Arkin, Adam P.; Cottingham, Robert W.; Henry, Christopher S.
- Nature Biotechnology, Vol. 36, Issue 7
WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs
journal, July 2017
- Galiez, Clovis; Siebert, Matthias; Enault, François
- Bioinformatics, Vol. 33, Issue 19
Improving the usability and archival stability of bioinformatics software
journal, February 2019
- Mangul, Serghei; Martin, Lana S.; Eskin, Eleazar
- Genome Biology, Vol. 20, Issue 1
FragGeneScan: predicting genes in short and error-prone reads
journal, August 2010
- Rho, Mina; Tang, Haixu; Ye, Yuzhen
- Nucleic Acids Research, Vol. 38, Issue 20
MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph
preprint, January 2014
- Li, Dinghua; Liu, Chi-Man; Luo, Ruibang
- arXiv
CAMERA: A Community Resource for Metagenomics
journal, March 2007
- Seshadri, Rekha; Kravitz, Saul A.; Smarr, Larry
- PLoS Biology, Vol. 5, Issue 3
CAMERA: A Community Resource for Metagenomics
journal, March 2007
- Seshadri, Rekha; Kravitz, Saul A.; Smarr, Larry
- PLoS Biology, Vol. 5, Issue 3
KBase: The United States Department of Energy Systems Biology Knowledgebase
journal, July 2018
- Arkin, Adam P.; Cottingham, Robert W.; Henry, Christopher S.
- Nature Biotechnology, Vol. 36, Issue 7
Qiita: rapid, web-enabled microbiome meta-analysis
journal, October 2018
- Gonzalez, Antonio; Navas-Molina, Jose A.; Kosciolek, Tomasz
- Nature Methods, Vol. 15, Issue 10
SOAPdenovo-Trans: De novo transcriptome assembly with short RNA-Seq reads
preprint, January 2013
- Xie, Yinlong; Wu, Gengxiong; Tang, Jingbo
- arXiv
BioContainers: an open-source and community-driven framework for software standardization
journal, March 2017
- da Veiga Leprevost, Felipe; Grüning, Björn A.; Alves Aflitos, Saulo
- Bioinformatics, Vol. 33, Issue 16
FragGeneScan: predicting genes in short and error-prone reads
journal, August 2010
- Rho, Mina; Tang, Haixu; Ye, Yuzhen
- Nucleic Acids Research, Vol. 38, Issue 20
WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs
journal, July 2017
- Galiez, Clovis; Siebert, Matthias; Enault, François
- Bioinformatics, Vol. 33, Issue 19
The FAIR Guiding Principles for scientific data management and stewardship
journal, March 2016
- Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
- Scientific Data, Vol. 3, Issue 1
The Agave Platform: An Open, Science-as-a-Service Platform for Digital Science
conference, July 2018
- Dooley, Rion; Brandt, Steven R.; Fonner, John
- PEARC '18: Practice and Experience in Advanced Research Computing, Proceedings of the Practice and Experience on Advanced Research Computing
The Agave Platform: An Open, Science-as-a-Service Platform for Digital Science
conference, July 2018
- Dooley, Rion; Brandt, Steven R.; Fonner, John
- PEARC '18: Practice and Experience in Advanced Research Computing, Proceedings of the Practice and Experience on Advanced Research Computing
Prodigal: prokaryotic gene recognition and translation initiation site identification
journal, March 2010
- Hyatt, Doug; Chen, Gwo-Liang; LoCascio, Philip F.
- BMC Bioinformatics, Vol. 11, Issue 1
UProC: tools for ultra-fast protein domain classification
journal, December 2014
- Meinicke, Peter
- Bioinformatics, Vol. 31, Issue 9
vConTACT: an iVirus tool to classify double-stranded DNA viruses that infect Archaea and Bacteria
journal, January 2017
- Bolduc, Benjamin; Jang, Ho Bin; Doulcier, Guilhem
- PeerJ, Vol. 5
IMG 4 version of the integrated microbial genomes comparative analysis system
journal, October 2013
- Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna
- Nucleic Acids Research, Vol. 42, Issue D1
Libra: scalable k- mer–based tool for massive all-vs-all metagenome comparisons
journal, December 2018
- Choi, Illyoung; Ponsero, Alise J.; Bomhoff, Matthew
- GigaScience, Vol. 8, Issue 2
BioContainers: an open-source and community-driven framework for software standardization
journal, March 2017
- da Veiga Leprevost, Felipe; Grüning, Björn A.; Alves Aflitos, Saulo
- Bioinformatics, Vol. 33, Issue 16
Qiita: rapid, web-enabled microbiome meta-analysis
journal, October 2018
- Gonzalez, Antonio; Navas-Molina, Jose A.; Kosciolek, Tomasz
- Nature Methods, Vol. 15, Issue 10
Works referencing / citing this record:
Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
journal, November 2019
- Watts, George S.; Thornton, James E.; Youens-Clark, Ken
- PLOS Computational Biology, Vol. 15, Issue 11
PuMA: A papillomavirus genome annotation tool
journal, July 2020
- Pace, Josh; Youens-Clark, Ken; Freeman, Cordell
- Virus Evolution, Vol. 6, Issue 2
PuMA: A papillomavirus genome annotation tool
journal, July 2020
- Pace, Josh; Youens-Clark, Ken; Freeman, Cordell
- Virus Evolution, Vol. 6, Issue 2
Figures / Tables found in this record: