DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: The IMG/M data management and analysis system v.7: content updates and new features

Abstract

The Integrated Microbial Genomes & Microbiomes (IMG/M) system at the Department of Energy (DOE) Joint Genome Institute (JGI) continues to support users in performing comparative analysis of isolate and single-cell genomes, metagenomes, and metatranscriptomes. In addition to datasets produced by the JGI, IMG v.7 also includes datasets imported from public sources such as NCBI GenBank, SRA, and the DOE National Microbiome Data Collaborative (NMDC), or submitted by external users. In the past couple of years, we have continued our effort to help the user community by improving the annotation pipeline, upgrading the contents with new reference database versions, and adding new analysis functionalities such as advanced scaffold search, Average Nucleotide Identity (ANI) for high-quality metagenome bins, new cassette search, improved gene neighborhood display, and improvements to metatranscriptome data display and analysis. We have also extended the collaboration and integration efforts with other DOE-funded projects such as NMDC and the DOE Systems Biology Knowledgebase (KBase).

Authors:
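Chen, I-Min A. [1]; Chu, Ken [1]; Palaniappan, Krishnaveni [1]; Ratner, Anna [1]; Huang, Jinghua [1]; Huntemann, Marcel [1]; Hajek, Patrick [1]; Ritter, Stephan J [1]; Webb, Cody [1]; Wu, Dongying [1]; Varghese, Neha J [1]; Reddy, T. K. [1]; Mukherjee, Supratim [1]; Ovchinnikova, Galina [1]; Nolan, Matt [1]; Seshadri, Rekha [1]; Roux, Simon [1]; Visel, Axel [1]; Woyke, Tanja [1]; Eloe-Fadrosh, Emiley A [1]; Kyrpides, Nikos C [1]; Ivanova, Natalia N [1]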
  1. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). Joint Genome Institute
Publication Date:
2022-11-16
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
OSTI Identifier:
1907597
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
Nucleic Acids Research
Additional Journal Information:
Journal Volume: 51; Journal Issue: D1; Journal ID: ISSN 0305-1048
Publisher:
Oxford University Press
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES

Citation Formats

Chen, I-Min A., Chu, Ken, Palaniappan, Krishnaveni, Ratner, Anna, Huang, Jinghua, Huntemann, Marcel, Hajek, Patrick, Ritter, Stephan J, Webb, Cody, Wu, Dongying, Varghese, Neha J, Reddy, T. K., Mukherjee, Supratim, Ovchinnikova, Galina, Nolan, Matt, Seshadri, Rekha, Roux, Simon, Visel, Axel, Woyke, Tanja, Eloe-Fadrosh, Emiley A, Kyrpides, Nikos C, and Ivanova, Natalia N. The IMG/M data management and analysis system v.7: content updates and new features. United States: N. p., 2022. Web. doi:10.1093/nar/gkac976.
Chen, I-Min A., Chu, Ken, Palaniappan, Krishnaveni, Ratner, Anna, Huang, Jinghua, Huntemann, Marcel, Hajek, Patrick, Ritter, Stephan J, Webb, Cody, Wu, Dongying, Varghese, Neha J, Reddy, T. K., Mukherjee, Supratim, Ovchinnikova, Galina, Nolan, Matt, Seshadri, Rekha, Roux, Simon, Visel, Axel, Woyke, Tanja, Eloe-Fadrosh, Emiley A, Kyrpides, Nikos C, & Ivanova, Natalia N. The IMG/M data management and analysis system v.7: content updates and new features. United States. https://doi.org/10.1093/nar/gkac976
Chen, I-Min A., Chu, Ken, Palaniappan, Krishnaveni, Ratner, Anna, Huang, Jinghua, Huntemann, Marcel, Hajek, Patrick, Ritter, Stephan J, Webb, Cody, Wu, Dongying, Varghese, Neha J, Reddy, T. K., Mukherjee, Supratim, Ovchinnikova, Galina, Nolan, Matt, Seshadri, Rekha, Roux, Simon, Visel, Axel, Woyke, Tanja, Eloe-Fadrosh, Emiley A, Kyrpides, Nikos C, and Ivanova, Natalia N. 2022. "The IMG/M data management and analysis system v.7: content updates and new features". United States. https://doi.org/10.1093/nar/gkac976. https://www.osti.gov/servlets/purl/1907597.
@article{osti_1907597,
title = {The IMG/M data management and analysis system v.7: content updates and new features},
author = {Chen, I-Min A. and Chu, Ken and Palaniappan, Krishnaveni and Ratner, Anna and Huang, Jinghua and Huntemann, Marcel and Hajek, Patrick and Ritter, Stephan J and Webb, Cody and Wu, Dongying and Varghese, Neha J and Reddy, T. K. and Mukherjee, Supratim and Ovchinnikova, Galina and Nolan, Matt and Seshadri, Rekha and Roux, Simon and Visel, Axel and Woyke, Tanja and Eloe-Fadrosh, Emiley A and Kyrpides, Nikos C and Ivanova, Natalia N},
abstractNote = {The Integrated Microbial Genomes \& Microbiomes (IMG/M) system at the Department of Energy (DOE) Joint Genome Institute (JGI) continues to support users in performing comparative analysis of isolate and single-cell genomes, metagenomes, and metatranscriptomes. In addition to datasets produced by the JGI, IMG v.7 also includes datasets imported from public sources such as NCBI GenBank, SRA, and the DOE National Microbiome Data Collaborative (NMDC), or submitted by external users. In the past couple of years, we have continued our effort to help the user community by improving the annotation pipeline, upgrading the contents with new reference database versions, and adding new analysis functionalities such as advanced scaffold search, Average Nucleotide Identity (ANI) for high-quality metagenome bins, new cassette search, improved gene neighborhood display, and improvements to metatranscriptome data display and analysis. We have also extended the collaboration and integration efforts with other DOE-funded projects such as NMDC and the DOE Systems Biology Knowledgebase (KBase).},
doi = {10.1093/nar/gkac976},
journal = {Nucleic Acids Research},
number = {D1},
volume = {51},
place = {United States},
year = {2022},
month = nov
}
