Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system
Abstract
Background: The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. Results: Here, we present a different approach, relying on tightly integrated method rather than "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. Conclusion: By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.
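- Authors:
- Chen, I-Min A.; Markowitz, Victor M.; Palaniappan, Krishna; Szeto, Ernest; Chu, Ken; Huang, Jinghua; Ratner, Anna; Pillay, Manoj; Hadjithomas, Michalis; Huntemann, Marcel; Mikhailova, Natalia; Ovchinnikova, Galina; Ivanova, Natalia N.; Kyrpides, Nikos C.
- Publication Date:
- 2016-04-26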
- Research Org.:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER)
- OSTI Identifier:
- 1618545
- Alternate Identifier(s):
- OSTI ID: 1379303
- Grant/Contract Number:
- AC02-05CH11231
- Resource Type:
- Published Article
- Journal Name:
- BMC Genomics
- Additional Journal Information:
- Journal Name: BMC Genomics; Journal Volume: 17; Journal Issue: 1; Journal ID: ISSN 1471-2164
- Publisher:
- Springer Science + Business Media
- Country of Publication:
- United Kingdom
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING; 60 APPLIED LIFE SCIENCES; Gene annotation; Functional curation; Manual curation; IMG; Metagenomics; Microbial genomics
Citation Formats
Chen, I-Min A., Markowitz, Victor M., Palaniappan, Krishna, Szeto, Ernest, Chu, Ken, Huang, Jinghua, Ratner, Anna, Pillay, Manoj, Hadjithomas, Michalis, Huntemann, Marcel, Mikhailova, Natalia, Ovchinnikova, Galina, Ivanova, Natalia N., and Kyrpides, Nikos C. Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system. United Kingdom: N. p., 2016. Web. doi:10.1186/s12864-016-2629-y.
Chen, I-Min A., Markowitz, Victor M., Palaniappan, Krishna, Szeto, Ernest, Chu, Ken, Huang, Jinghua, Ratner, Anna, Pillay, Manoj, Hadjithomas, Michalis, Huntemann, Marcel, Mikhailova, Natalia, Ovchinnikova, Galina, Ivanova, Natalia N., & Kyrpides, Nikos C. Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system. United Kingdom. https://doi.org/10.1186/s12864-016-2629-y
Chen, I-Min A., Markowitz, Victor M., Palaniappan, Krishna, Szeto, Ernest, Chu, Ken, Huang, Jinghua, Ratner, Anna, Pillay, Manoj, Hadjithomas, Michalis, Huntemann, Marcel, Mikhailova, Natalia, Ovchinnikova, Galina, Ivanova, Natalia N., and Kyrpides, Nikos C. 2016. "Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system". United Kingdom. https://doi.org/10.1186/s12864-016-2629-y.
@article{osti_1618545,
title = {Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system},
author = {Chen, I-Min A. and Markowitz, Victor M. and Palaniappan, Krishna and Szeto, Ernest and Chu, Ken and Huang, Jinghua and Ratner, Anna and Pillay, Manoj and Hadjithomas, Michalis and Huntemann, Marcel and Mikhailova, Natalia and Ovchinnikova, Galina and Ivanova, Natalia N. and Kyrpides, Nikos C.},
abstractNote = {Background: The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. Results: Here, we present a different approach, relying on tightly integrated method rather than "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. Conclusion: By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.},
doi = {10.1186/s12864-016-2629-y},
journal = {BMC Genomics},
number = 1,
volume = 17,
place = {United Kingdom},
year = {2016},
month = {4}
}
https://doi.org/10.1186/s12864-016-2629-y
Works referenced in this record:
WikiPathways: building research communities on biological pathways
journal, November 2011
- Kelder, T.; van Iersel, M. P.; Hanspers, K.
- Nucleic Acids Research, Vol. 40, Issue D1
IMG/M 4 version of the integrated metagenome comparative analysis system
journal, October 2013
- Markowitz, Victor M.; Chen, I-Min A.; Chu, Ken
- Nucleic Acids Research, Vol. 42, Issue D1, p. D568-D573
The RNA WikiProject: Community annotation of RNA families
journal, October 2008
- Daub, J.; Gardner, P. P.; Tate, J.
- RNA, Vol. 14, Issue 12
ORegAnno: an open-access community-driven resource for regulatory annotation
journal, December 2007
- Griffith, O. L.; Montgomery, S. B.; Bernier, B.
- Nucleic Acids Research, Vol. 36, Issue Database
TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes
journal, January 2007
- Selengut, J. D.; Haft, D. H.; Davidsen, T.
- Nucleic Acids Research, Vol. 35, Issue Database
The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification
journal, October 2014
- Reddy, T. B. K.; Thomas, Alex D.; Stamatis, Dimitri
- Nucleic Acids Research, Vol. 43, Issue D1
Fifteen years of microbial genomics: meeting the challenges and fulfilling the dream
journal, July 2009
- Kyrpides, Nikos C.
- Nature Biotechnology, Vol. 27, Issue 7
TOPSAN: a collaborative annotation environment for structural genomics.
journal, January 2010
- Weekes, Dana; Krishna, S. Sri; Bakolitsa, Constantina
- BMC Bioinformatics, Vol. 11, Issue 1
Whole-genome sequence annotation: 'Going wrong with confidence'
journal, May 1999
- Kyrpides, Nikos C.; Ouzounis, Christos A.
- Molecular Microbiology, Vol. 32, Issue 4
Improving Microbial Genome Annotations in an Integrated Database Context
journal, February 2013
- Chen, I-Min A.; Markowitz, Victor M.; Chu, Ken
- PLoS ONE, Vol. 8, Issue 2
IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites
journal, July 2015
- Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken
- mBio, Vol. 6, Issue 4
Data, information, knowledge and principle: back to metabolism in KEGG
journal, November 2013
- Kanehisa, Minoru; Goto, Susumu; Sato, Yoko
- Nucleic Acids Research, Vol. 42, Issue D1
Plant-Associated Symbiotic Burkholderia Species Lack Hallmark Strategies Required in Mammalian Pathogenesis
journal, January 2014
- Angus, Annette A.; Agapakis, Christina M.; Fong, Stephanie
- PLoS ONE, Vol. 9, Issue 1
miRBase: integrating microRNA annotation and deep-sequencing data
journal, October 2010
- Kozomara, A.; Griffiths-Jones, S.
- Nucleic Acids Research, Vol. 39, Issue Database
Collaboratories: doing science on the Internet
journal, January 1996
- Kouzes, R. T.; Myers, J. D.; Wulf, W. A.
- Computer, Vol. 29, Issue 8
Genome Sequence of the Nitroaromatic Compound-Degrading Bacterium Burkholderia sp. Strain SJ98
journal, May 2012
- Kumar, S.; Vikram, S.; Raghava, G. P. S.
- Journal of Bacteriology, Vol. 194, Issue 12
The future of biocuration
journal, September 2008
- Howe, Doug; Costanzo, Maria; Fey, Petra
- Nature, Vol. 455, Issue 7209
BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources
journal, January 2009
- Wu, Chunlei; Orozco, Camilo; Boyer, Jason
- Genome Biology, Vol. 10, Issue 11
SOP for pathway inference in Integrated Microbial Genomes (IMG)
journal, December 2010
- Anderson, Iain; Chen, Amy; Markowitz, Victor
- Standards in Genomic Sciences, Vol. 5, Issue 3
Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper
journal, March 2011
- Potnis, Neha; Krasileva, Ksenia; Chow, Virginia
- BMC Genomics, Vol. 12, Issue 1
Big Data: Astronomical or Genomical?
journal, July 2015
- Stephens, Zachary D.; Lee, Skylar Y.; Faghri, Faraz
- PLOS Biology, Vol. 13, Issue 7
The COG database: an updated version includes eukaryotes
journal, January 2003
- Tatusov, Roman L.; Fedorova, Natalie D.; Jackson, John D.
- BMC Bioinformatics, Vol. 4, Article No. 41
Calling on a million minds for community annotation in WikiProteins
journal, January 2008
- Mons, Barend; Ashburner, Michael; Chichester, Christine
- Genome Biology, Vol. 9, Issue 5
Pfam: the protein families database
journal, November 2013
- Finn, Robert D.; Bateman, Alex; Clements, Jody
- Nucleic Acids Research, Vol. 42, Issue D1
Comparative genomics of three Methanocellales strains reveal novel taxonomic and metabolic features
journal, April 2015
- Lyu, Zhe; Lu, Yahai
- Environmental Microbiology Reports, Vol. 7, Issue 3
Artemis: sequence visualization and annotation
journal, October 2000
- Rutherford, K.; Parkhill, J.; Crook, J.
- Bioinformatics, Vol. 16, Issue 10
A wiki for the life sciences where authorship matters
journal, August 2008
- Hoffmann, Robert
- Nature Genetics, Vol. 40, Issue 9
IMG ER: a system for microbial genome annotation expert review and curation
journal, June 2009
- Markowitz, Victor M.; Mavromatis, Konstantinos; Ivanova, Natalia N.
- Bioinformatics, Vol. 25, Issue 17
GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes
journal, May 2010
- Pati, Amrita; Ivanova, Natalia N.; Mikhailova, Natalia
- Nature Methods, Vol. 7, Issue 6
Works referencing / citing this record:
Funding knowledgebases: Towards a sustainable funding model for the UniProt use case
journal, January 2017
- Gabella, Chiara; Durinx, Christine; Appel, Ron
- F1000Research, Vol. 6
Genome analysis of the marine bacterium Kiloniella laminariae and first insights into comparative genomics with related Kiloniella species
journal, December 2019
- Wiese, Jutta; Imhoff, Johannes F.; Horn, Hannes
- Archives of Microbiology, Vol. 202, Issue 4
C-4 sterol demethylation enzymes distinguish bacterial and eukaryotic sterol synthesis
journal, May 2018
- Lee, Alysha K.; Banta, Amy B.; Wei, Jeremy H.
- Proceedings of the National Academy of Sciences, Vol. 115, Issue 23
Enabling the democratization of the genomics revolution with a fully integrated web-based bioinformatics platform
journal, November 2016
- Li, Po-E; Lo, Chien-Chi; Anderson, Joseph J.
- Nucleic Acids Research, Vol. 45, Issue 1
MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes
journal, November 2016
- Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane
- Nucleic Acids Research, Vol. 45, Issue D1
IMG/M: integrated genome and metagenome comparative data analysis system
journal, October 2016
- Chen, I-Min A.; Markowitz, Victor M.; Chu, Ken
- Nucleic Acids Research, Vol. 45, Issue D1
IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes
journal, October 2018
- Chen, I-Min A.; Chu, Ken; Palaniappan, Krishna
- Nucleic Acids Research, Vol. 47, Issue D1
High-quality draft genome sequences of Pseudomonas monteilii DSM 14164T, Pseudomonas mosselii DSM 17497T, Pseudomonas plecoglossicida DSM 15088T, Pseudomonas taiwanensis DSM 21245T and Pseudomonas vranovensis DSM 16006T: taxonomic considerations
journal, December 2019
- Peña, Arantxa; Busquets, Antonio; Gomila, Margarita
- Access Microbiology, Vol. 1, Issue 10
Reclassification of a Polynucleobacter cosmopolitanus strain isolated from tropical Lake Victoria as Polynucleobacter victoriensis sp. nov.
journal, December 2017
- Hahn, Martin W.; Schmidt, Johanna; Asiyo, Grace Ssanyu
- International Journal of Systematic and Evolutionary Microbiology, Vol. 67, Issue 12
Complete Genome Sequence of Thermoanaerobacterium sp. Strain RBIITD, a Butyrate- and Butanol-Producing Thermophile
journal, January 2018
- Biswas, Ranjita; Huntemann, Marcel; Clum, Alicia
- Genome Announcements, Vol. 6, Issue 2
Complete Genome Sequence for Asinibacterium sp. Strain OR53 and Draft Genome Sequence for Asinibacterium sp. Strain OR43, Two Bacteria Tolerant to Uranium
journal, April 2019
- Brzoska, Ryann M.; Huntemann, Marcel; Clum, Alicia
- Microbiology Resource Announcements, Vol. 8, Issue 14
GROOLS: reactive graph reasoning for genome annotation through biological processes
journal, April 2018
- Mercier, Jonathan; Josso, Adrien; Médigue, Claudine
- BMC Bioinformatics, Vol. 19, Issue 1
Genome-Scale Data Call for a Taxonomic Rearrangement of Geodermatophilaceae
journal, December 2017
- Montero-Calasanz, Maria del Carmen; Meier-Kolthoff, Jan P.; Zhang, Dao-Feng
- Frontiers in Microbiology, Vol. 8
Bacterial Metabolites Produced Under Iron Limitation Kill Pinewood Nematode and Attract Caenorhabditis elegans
journal, September 2019
- Proença, Diogo Neves; Heine, Thomas; Senges, Christoph H. R.
- Frontiers in Microbiology, Vol. 10