DOE PAGES
U.S. Department of Energy, Office of Scientific and Technical Information

Title: WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata

Abstract

With the advancement of genome-sequencing technologies, new genomes are being sequenced daily. Although these sequences are deposited in publicly available data warehouses, their functional and genomic annotations (beyond genes which are predicted automatically) mostly reside in the text of primary publications. Professional curators are hard at work extracting those annotations from the literature for the most studied organisms and depositing them in structured databases. However, the resources don’t exist to fund the comprehensive curation of the thousands of newly sequenced organisms in this manner. Here, we describe WikiGenomes (wikigenomes.org), a web application that facilitates the consumption and curation of genomic data by the entire scientific community. WikiGenomes is based on Wikidata, an openly editable knowledge graph with the goal of aggregating published knowledge into a free and open database. WikiGenomes empowers the individual genomic researcher to contribute their expertise to the curation effort and integrates the knowledge into Wikidata, enabling it to be accessed by anyone without restriction.

Authors:
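Putman, Tim E. [1]; Lelong, Sebastien [1]; Burgstaller-Muehlbacher, Sebastian [1]; Waagmeester, Andra [2]; Diesh, Colin [3]; Dunn, Nathan [4]; Munoz-Torres, Monica [4]; Stupp, Gregory S. [1]; Wu, Chunlei [1]; Su, Andrew I. [1]; Good, Benjamin M. [1]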
  1. Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA, 92037 USA
  2. Micelio, Antwerp, Belgium
  3. Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
  4. Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Publication Date:
2017-03-24
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Basic Energy Sciences (BES); National Institutes of Health (NIH) (United States)
OSTI Identifier:
1454673
Alternate Identifier(s):
OSTI ID: 1411647
Grant/Contract Number:  
AC02-05CH11231; R01GM089820; U54GM114833; U54DA036134; 5R01GM080203; 5R01HG004483
Resource Type:
Published Article
Journal Name:
Database
Additional Journal Information:
Journal Name: Database; Journal Volume: 2017; Journal ID: ISSN 1758-0463
Publisher:
Oxford University Press
Country of Publication:
United Kingdom
Language:
English
Subject:
96 KNOWLEDGE MANAGEMENT AND PRESERVATION

Citation Formats

Putman, Tim E., Lelong, Sebastien, Burgstaller-Muehlbacher, Sebastian, Waagmeester, Andra, Diesh, Colin, Dunn, Nathan, Munoz-Torres, Monica, Stupp, Gregory S., Wu, Chunlei, Su, Andrew I., and Good, Benjamin M. WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata. United Kingdom: N. p., 2017. Web. doi:10.1093/database/bax025.
Putman, Tim E., Lelong, Sebastien, Burgstaller-Muehlbacher, Sebastian, Waagmeester, Andra, Diesh, Colin, Dunn, Nathan, Munoz-Torres, Monica, Stupp, Gregory S., Wu, Chunlei, Su, Andrew I., & Good, Benjamin M. (2017). WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata. United Kingdom. https://doi.org/10.1093/database/bax025
Putman, Tim E., Lelong, Sebastien, Burgstaller-Muehlbacher, Sebastian, Waagmeester, Andra, Diesh, Colin, Dunn, Nathan, Munoz-Torres, Monica, Stupp, Gregory S., Wu, Chunlei, Su, Andrew I., and Good, Benjamin M. 2017. "WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata". United Kingdom. https://doi.org/10.1093/database/bax025.
@article{osti_1454673,
title = {WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata},
author = {Putman, Tim E. and Lelong, Sebastien and Burgstaller-Muehlbacher, Sebastian and Waagmeester, Andra and Diesh, Colin and Dunn, Nathan and Munoz-Torres, Monica and Stupp, Gregory S. and Wu, Chunlei and Su, Andrew I. and Good, Benjamin M.},
abstractNote = {With the advancement of genome-sequencing technologies, new genomes are being sequenced daily. Although these sequences are deposited in publicly available data warehouses, their functional and genomic annotations (beyond genes which are predicted automatically) mostly reside in the text of primary publications. Professional curators are hard at work extracting those annotations from the literature for the most studied organisms and depositing them in structured databases. However, the resources don’t exist to fund the comprehensive curation of the thousands of newly sequenced organisms in this manner. Here, we describe WikiGenomes (wikigenomes.org), a web application that facilitates the consumption and curation of genomic data by the entire scientific community. WikiGenomes is based on Wikidata, an openly editable knowledge graph with the goal of aggregating published knowledge into a free and open database. WikiGenomes empowers the individual genomic researcher to contribute their expertise to the curation effort and integrates the knowledge into Wikidata, enabling it to be accessed by anyone without restriction.},
doi = {10.1093/database/bax025},
journal = {Database},
volume = 2017,
place = {United Kingdom},
year = {2017},
month = mar
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1093/database/bax025

Citation Metrics:
Cited by: 21 works
Citation information provided by
Web of Science
