WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata
Abstract
With the advancement of genome-sequencing technologies, new genomes are being sequenced daily. Although these sequences are deposited in publicly available data warehouses, their functional and genomic annotations (beyond genes which are predicted automatically) mostly reside in the text of primary publications. Professional curators are hard at work extracting those annotations from the literature for the most studied organisms and depositing them in structured databases. However, the resources don’t exist to fund the comprehensive curation of the thousands of newly sequenced organisms in this manner. Here, we describe WikiGenomes (wikigenomes.org), a web application that facilitates the consumption and curation of genomic data by the entire scientific community. WikiGenomes is based on Wikidata, an openly editable knowledge graph with the goal of aggregating published knowledge into a free and open database. WikiGenomes empowers the individual genomic researcher to contribute their expertise to the curation effort and integrates the knowledge into Wikidata, enabling it to be accessed by anyone without restriction.
- Authors:
-
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA, 92037 USA
- Micelio, Antwerp, Belgium
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Publication Date:
- Research Org.:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Basic Energy Sciences (BES); National Inst. of Health (NIH) (United States)
- OSTI Identifier:
- 1454673
- Alternate Identifier(s):
- OSTI ID: 1411647
- Grant/Contract Number:
- AC02-05CH11231; R01GM089820; U54GM114833; U54DA036134; 5R01GM080203; 5R01HG004483
- Resource Type:
- Published Article
- Journal Name:
- Database
- Additional Journal Information:
- Journal Name: Database Journal Volume: 2017; Journal ID: ISSN 1758-0463
- Publisher:
- Oxford University Press
- Country of Publication:
- United Kingdom
- Language:
- English
- Subject:
- 96 KNOWLEDGE MANAGEMENT AND PRESERVATION
Citation Formats
Putman, Tim E., Lelong, Sebastien, Burgstaller-Muehlbacher, Sebastian, Waagmeester, Andra, Diesh, Colin, Dunn, Nathan, Munoz-Torres, Monica, Stupp, Gregory S., Wu, Chunlei, Su, Andrew I., and Good, Benjamin M. WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata. United Kingdom: N. p., 2017.
Web. doi:10.1093/database/bax025.
Putman, Tim E., Lelong, Sebastien, Burgstaller-Muehlbacher, Sebastian, Waagmeester, Andra, Diesh, Colin, Dunn, Nathan, Munoz-Torres, Monica, Stupp, Gregory S., Wu, Chunlei, Su, Andrew I., & Good, Benjamin M. WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata. United Kingdom. https://doi.org/10.1093/database/bax025
Putman, Tim E., Lelong, Sebastien, Burgstaller-Muehlbacher, Sebastian, Waagmeester, Andra, Diesh, Colin, Dunn, Nathan, Munoz-Torres, Monica, Stupp, Gregory S., Wu, Chunlei, Su, Andrew I., and Good, Benjamin M. Fri .
"WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata". United Kingdom. https://doi.org/10.1093/database/bax025.
@article{osti_1454673,
title = {WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata},
author = {Putman, Tim E. and Lelong, Sebastien and Burgstaller-Muehlbacher, Sebastian and Waagmeester, Andra and Diesh, Colin and Dunn, Nathan and Munoz-Torres, Monica and Stupp, Gregory S. and Wu, Chunlei and Su, Andrew I. and Good, Benjamin M.},
abstractNote = {With the advancement of genome-sequencing technologies, new genomes are being sequenced daily. Although these sequences are deposited in publicly available data warehouses, their functional and genomic annotations (beyond genes which are predicted automatically) mostly reside in the text of primary publications. Professional curators are hard at work extracting those annotations from the literature for the most studied organisms and depositing them in structured databases. However, the resources don’t exist to fund the comprehensive curation of the thousands of newly sequenced organisms in this manner. Here, we describe WikiGenomes (wikigenomes.org), a web application that facilitates the consumption and curation of genomic data by the entire scientific community. WikiGenomes is based on Wikidata, an openly editable knowledge graph with the goal of aggregating published knowledge into a free and open database. WikiGenomes empowers the individual genomic researcher to contribute their expertise to the curation effort and integrates the knowledge into Wikidata, enabling it to be accessed by anyone without restriction.},
doi = {10.1093/database/bax025},
journal = {Database},
number = ,
volume = 2017,
place = {United Kingdom},
year = {Fri Mar 24 00:00:00 EDT 2017},
month = {Fri Mar 24 00:00:00 EDT 2017}
}
https://doi.org/10.1093/database/bax025
Web of Science
Works referenced in this record:
A Gene Wiki for Community Annotation of Gene Function
journal, July 2008
- Huss, Jon W.; Orozco, Camilo; Goodale, James
- PLoS Biology, Vol. 6, Issue 7
DOOR: a database for prokaryotic operons
journal, November 2008
- Mao, Fenglou; Dam, Phuongan; Chou, Jacky
- Nucleic Acids Research, Vol. 37, Issue suppl_1
WikiPathways: Pathway Editing for the People
journal, July 2008
- Pico, Alexander R.; Kelder, Thomas; van Iersel, Martijn P.
- PLoS Biology, Vol. 6, Issue 7
ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics
journal, October 2012
- Howe, Douglas G.; Bradford, Yvonne M.; Conlin, Tom
- Nucleic Acids Research, Vol. 41, Issue D1
The Gene Wiki in 2011: community intelligence applied to human gene annotation
journal, November 2011
- Good, Benjamin M.; Clarke, Erik L.; de Alfaro, Luca
- Nucleic Acids Research, Vol. 40, Issue D1
The transcriptional landscape of Chlamydia pneumoniae
journal, January 2011
- Albrecht, Marco; Sharma, Cynthia M.; Dittrich, Marcus T.
- Genome Biology, Vol. 12, Issue 10
ODB: a database for operon organizations, 2011 update
journal, November 2010
- Okuda, S.; Yoshizawa, A. C.
- Nucleic Acids Research, Vol. 39, Issue Database
OpenFlyData: An exemplar data web integrating gene expression data on the fruit fly Drosophila melanogaster
journal, October 2010
- Miles, Alistair; Zhao, Jun; Klyne, Graham
- Journal of Biomedical Informatics, Vol. 43, Issue 5
JBrowse: A next-generation genome browser
journal, July 2009
- Skinner, M. E.; Uzilov, A. V.; Stein, L. D.
- Genome Research, Vol. 19, Issue 9
Next generation models for storage and representation of microbial biological annotation
journal, October 2010
- Quest, Daniel J.; Land, Miriam L.; Brettin, Thomas S.
- BMC Bioinformatics, Vol. 11, Issue S6
Calling on a million minds for community annotation in WikiProteins
journal, January 2008
- Mons, Barend; Ashburner, Michael; Chichester, Christine
- Genome Biology, Vol. 9, Issue 5
Web Apollo: a web-based genomic annotation editing platform
journal, January 2013
- Lee, Eduardo; Helt, Gregg A.; Reese, Justin T.
- Genome Biology, Vol. 14, Issue 8
Mouse genome database 2016
journal, November 2015
- Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.
- Nucleic Acids Research, Vol. 44, Issue D1
Wikidata: a free collaborative knowledgebase
journal, September 2014
- Vrandečić, Denny; Krötzsch, Markus
- Communications of the ACM, Vol. 57, Issue 10
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation
journal, November 2015
- O'Leary, Nuala A.; Wright, Mathew W.; Brister, J. Rodney
- Nucleic Acids Research, Vol. 44, Issue D1
JBrowse: a dynamic web platform for genome visualization and analysis
journal, April 2016
- Buels, Robert; Yao, Eric; Diesh, Colin M.
- Genome Biology, Vol. 17, Issue 1
YeastHub: a semantic web use case for integrating data in the life sciences domain
journal, June 2005
- Cheung, K. -H.; Yip, K. Y.; Smith, A.
- Bioinformatics, Vol. 21, Issue Suppl 1
A wiki for the life sciences where authorship matters
journal, August 2008
- Hoffmann, Robert
- Nature Genetics, Vol. 40, Issue 9
Wikidata as a semantic framework for the Gene Wiki initiative
journal, January 2016
- Burgstaller-Muehlbacher, Sebastian; Waagmeester, Andra; Mitraka, Elvira
- Database, Vol. 2016
The future of biocuration
journal, September 2008
- Howe, Doug; Costanzo, Maria; Fey, Petra
- Nature, Vol. 455, Issue 7209
Centralizing content and distributing labor: a community model for curating the very long tail of microbial genomes
journal, January 2016
- Putman, Tim E.; Burgstaller-Muehlbacher, Sebastian; Waagmeester, Andra
- Database, Vol. 2016