skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Crowdsourcing and curation: perspectives from biology and natural language processing

Abstract

Crowdsourcing is increasingly utilized for performing tasks in both natural language processing and biocuration. Although there have been many applications of crowdsourcing in these fields, there have been fewer high-level discussions of the methodology and its applicability to biocuration. This paper explores crowdsourcing for biocuration through several case studies that highlight different ways of leveraging ‘the crowd’; these raise issues about the kind(s) of expertise needed, the motivations of participants, and questions related to feasibility, cost and quality. The paper is an outgrowth of a panel session held at BioCreative V (Seville, September 9–11, 2015). The session consisted of four short talks, followed by a discussion. In their talks, the panelists explored the role of expertise and the potential to improve crowd performance by training; the challenge of decomposing tasks to make them amenable to crowdsourcing; and the capture of biological data and metadata through community editing.

Authors:
 [1];  [2];  [3];  [4];  [5];  [6]
  1. The MITRE Corporation, Bedford, MA (United States)
  2. Univ. of Paris-Sorbonne, Paris (France). STIH Team
  3. Philip Morris Products S.A., Neuchatel (Switzerland). Philip Morris International R&D
  4. USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)
  5. National Inst. of Health (NIH), Bethesda, MD (United States). National Library of Medicine. National Center for Biotechnology Information
  6. Univ. of Colorado, Denver, CO (United States). School of Medicine
Publication Date:
Research Org.:
USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); The MITRE Corporation, Bedford, MA (United States); National Inst. of Health (NIH), Bethesda, MD (United States); Univ. of Paris-Sorbonne, Paris (France)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER) (SC-23); National Inst. of Health (NIH) (United States); National Science Foundation (NSF); Institute for Research in Computer Science and Automation (INRIA) (France); Ministry of Culture (France); Philip Morris International (United States)
Contributing Org.:
Philip Morris Products S.A., Neuchatel (Switzerland); Univ. of Colorado, Denver, CO (United States)
OSTI Identifier:
1360095
Grant/Contract Number:  
SC0010838; R13-GM109648-01A1; 2R01 LM008111-09A1; LM009254-09; 1R01MH096906-01A1; IIS-1207592
Resource Type:
Journal Article: Accepted Manuscript
Journal Name:
Database
Additional Journal Information:
Journal Volume: 2016; Journal ID: ISSN 1758-0463
Publisher:
Oxford University Press
Country of Publication:
United States
Language:
English
Subject:
96 KNOWLEDGE MANAGEMENT AND PRESERVATION

Citation Formats

Hirschman, Lynette, Fort, Karën, Boué, Stéphanie, Kyrpides, Nikos, Islamaj Doğan, Rezarta, and Cohen, Kevin Bretonnel. Crowdsourcing and curation: perspectives from biology and natural language processing. United States: N. p., 2016. Web. doi:10.1093/database/baw115.
Hirschman, Lynette, Fort, Karën, Boué, Stéphanie, Kyrpides, Nikos, Islamaj Doğan, Rezarta, & Cohen, Kevin Bretonnel. Crowdsourcing and curation: perspectives from biology and natural language processing. United States. doi:10.1093/database/baw115.
Hirschman, Lynette, Fort, Karën, Boué, Stéphanie, Kyrpides, Nikos, Islamaj Doğan, Rezarta, and Cohen, Kevin Bretonnel. Mon . "Crowdsourcing and curation: perspectives from biology and natural language processing". United States. doi:10.1093/database/baw115. https://www.osti.gov/servlets/purl/1360095.
@article{osti_1360095,
title = {Crowdsourcing and curation: perspectives from biology and natural language processing},
author = {Hirschman, Lynette and Fort, Karën and Boué, Stéphanie and Kyrpides, Nikos and Islamaj Doğan, Rezarta and Cohen, Kevin Bretonnel},
abstractNote = {Crowdsourcing is increasingly utilized for performing tasks in both natural language processing and biocuration. Although there have been many applications of crowdsourcing in these fields, there have been fewer high-level discussions of the methodology and its applicability to biocuration. This paper explores crowdsourcing for biocuration through several case studies that highlight different ways of leveraging ‘the crowd’; these raise issues about the kind(s) of expertise needed, the motivations of participants, and questions related to feasibility, cost and quality. The paper is an outgrowth of a panel session held at BioCreative V (Seville, September 9–11, 2015). The session consisted of four short talks, followed by a discussion. In their talks, the panelists explored the role of expertise and the potential to improve crowd performance by training; the challenge of decomposing tasks to make them amenable to crowdsourcing; and the capture of biological data and metadata through community editing.},
doi = {10.1093/database/baw115},
journal = {Database},
number = ,
volume = 2016,
place = {United States},
year = {Mon Aug 08 00:00:00 EDT 2016},
month = {Mon Aug 08 00:00:00 EDT 2016}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 1 work
Citation information provided by
Web of Science

Save / Share: