skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: An analysis and metric of reusable data licensing practices for biomedical resources

Abstract

Data are the foundation of science, and there is an increasing focus on how data can be reused and enhanced to drive scientific discoveries. However, most seemingly “open data” do not provide legal permissions for reuse and redistribution. The inability to integrate and redistribute our collective data resources blocks innovation and stymies the creation of life-improving diagnostic and drug selection tools. To help the biomedical research and research support communities (e.g. libraries, funders, repositories, etc.) understand and navigate the data licensing landscape, the (Re)usable Data Project (RDP) assesses the licensing characteristics of data resources and how licensing behaviors impact reuse. We have created a ruleset to determine the reusability of data resources and have applied it to 56 scientific data resources (e.g. databases) to date. Here, the results show significant reuse and interoperability barriers. Inspired by game-changing projects like Creative Commons, the Wikipedia Foundation, and the Free Software movement, we hope to engage the scientific community in the discussion regarding the legal use and reuse of scientific data, including the balance of openness and how to create sustainable data resources in an increasingly competitive environment.

Authors:
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [3]; ORCiD logo [4];  [2];  [5];  [6]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Oregon Health & Science Univ., Portland, OR (United States)
  3. Oregon State Univ., Corvallis, OR (United States)
  4. Open Knowledge International, London (United Kingdom)
  5. Oregon State Univ., Corvallis, OR (United States); Oregon Health & Science Univ., Portland, OR (United States)
  6. King Abdulaziz Univ. (Saudi Arabia)
Publication Date:
Research Org.:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1542395
DOE Contract Number:  
AC02-05CH11231
Resource Type:
Journal Article
Journal Name:
PLoS ONE
Additional Journal Information:
Journal Volume: 14; Journal Issue: 3; Journal ID: ISSN 1932-6203
Publisher:
Public Library of Science
Country of Publication:
United States
Language:
English
Subject:
96 KNOWLEDGE MANAGEMENT AND PRESERVATION

Citation Formats

Carbon, Seth, Champieux, Robin, McMurry, Julie A., Winfree, Lilly, Wyatt, Letisha R., Haendel, Melissa A., and Mehmood, Rashid. An analysis and metric of reusable data licensing practices for biomedical resources. United States: N. p., 2019. Web. doi:10.1371/journal.pone.0213090.
Carbon, Seth, Champieux, Robin, McMurry, Julie A., Winfree, Lilly, Wyatt, Letisha R., Haendel, Melissa A., & Mehmood, Rashid. An analysis and metric of reusable data licensing practices for biomedical resources. United States. doi:10.1371/journal.pone.0213090.
Carbon, Seth, Champieux, Robin, McMurry, Julie A., Winfree, Lilly, Wyatt, Letisha R., Haendel, Melissa A., and Mehmood, Rashid. Wed . "An analysis and metric of reusable data licensing practices for biomedical resources". United States. doi:10.1371/journal.pone.0213090. https://www.osti.gov/servlets/purl/1542395.
@article{osti_1542395,
title = {An analysis and metric of reusable data licensing practices for biomedical resources},
author = {Carbon, Seth and Champieux, Robin and McMurry, Julie A. and Winfree, Lilly and Wyatt, Letisha R. and Haendel, Melissa A. and Mehmood, Rashid},
abstractNote = {Data are the foundation of science, and there is an increasing focus on how data can be reused and enhanced to drive scientific discoveries. However, most seemingly “open data” do not provide legal permissions for reuse and redistribution. The inability to integrate and redistribute our collective data resources blocks innovation and stymies the creation of life-improving diagnostic and drug selection tools. To help the biomedical research and research support communities (e.g. libraries, funders, repositories, etc.) understand and navigate the data licensing landscape, the (Re)usable Data Project (RDP) assesses the licensing characteristics of data resources and how licensing behaviors impact reuse. We have created a ruleset to determine the reusability of data resources and have applied it to 56 scientific data resources (e.g. databases) to date. Here, the results show significant reuse and interoperability barriers. Inspired by game-changing projects like Creative Commons, the Wikipedia Foundation, and the Free Software movement, we hope to engage the scientific community in the discussion regarding the legal use and reuse of scientific data, including the balance of openness and how to create sustainable data resources in an increasingly competitive environment.},
doi = {10.1371/journal.pone.0213090},
journal = {PLoS ONE},
issn = {1932-6203},
number = 3,
volume = 14,
place = {United States},
year = {2019},
month = {3}
}

Works referenced in this record:

An open source pharma roadmap
journal, April 2017


First, design for data sharing
journal, April 2016

  • Wilbanks, John; Friend, Stephen H.
  • Nature Biotechnology, Vol. 34, Issue 4
  • DOI: 10.1038/nbt.3516

Legal confusion threatens to slow data science
journal, August 2016


Open by default: a proposed copyright license and waiver agreement for open access research and data in peer-reviewed journals
journal, September 2012


The FAIR Guiding Principles for scientific data management and stewardship
journal, March 2016

  • Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
  • Scientific Data, Vol. 3, Issue 1
  • DOI: 10.1038/sdata.2016.18