OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Massive Data, the Digitization of Science, and Reproducibility of Results

Abstract

As the scientific enterprise becomes increasingly computational and data-driven, the nature of the information communicated must change. Without inclusion of the code and data with published computational results, we are engendering a credibility crisis in science. Controversies such as ClimateGate, the microarray-based drug sensitivity clinical trials under investigation at Duke University, and retractions from prominent journals due to unverified code suggest the need for greater transparency in our computational science. In this talk I argue that the scientific method be restored to (1) a focus on error control as central to scientific communication and (2) complete communication of the underlying methodology producing the results, ie. reproducibility. I outline barriers to these goals based on recent survey work (Stodden 2010), and suggest solutions such as the “Reproducible Research Standard” (Stodden 2009), giving open licensing options designed to create an intellectual property framework for scientists consonant with longstanding scientific norms.

Authors:
Publication Date:
OSTI Identifier:
1026265
Resource Type:
Multimedia
Country of Publication:
CERN
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; COMPUTATIONAL METHODS; MASSIVE DATA; CODE; REPRODUCIBILITY; SHARING; COPYRIGHT; CREDIBILITY

Citation Formats

Stodden, Victoria. Massive Data, the Digitization of Science, and Reproducibility of Results. CERN: N. p., 2010. Web.
Stodden, Victoria. Massive Data, the Digitization of Science, and Reproducibility of Results. CERN.
Stodden, Victoria. Fri . "Massive Data, the Digitization of Science, and Reproducibility of Results". CERN. https://www.osti.gov/servlets/purl/1026265.
@article{osti_1026265,
title = {Massive Data, the Digitization of Science, and Reproducibility of Results},
author = {Stodden, Victoria},
abstractNote = {As the scientific enterprise becomes increasingly computational and data-driven, the nature of the information communicated must change. Without inclusion of the code and data with published computational results, we are engendering a credibility crisis in science. Controversies such as ClimateGate, the microarray-based drug sensitivity clinical trials under investigation at Duke University, and retractions from prominent journals due to unverified code suggest the need for greater transparency in our computational science. In this talk I argue that the scientific method be restored to (1) a focus on error control as central to scientific communication and (2) complete communication of the underlying methodology producing the results, ie. reproducibility. I outline barriers to these goals based on recent survey work (Stodden 2010), and suggest solutions such as the “Reproducible Research Standard” (Stodden 2009), giving open licensing options designed to create an intellectual property framework for scientists consonant with longstanding scientific norms.},
doi = {},
journal = {},
number = ,
volume = ,
place = {CERN},
year = {Fri Jul 02 00:00:00 EDT 2010},
month = {Fri Jul 02 00:00:00 EDT 2010}
}

Multimedia:

Save / Share:
Search Science Cinema