skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Computing Environment to Support Repeatable Scientific Big Data Experimentation of World-Wide Scientific Literature

Abstract

A principal tenant of the scientific method is that experiments must be repeatable and relies on ceteris paribus (i.e., all other things being equal). As a scientific community, involved in data sciences, we must investigate ways to establish an environment where experiments can be repeated. We can no longer allude to where the data comes from, we must add rigor to the data collection and management process from which our analysis is conducted. This paper describes a computing environment to support repeatable scientific big data experimentation of world-wide scientific literature, and recommends a system that is housed at the Oak Ridge National Laboratory in order to provide value to investigators from government agencies, academic institutions, and industry entities. The described computing environment also adheres to the recently instituted digital data management plan mandated by multiple US government agencies, which involves all stages of the digital data life cycle including capture, analysis, sharing, and preservation. It particularly focuses on the sharing and preservation of digital research data. The details of this computing environment are explained within the context of cloud services by the three layer classification of Software as a Service , Platform as a Service , and Infrastructure as amore » Service .« less

Authors:
 [1];  [1];  [1];  [1]
  1. ORNL
Publication Date:
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
Work for Others (WFO)
OSTI Identifier:
1190759
DOE Contract Number:
DE-AC05-00OR22725
Resource Type:
Conference
Resource Relation:
Conference: 15th Internatonal Conference on Scientometrics & Informetrics, Istanbul, Turkey, 20150629, 20150703
Country of Publication:
United States
Language:
English
Subject:
96 KNOWLEDGE MANAGEMENT AND PRESERVATION; Science policy and research assessment; Methods and techniques; "Software as a Service; Platform as a Service; and Infrastructure as a Service"

Citation Formats

Schlicher, Bob G, Kulesz, James J, Abercrombie, Robert K, and Kruse, Kara L. A Computing Environment to Support Repeatable Scientific Big Data Experimentation of World-Wide Scientific Literature. United States: N. p., 2015. Web.
Schlicher, Bob G, Kulesz, James J, Abercrombie, Robert K, & Kruse, Kara L. A Computing Environment to Support Repeatable Scientific Big Data Experimentation of World-Wide Scientific Literature. United States.
Schlicher, Bob G, Kulesz, James J, Abercrombie, Robert K, and Kruse, Kara L. Thu . "A Computing Environment to Support Repeatable Scientific Big Data Experimentation of World-Wide Scientific Literature". United States. doi:. https://www.osti.gov/servlets/purl/1190759.
@article{osti_1190759,
title = {A Computing Environment to Support Repeatable Scientific Big Data Experimentation of World-Wide Scientific Literature},
author = {Schlicher, Bob G and Kulesz, James J and Abercrombie, Robert K and Kruse, Kara L},
abstractNote = {A principal tenant of the scientific method is that experiments must be repeatable and relies on ceteris paribus (i.e., all other things being equal). As a scientific community, involved in data sciences, we must investigate ways to establish an environment where experiments can be repeated. We can no longer allude to where the data comes from, we must add rigor to the data collection and management process from which our analysis is conducted. This paper describes a computing environment to support repeatable scientific big data experimentation of world-wide scientific literature, and recommends a system that is housed at the Oak Ridge National Laboratory in order to provide value to investigators from government agencies, academic institutions, and industry entities. The described computing environment also adheres to the recently instituted digital data management plan mandated by multiple US government agencies, which involves all stages of the digital data life cycle including capture, analysis, sharing, and preservation. It particularly focuses on the sharing and preservation of digital research data. The details of this computing environment are explained within the context of cloud services by the three layer classification of Software as a Service , Platform as a Service , and Infrastructure as a Service .},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Thu Jan 01 00:00:00 EST 2015},
month = {Thu Jan 01 00:00:00 EST 2015}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: