skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: DataONE: A Data Federation with Provenance Support

Abstract

DataONE is a federated data network focusing on earth and environmental science data. We present the provenance and search features of DataONE by means of an example involving three earth scientists who interact through a DataONE Member Node. DataONE provenance systems enable reproducible research and facilitate proper attribution of scientific results transitively across generations of derived data products.

Authors:
 [1];  [2];  [3];  [2];  [1];  [1];  [4];  [5];  [2];  [6];  [2]
  1. University of Illinois at Urbana-Champaign
  2. National Center for Ecological Analysis and Synthesis, Santa Barbara
  3. Universidad Popular Aut├│noma del Estado de Puebla, Puebla, Mexico
  4. Newcastle University, UK
  5. Woods Hole Research Center (WHRC), Massachusetts
  6. University of Kansas
Publication Date:
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
Work for Others (WFO)
OSTI Identifier:
1325494
DOE Contract Number:
AC05-00OR22725
Resource Type:
Book
Country of Publication:
United States
Language:
English

Citation Formats

Cao, Yang, Jones, Christopher, Cuevas-Vicenttin, Victor, Jones, Matthew B., Ludascher, Bertram, McPhillips, Timothy, Missier, Paolo, Schwalm, Christopher, Slaughter, Peter, Vieglais, Dave, and Walker, Lauren. DataONE: A Data Federation with Provenance Support. United States: N. p., 2016. Web.
Cao, Yang, Jones, Christopher, Cuevas-Vicenttin, Victor, Jones, Matthew B., Ludascher, Bertram, McPhillips, Timothy, Missier, Paolo, Schwalm, Christopher, Slaughter, Peter, Vieglais, Dave, & Walker, Lauren. DataONE: A Data Federation with Provenance Support. United States.
Cao, Yang, Jones, Christopher, Cuevas-Vicenttin, Victor, Jones, Matthew B., Ludascher, Bertram, McPhillips, Timothy, Missier, Paolo, Schwalm, Christopher, Slaughter, Peter, Vieglais, Dave, and Walker, Lauren. Fri . "DataONE: A Data Federation with Provenance Support". United States. doi:.
@article{osti_1325494,
title = {DataONE: A Data Federation with Provenance Support},
author = {Cao, Yang and Jones, Christopher and Cuevas-Vicenttin, Victor and Jones, Matthew B. and Ludascher, Bertram and McPhillips, Timothy and Missier, Paolo and Schwalm, Christopher and Slaughter, Peter and Vieglais, Dave and Walker, Lauren},
abstractNote = {DataONE is a federated data network focusing on earth and environmental science data. We present the provenance and search features of DataONE by means of an example involving three earth scientists who interact through a DataONE Member Node. DataONE provenance systems enable reproducible research and facilitate proper attribution of scientific results transitively across generations of derived data products.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Fri Jan 01 00:00:00 EST 2016},
month = {Fri Jan 01 00:00:00 EST 2016}
}

Book:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this book.

Save / Share:
  • Despite the methodical and painstaking efforts made by scientists to record their scientific findings and protocols, a knowledge gap problem continues to persist today between producers of scientific results and consumers because technology is performing the exchange of data as opposed to scientists making direct contact. Provenance is a means to formalize how this knowledge is transferred. However, for it to be meaningful to scientists, the provenance research community needs continued contributions from the scientific community to extend and leverage provenance-based vocabularies and technology from the provenance community. Going forward the provenance community must also be vigilant to meet scalabilitymore » needs of data intensive science« less
  • The process of Data Mining converts information to knowledge by utilizing tools from the disciplines of computational statistics, database technologies, machine learning, signal processing, nonlinear dynamics, process modeling, simulation, and allied disciplines. Data Mining allows business problems to be analyzed from diverse perspectives, including dimensionality reduction, correlation and co-occurrence, clustering and classification, regression and forecasting, anomaly detection, and change analysis. The predictive insights generated from Data Mining can be further utilized through real-time analysis and decision sciences, as well as through human-driven analysis based on management by exceptions or by objectives, to generate actionable knowledge. The tools that enable themore » transformation of raw data to actionable predictive insights are collectively referred as Decision Support tools. This chapter presents a new formalization of the decision process, leading to a new Decision Superiority model, partially motivated by the Joint Directors of Laboratories (JDL) Data Fusion Model. In addition, it examines the growing importance of Data Fusion concepts.« less
  • Over 500 papers by authors from 37 countries were presented at this triennial congress. Theoretical and practical subjects covering the entire field of automatic control are reviewed in the six volumes. Each volume includes a plenary paper, transcripts of panel discussions, an author index, subject index, and contents of the other volumes in the set.
  • The paper summarizes the events of the Federation's 11th annual meeting held in Denver, Colorado, October 15-16, 1975. It includes summaries of remarks and papers presented and the results of the meeting of the board of directors. Following the summaries of the remarks by Calvin L. Rampton and Jack M. Campbell, Philip M. Burgess, executive director of the Federation reports the findings, conclusions, and recommendations of a report by the Executive Office of the President entitled: ''Strengthening Public Management in the Intergovernmental System''; its implications for the states and the region are summarized. Summaries are given by nine governmental, state,more » or industrial spokesmen on strip mining and utility siting. Governors from Colorado, Montana, Utah, and New Mexico and Congressmen from Colorado and New Mexico provided summary statements. Issues of concern to the region included information on natural resources, agricultural resources, housing and community development, transportation, human resources, market development, telecommunications, arts and humanities, and regional coordinating. (MCW)« less