skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: End-to-end Provenance Traceability and Reproducibility Through %60%60Palletized'' Simulation Data.

Abstract

Trusting simulation output is crucial for Sandia's mission objectives. We rely on these simulations to perform our high-consequence mission tasks given our treaty obligations. Other science and modelling needs, while they may not be high-consequence, still require the strongest levels of trust to enable using the result as the foundation for both practical applications and future research. To this end, the computing community has developed work- flow and provenance systems to aid in both automating simulation and modelling execution, but to also aid in determining exactly how was some output created so that conclusions can be drawn from the data. Current approaches for workflows and provenance systems are all at the user level and have little to no system level support making them fragile, difficult to use, and incomplete so- lutions. The introduction of container technology is a first step towards encapsulating and tracking artifacts used in creating data and resulting insights, but their current implemen- tation is focused solely on making it easy to deploy an application in an isolated "sandbox" and maintaining a strictly read-only mode to avoid any potential changes to the application. All storage activities are still using the system-level shared storage. This project was anmore » initial exploration into extending the container concept to also include storage and to use writeable containers, auto generated by the system, as a way to link the contained data back to the simulation and input deck used to create it.« less

Authors:
; ;
Publication Date:
Research Org.:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Org.:
USDOE National Nuclear Security Administration (NNSA)
OSTI Identifier:
1531316
Report Number(s):
SAND2018-11117
669642
DOE Contract Number:  
AC04-94AL85000
Resource Type:
Technical Report
Country of Publication:
United States
Language:
English

Citation Formats

Lofstead, Gerald Fredrick, Younge, Andrew J, and Baker, Joshua. End-to-end Provenance Traceability and Reproducibility Through %60%60Palletized'' Simulation Data.. United States: N. p., 2018. Web. doi:10.2172/1531316.
Lofstead, Gerald Fredrick, Younge, Andrew J, & Baker, Joshua. End-to-end Provenance Traceability and Reproducibility Through %60%60Palletized'' Simulation Data.. United States. doi:10.2172/1531316.
Lofstead, Gerald Fredrick, Younge, Andrew J, and Baker, Joshua. Mon . "End-to-end Provenance Traceability and Reproducibility Through %60%60Palletized'' Simulation Data.". United States. doi:10.2172/1531316. https://www.osti.gov/servlets/purl/1531316.
@article{osti_1531316,
title = {End-to-end Provenance Traceability and Reproducibility Through %60%60Palletized'' Simulation Data.},
author = {Lofstead, Gerald Fredrick and Younge, Andrew J and Baker, Joshua},
abstractNote = {Trusting simulation output is crucial for Sandia's mission objectives. We rely on these simulations to perform our high-consequence mission tasks given our treaty obligations. Other science and modelling needs, while they may not be high-consequence, still require the strongest levels of trust to enable using the result as the foundation for both practical applications and future research. To this end, the computing community has developed work- flow and provenance systems to aid in both automating simulation and modelling execution, but to also aid in determining exactly how was some output created so that conclusions can be drawn from the data. Current approaches for workflows and provenance systems are all at the user level and have little to no system level support making them fragile, difficult to use, and incomplete so- lutions. The introduction of container technology is a first step towards encapsulating and tracking artifacts used in creating data and resulting insights, but their current implemen- tation is focused solely on making it easy to deploy an application in an isolated "sandbox" and maintaining a strictly read-only mode to avoid any potential changes to the application. All storage activities are still using the system-level shared storage. This project was an initial exploration into extending the container concept to also include storage and to use writeable containers, auto generated by the system, as a way to link the contained data back to the simulation and input deck used to create it.},
doi = {10.2172/1531316},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2018},
month = {10}
}