Science Capsule: Towards Sharing and Reproducibility of Scientific Workflows
Conference
·
· Workshop on Workflows in Support of Large-Scale Science.
Workflows are increasingly processing large volumes of data from scientific instruments, experiments and sensors. These workflows often consist of complex data processing and analysis steps that might include a diverse ecosystem of tools and also often involve human-in-the-loop steps. Sharing and reproducing these workflows with collaborators and the larger community is critical but hard to do without the entire context of the workflow including user notes and execution environment. In this paper, we describe Science Capsule, which is a framework to capture, share, and reproduce scientific workflows. Science Capsule captures, manages and represents both computational and human elements of a workflow. It automatically captures and processes events associated with the execution and data life cycle of workflows, and lets users add other types and forms of scientific artifacts. Science Capsule also allows users to create `workflow snapshots' that keep track of the different versions of a workflow and their lineage, allowing scientists to incrementally share and extend workflows between users. Our results show that Science Capsule is capable of processing and organizing events in near real-time for high-throughput experimental and data analysis workflows without incurring any significant performance overheads.
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
- DOE Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1833998
- Conference Information:
- Journal Name: Workshop on Workflows in Support of Large-Scale Science. Journal Volume: 2021
- Country of Publication:
- United States
- Language:
- English
Similar Records
Scientific Process Automation and Workflow Management
Science Capsule: Capturing the Data Life Cycle (Science Capsule) v0.1.0
Book
·
Thu Dec 31 23:00:00 EST 2009
·
OSTI ID:972328
Science Capsule: Capturing the Data Life Cycle (Science Capsule) v0.1.0
Software
·
Fri Jun 05 20:00:00 EDT 2020
·
OSTI ID:code-51417