skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Reusability First: Toward FAIR Workflows

Conference ·
OSTI ID:1827005

The FAIR principles of open science (Findable, Accessible, Interoperable, and Reusable) have had transformative effects on modern large-scale computational science. In particular, they have encouraged more open access to and use of data, an important consideration as collaboration among teams of researchers accelerates and the use of workflows by those teams to solve problems increases. How best to apply the FAIR principles to workflows themselves, and software more generally, is not yet well understood. We argue that the software engineering concept of technical debt management provides a useful guide for application of those principles to workflows, and in particular that it implies reusability should be considered as ‘first among equals’. Moreover, our approach recognizes a continuum of reusability where we can make explicit and selectable the tradeoffs required in workflows for both their users and developers.To this end, we propose a new abstraction approach for reusable workflows, with demonstrations for both synthetic workloads and real-world computational biology workflows. Through application of novel systems and tools that are based on this abstraction, these experimental workflows are refactored to rightsize the granularity of workflow components to efficiently fill the gap between end-user simplicity and general customizability. Our work makes it easier to selectively reason about and automate the connections between trade-offs across user and developer concerns when exposing degrees of freedom for reuse. Additionally, by exposing fine-grained reusability abstractions we enable performance optimizations, as we demonstrate on both institutional-scale and leadership-class HPC resources.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1827005
Resource Relation:
Conference: 2021 IEEE International Conference on Cluster Computing (CLUSTER) - Portland, Oregon, United States of America - 9/7/2021 4:00:00 AM-9/10/2021 4:00:00 AM
Country of Publication:
United States
Language:
English

Similar Records

Sim2Ls: FAIR simulation workflows and data
Journal Article · Thu Mar 10 00:00:00 EST 2022 · PLoS ONE · OSTI ID:1827005

F*** workflows: when parts of FAIR are missing
Conference · Sat Oct 01 00:00:00 EDT 2022 · OSTI ID:1827005

FAIR principles for AI models with a practical application for accelerated high energy diffraction microscopy
Journal Article · Thu Nov 10 00:00:00 EST 2022 · Scientific Data · OSTI ID:1827005

Related Subjects