OSTI.GOV · U.S. Department of Energy, Office of Scientific and Technical Information

Title: Performance characterization of scientific workflows for the optimal use of Burst Buffers

Journal Article · Future Generations Computer Systems

Scientific discoveries are increasingly dependent upon the analysis of large volumes of data from observations and simulations of complex phenomena. Scientists compose the complex analyses as workflows and execute them on large-scale HPC systems. These workflow structures contrast with the monolithic single simulations that have often been the primary use case on HPC systems. Simultaneously, new storage paradigms such as Burst Buffers are becoming available on HPC platforms. In this paper, we analyze the performance characteristics of a Burst Buffer and two representative scientific workflows with the aim of optimizing the usage of a Burst Buffer, extending our previous analyses (Daley et al., 2016). Our key contributions are (a) developing a performance analysis methodology pertinent to Burst Buffers, (b) improving the use of a Burst Buffer in workflows with bandwidth-sensitive and metadata-sensitive I/O workloads, and (c) highlighting the key data management challenges when incorporating a Burst Buffer in the studied scientific workflows.

Research Organization:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1544343
Journal Information:
Journal Name: Future Generations Computer Systems; ISSN 0167-739X
Publisher:
Elsevier
Country of Publication:
United States
Language:
English

Cited By (1)

A slice-based decentralized NFV framework for an end-to-end QoS-based dynamic resource allocation · journal · January 2020

Similar Records

Performance characterization of scientific workflows for the optimal use of Burst Buffers
Journal Article · Dec 28, 2017 · Future Generations Computer Systems · OSTI ID:1544343

Final Report for File System Support for Burst Buffers on HPC Systems
Technical Report · Nov 27, 2017 · OSTI ID:1544343

Measuring the impact of burst buffers on data-intensive scientific workflows
Journal Article · Jun 17, 2019 · Future Generations Computer Systems · OSTI ID:1544343
