U.S. Department of Energy
Office of Scientific and Technical Information

Extreme-scale workflows: A perspective from the JLESC international community

Journal Article · Future Generation Computer Systems
 [1];  [1];  [2];  [3];  [4];  [1]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States)
  2. Univ. Paris-Saclay, Gif-sur-Yvette (France); Centre National de la Recherche Scientifique (CNRS) (France)
  3. Univ. of Grenoble Alpes, Grenoble (France); Centre National de la Recherche Scientifique (CNRS) (France)
  4. Barcelona Supercomputing Center (Spain)
The Joint Laboratory for Extreme-Scale Computing (JLESC) focuses on software challenges in high-performance computing systems to meet the needs of today’s science campaigns, which often require large resources, consist of multiple tasks, and generate vast amounts of data. In this context, extreme-scale workflows have been the key factor in enabling scientific discoveries by helping scientists automate the dependencies and data exchanges between workflow tasks instead of managing them manually. In this paper, we present representative extreme-scale workflows and feature workflow systems developed by JLESC participating institutions. We present lessons learned while developing these tools, along with the open challenges and future research directions in the field of extreme-scale workflows.
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
2440426
Alternate ID(s):
OSTI ID: 2427007
Journal Information:
Future Generation Computer Systems, Vol. 161; ISSN 0167-739X
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
