FireWorks: a Dynamic Workflow System Designed for High-Throughput Applications
Journal Article
·
· Concurrency and Computation. Practice and Experience
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Energy and Technologies Division
- Univ. of California, San Diego, CA (United States). Dept. of Nanoengineering
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division
- Catholic Univ. of Louvain, Louvain-la-Neuve (Belgium). Inst. of Condensed Matter and Nanosciences (IMCN) and European Theoretical Spectroscopy Facility (ETSF)
This work introduces FireWorks, a workflow software for running high-throughput calculation workflows at supercomputing centers. FireWorks has been used to complete over 50 million CPU-hours worth of computational chemistry and materials science calculations at the National Energy Research Supercomputing Center. It has been designed to serve the demanding high-throughput computing needs of these applications, with extensive support for (i) concurrent execution through job packing, (ii) failure detection and correction, (iii) provenance and reporting for long-running projects, (iv) automated duplicate detection, and (v) dynamic workflows (i.e., modifying the workflow graph during runtime). We have found that these features are highly relevant to enabling modern data-driven and high-throughput science applications, and we discuss our implementation strategy that rests on Python and NoSQL databases (MongoDB). Finally, we present performance data and limitations of our approach along with planned future work.
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- European Union (EU); USDOE Office of Energy Efficiency and Renewable Energy (EERE); USDOE Office of Science (SC), Basic Energy Sciences (BES) (SC-22). Joint Center for Energy Storage Research (JCESR); USDOE Office of Science (SC), Biological and Environmental Research (BER) (SC-23)
- Grant/Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1474899
- Alternate ID(s):
- OSTI ID: 1401389
- Journal Information:
- Concurrency and Computation. Practice and Experience, Journal Name: Concurrency and Computation. Practice and Experience Journal Issue: 17 Vol. 27; ISSN 1532-0626
- Publisher:
- WileyCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Convergence and pitfalls of density functional perturbation theory phonons calculations from a high-throughput perspective
|
journal | March 2018 |
Semantic Interoperability and Characterization of Data Provenance in Computational Molecular Engineering
|
journal | December 2019 |
| BioExcel-2 Deliverable 2.1 – State of the Art and Initial Roadmap | text | January 2019 |
Similar Records
Measuring the impact of burst buffers on data-intensive scientific workflows
A Job Sizing Strategy for High-Throughput Scientific Workflows
DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers
Journal Article
·
Sun Jun 16 20:00:00 EDT 2019
· Future Generations Computer Systems
·
OSTI ID:1603369
A Job Sizing Strategy for High-Throughput Scientific Workflows
Journal Article
·
Wed Jan 31 23:00:00 EST 2018
· IEEE Transactions on Parallel and Distributed Systems
·
OSTI ID:1472078
DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers
Conference
·
Sun Aug 01 00:00:00 EDT 2021
·
OSTI ID:1827006