Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Adaptive multi-level checkpointing

Patent ·
OSTI ID:1735238
In some examples, with respect to adaptive multi-level checkpointing, a transfer parameter associated with transfer of checkpoint data from a node-local storage to a parallel file system may be ascertained for the checkpoint data stored in the node-local storage. The transfer parameter may be compared to a specified transfer parameter threshold. A determination may be made, based on the comparison of the transfer parameter to the specified transfer parameter threshold, as to whether to transfer the checkpoint data from the node-local storage to the parallel file system.
Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC52-07NA27344
Assignee:
Hewlett-Packard Development Company, L.P. (Houston, TX)
Patent Number(s):
10,769,017
Application Number:
15/960,302
OSTI ID:
1735238
Country of Publication:
United States
Language:
English

References (4)

Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System
  • Moody, Adam; Bronevetsky, Greg; Mohror, Kathryn
  • 2010 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2010.18
conference November 2010
Optimizing checkpoint data placement with guaranteed burst buffer endurance in large-scale hierarchical storage systems journal February 2017
A 1 PB/s file system to checkpoint three million MPI tasks
  • Rajachandrasekar, Raghunath; Moody, Adam; Mohror, Kathryn
  • Proceedings of the 22nd international symposium on High-performance parallel and distributed computing - HPDC '13 https://doi.org/10.1145/2493123.2462908
conference January 2013
Optimization of a Multilevel Checkpoint Model with Uncertain Execution Scales
  • Di, Sheng; Bautista-Gome, Leonardo; Cappello, Franck
  • SC14: International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2014.79
conference November 2014

Similar Records

Accelerating shared file checkpoint with local burst buffers
Patent · Tue Apr 12 00:00:00 EDT 2022 · OSTI ID:1892825

Cloud object store for checkpoints of high performance computing applications using decoupling middleware
Patent · Tue Apr 19 00:00:00 EDT 2016 · OSTI ID:1247993

Template based parallel checkpointing in a massively parallel computer system
Patent · Mon Jan 12 23:00:00 EST 2009 · OSTI ID:985865

Related Subjects