Title: Checkpoint triggering in a computer system

Patent · OSTI ID: 1320886

According to an aspect, a method for triggering creation of a checkpoint in a computer system includes executing a task in a processing node of the computer system and determining whether it is time to read a monitor associated with a metric of the task. The monitor is read to determine a value of the metric based on determining that it is time to read the monitor. A threshold for triggering creation of the checkpoint is determined based on the value of the metric. Based on determining that the value of the metric has crossed the threshold, the checkpoint including state data of the task is created to enable restarting execution of the task upon a restart operation.
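The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything specific in it is an assumption made for illustration: the names `CheckpointTrigger`, `maybe_checkpoint`, and `run_task`, the rule "read the monitor every `read_interval` steps", the rule "set the next threshold to the current metric value plus a fixed `step`", and the use of an in-memory deep copy as the checkpoint. The abstract only says that the threshold is determined based on the value of the metric; it does not say how.

```python
import copy
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class CheckpointTrigger:
    """Hypothetical trigger: all names and policies here are illustrative."""
    read_interval: int                  # assumed policy: read the monitor every N ticks
    step: float                         # assumed policy: next threshold = value + step
    threshold: Optional[float] = None   # unset until the first monitor read
    checkpoints: List[dict] = field(default_factory=list)

    def maybe_checkpoint(self, tick: int,
                         read_monitor: Callable[[], float],
                         state: dict) -> bool:
        # 1. Determine whether it is time to read the monitor.
        if tick % self.read_interval != 0:
            return False
        # 2. Read the monitor to obtain the current value of the metric.
        value = read_monitor()
        # 3. Determine the threshold from the metric value (first read only seeds it).
        if self.threshold is None:
            self.threshold = value + self.step
            return False
        # 4. Create a checkpoint only if the metric has crossed the threshold.
        if value < self.threshold:
            return False
        self.checkpoints.append(copy.deepcopy(state))  # state data of the task
        self.threshold = value + self.step             # re-derive from the new value
        return True


def run_task(n_steps: int, trigger: CheckpointTrigger) -> dict:
    """Toy task: accumulates a running sum; the metric is the step count."""
    state = {"step": 0, "acc": 0}
    for _ in range(n_steps):
        state["step"] += 1
        state["acc"] += state["step"]
        trigger.maybe_checkpoint(state["step"], lambda: state["step"], state)
    return state


if __name__ == "__main__":
    trigger = CheckpointTrigger(read_interval=2, step=5.0)
    run_task(20, trigger)
    # A restart would resume from a copy of the most recent checkpoint.
    restored = copy.deepcopy(trigger.checkpoints[-1])
    print([c["step"] for c in trigger.checkpoints], restored)
```

In this sketch the checkpoint is a deep copy of the task state, so later changes to the running task do not alter saved checkpoints. A real system would persist the state to stable storage and would likely monitor a hardware or runtime metric rather than a step counter.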

Research Organization: International Business Machines Corp., Armonk, NY (United States)
Sponsoring Organization: USDOE
DOE Contract Number: B599858
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION (Armonk, NY)
Patent Number(s): 9,436,552
Application Number: 14/302,947
Resource Relation: Patent File Date: 2014 Jun 12
Country of Publication: United States
Language: English
