skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Network support for system initiated checkpoints

Patent ·
OSTI ID:1532128

A system, method and computer program product for supporting system initiated checkpoints in parallel computing systems. The system and method generates selective control signals to perform checkpointing of system related data in presence of messaging activity associated with a user application running at the node. The checkpointing is initiated by the system such that checkpoint data of a plurality of network nodes may be obtained even in the presence of user applications running on highly parallel computers that include ongoing user messaging activity.

Research Organization:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
B554331
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Number(s):
8,856,261
Application Number:
13/729,937
OSTI ID:
1532128
Resource Relation:
Patent File Date: 2012-12-28
Country of Publication:
United States
Language:
English

References (6)

Method of checkpointing parallel processes in execution within plurality of process domains patent October 2008
Method and apparatus for achieving system-directed checkpointing without specialized hardware assistance patent September 2003
Selective preservation of network state during a checkpoint patent-application October 2008
Storage access validation to data messages using partial storage address data indexed entries containing permissible address range validation for message source patent October 1999
Apparatus For Enhancing Performance Of A Parallel Processing Environment, And Associated Methods patent-application July 2010
Methods, media and systems for managing a distributed application running in a plurality of digital processing devices patent-application October 2007