DOE Patents, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Non-volatile memory for checkpoint storage

Abstract

A system, method and computer program product for supporting system-initiated checkpoints in high-performance parallel computing systems and storing checkpoint data to a non-volatile memory storage device. The system and method generate selective control signals to perform checkpointing of system-related data in the presence of messaging activity associated with a user application running at the node. The checkpointing is initiated by the system, so that checkpoint data for a plurality of network nodes may be obtained even while user applications on highly parallel computers have ongoing user messaging activity. In one embodiment, the non-volatile memory is a pluggable flash memory card.
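
The idea in the abstract can be illustrated with a small software-only sketch. It is not the patented mechanism, which relies on hardware control signals that this record does not detail. The sketch assumes one plausible reading: the system briefly holds back new user messages, snapshots each node's state together with any messages still in flight, writes the snapshot durably, and then resumes messaging. Every name here (`Node`, `system_checkpoint`, `restore`, the `.ckpt` file layout) is hypothetical, and an ordinary directory stands in for the pluggable flash card.

```python
import json
import os
import tempfile
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical compute node; not taken from the patent."""
    node_id: int
    app_state: dict = field(default_factory=dict)
    in_flight: list = field(default_factory=list)  # user messages not yet delivered
    messaging_paused: bool = False

    def send(self, msg: str) -> bool:
        """User-level send; refused while the system holds messaging paused."""
        if self.messaging_paused:
            return False
        self.in_flight.append(msg)
        return True


def system_checkpoint(nodes, directory):
    """System-initiated checkpoint (assumed flow): pause, snapshot, resume."""
    paths = []
    for node in nodes:
        node.messaging_paused = True  # stand-in for a selective control signal
    try:
        for node in nodes:
            snapshot = {
                "node_id": node.node_id,
                "app_state": node.app_state,
                "in_flight": list(node.in_flight),  # keep pending messages
            }
            final_path = os.path.join(directory, f"node{node.node_id}.ckpt")
            tmp_path = final_path + ".tmp"
            with open(tmp_path, "w") as f:
                json.dump(snapshot, f)
                f.flush()
                os.fsync(f.fileno())  # ask the OS to push data to the device
            os.replace(tmp_path, final_path)  # atomic rename avoids torn files
            paths.append(final_path)
    finally:
        for node in nodes:
            node.messaging_paused = False  # resume user messaging
    return paths


def restore(path):
    """Rebuild a Node from a checkpoint file written by system_checkpoint."""
    with open(path) as f:
        data = json.load(f)
    return Node(data["node_id"], data["app_state"], data["in_flight"])


if __name__ == "__main__":
    storage = tempfile.mkdtemp()  # stands in for the non-volatile device
    cluster = [Node(0, {"step": 3}), Node(1, {"step": 3})]
    cluster[0].send("halo exchange")
    for p in system_checkpoint(cluster, storage):
        print(p, restore(p))
```

Writing to a temporary file and renaming it means a failure during the write leaves the previous checkpoint intact, which is the property a checkpoint store most needs.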

Inventors:
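Blumrich, Matthias A.; Chen, Dong; Cipolla, Thomas M.; Coteus, Paul W.; Gara, Alan; Heidelberger, Philip; Jeanson, Mark J.; Kopcsay, Gerard V.; Ohmacht, Martin; Takken, Todd E.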
Issue Date:
2014-07-22
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1149606
Patent Number(s):
8788879
Application Number:
13/004,005
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B554331
Resource Type:
Patent
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Blumrich, Matthias A., Chen, Dong, Cipolla, Thomas M., Coteus, Paul W., Gara, Alan, Heidelberger, Philip, Jeanson, Mark J., Kopcsay, Gerard V., Ohmacht, Martin, and Takken, Todd E. Non-volatile memory for checkpoint storage. United States: N. p., 2014. Web.
Blumrich, Matthias A., Chen, Dong, Cipolla, Thomas M., Coteus, Paul W., Gara, Alan, Heidelberger, Philip, Jeanson, Mark J., Kopcsay, Gerard V., Ohmacht, Martin, & Takken, Todd E. Non-volatile memory for checkpoint storage. United States.
Blumrich, Matthias A., Chen, Dong, Cipolla, Thomas M., Coteus, Paul W., Gara, Alan, Heidelberger, Philip, Jeanson, Mark J., Kopcsay, Gerard V., Ohmacht, Martin, and Takken, Todd E. 2014. "Non-volatile memory for checkpoint storage". United States. https://www.osti.gov/servlets/purl/1149606.
@article{osti_1149606,
title = {Non-volatile memory for checkpoint storage},
author = {Blumrich, Matthias A. and Chen, Dong and Cipolla, Thomas M. and Coteus, Paul W. and Gara, Alan and Heidelberger, Philip and Jeanson, Mark J. and Kopcsay, Gerard V. and Ohmacht, Martin and Takken, Todd E.},
abstractNote = {A system, method and computer program product for supporting system-initiated checkpoints in high-performance parallel computing systems and storing checkpoint data to a non-volatile memory storage device. The system and method generate selective control signals to perform checkpointing of system-related data in the presence of messaging activity associated with a user application running at the node. The checkpointing is initiated by the system, so that checkpoint data for a plurality of network nodes may be obtained even while user applications on highly parallel computers have ongoing user messaging activity. In one embodiment, the non-volatile memory is a pluggable flash memory card.},
place = {United States},
year = {2014},
month = jul
}

Works referenced in this record:

Method of checkpointing parallel processes in execution within plurality of process domains
patent, October 2008


Checkpointing in massively parallel processing
patent, January 2012


Novel massively parallel supercomputer
patent-application, May 2004


Method of checkpointing parallel processes in execution within plurality of process domains
patent-application, February 2006


Selective preservation of network state during a checkpoint
patent-application, October 2008