skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance

Abstract

Deterministic replay of a parallel application is commonly used for discovering bugs or to recover from a hard fault with message-logging fault tolerance. For message passing programs, a major source of overhead during forward execution is recording the order in which messages are sent and received. During replay, this ordering must be used to deterministically reproduce the execution. Previous work in replay algorithms often makes minimal assumptions about the programming model and application in order to maintain generality. However, in many cases, only a partial order must be recorded due to determinism intrinsic in the code, ordering constraints imposed by the execution model, and events that are commutative (their relative execution order during replay does not need to be reproduced exactly). In this paper, we present a novel algebraic framework for reasoning about the minimum dependencies required to represent the partial order for different concurrent orderings and interleavings. By exploiting this theory, we improve on an existing scalable message-logging fault tolerance scheme. The improved scheme scales to 131,072 cores on an IBM BlueGene/P with up to 2x lower overhead than one that records a total order.

Authors:
; ; ; ; ;
Publication Date:
Research Org.:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1178512
Report Number(s):
PNNL-SA-103978
KJ0402000
DOE Contract Number:  
AC05-76RL01830
Resource Type:
Conference
Resource Relation:
Conference: IEEE International Conference on Cluster Computing (CLUSTER 2014), September 22-26, 2014, Madrid, Spain, 19-28
Country of Publication:
United States
Language:
English
Subject:
replay; partial-order dependencies; fault tolerance; message logging

Citation Formats

Lifflander, Jonathan, Meneses, Esteban, Menon, Harshita, Miller, Phil, Krishnamoorthy, Sriram, and Kale, Laxmikant. Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance. United States: N. p., 2014. Web. doi:10.1109/CLUSTER.2014.6968739.
Lifflander, Jonathan, Meneses, Esteban, Menon, Harshita, Miller, Phil, Krishnamoorthy, Sriram, & Kale, Laxmikant. Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance. United States. doi:10.1109/CLUSTER.2014.6968739.
Lifflander, Jonathan, Meneses, Esteban, Menon, Harshita, Miller, Phil, Krishnamoorthy, Sriram, and Kale, Laxmikant. Mon . "Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance". United States. doi:10.1109/CLUSTER.2014.6968739.
@article{osti_1178512,
title = {Scalable Replay with Partial-Order Dependencies for Message-Logging Fault Tolerance},
author = {Lifflander, Jonathan and Meneses, Esteban and Menon, Harshita and Miller, Phil and Krishnamoorthy, Sriram and Kale, Laxmikant},
abstractNote = {Deterministic replay of a parallel application is commonly used for discovering bugs or to recover from a hard fault with message-logging fault tolerance. For message passing programs, a major source of overhead during forward execution is recording the order in which messages are sent and received. During replay, this ordering must be used to deterministically reproduce the execution. Previous work in replay algorithms often makes minimal assumptions about the programming model and application in order to maintain generality. However, in many cases, only a partial order must be recorded due to determinism intrinsic in the code, ordering constraints imposed by the execution model, and events that are commutative (their relative execution order during replay does not need to be reproduced exactly). In this paper, we present a novel algebraic framework for reasoning about the minimum dependencies required to represent the partial order for different concurrent orderings and interleavings. By exploiting this theory, we improve on an existing scalable message-logging fault tolerance scheme. The improved scheme scales to 131,072 cores on an IBM BlueGene/P with up to 2x lower overhead than one that records a total order.},
doi = {10.1109/CLUSTER.2014.6968739},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2014},
month = {9}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: