Global to push GA events into
skip to main content

Title: Methods, apparatus and system for selective duplication of subtasks

A method for selective duplication of subtasks in a high-performance computing system includes: monitoring a health status of one or more nodes in a high-performance computing system, where one or more subtasks of a parallel task execute on the one or more nodes; identifying one or more nodes as having a likelihood of failure which exceeds a first prescribed threshold; selectively duplicating the one or more subtasks that execute on the one or more nodes having a likelihood of failure which exceeds the first prescribed threshold; and notifying a messaging library that one or more subtasks were duplicated.
Inventors:
; ; ; ;
Issue Date:
OSTI Identifier:
1478646
Assignee:
International Business Machines Corporation (Armonk, NY) OSTI
Patent Number(s):
10,073,739
Application Number:
14/957,584
Contract Number:
B599858
Resource Relation:
Patent File Date: 2015 Dec 02
Research Org:
International Business Machines Corporation, Armonk, New York (United States)
Sponsoring Org:
USDOE
Country of Publication:
United States
Language:
English