skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Cooperative fault-tolerant distributed computing U.S. Department of Energy Grant DE-FG02-02ER25537 Final Report

Technical Report ·
DOI:https://doi.org/10.2172/916972· OSTI ID:916972

The Harness project has developed novel software frameworks for the execution of high-end simulations in a fault-tolerant manner on distributed resources. The H2O subsystem comprises the kernel of the Harness framework, and controls the key functions of resource management across multiple administrative domains, especially issues of access and allocation. It is based on a “pluggable” architecture that enables the aggregated use of distributed heterogeneous resources for high performance computing. The major contributions of the Harness II project result in significantly enhancing the overall computational productivity of high-end scientific applications by enabling robust, failure-resilient computations on cooperatively pooled resource collections.

Research Organization:
Emory University, Atlanta, GA
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
FG02-02ER25537
OSTI ID:
916972
Report Number(s):
DOE/ER/25537-1; TRN: US201006%%621
Country of Publication:
United States
Language:
English