Resilient data staging through MxN distributed transactions.
- Georgia Institute of Technology, Atlanta, GA
Scientific computing-driven discoveries are frequently driven from workflows that use persistent storage as a staging area for data between operations. With the bad and progressively worse bandwidth vs. data size issues as we continue towards exascale, eliminating persistent storage through techniques like data staging will both enable these workflows to continue online, but also enable more interactive workflows reducing the time to scientific discoveries. Data staging has shown to be an effective way for applications running on high-end computing platforms to offload expensive I/O operations and to manage the tremendous amounts of data they produce. This data staging approach, however, lacks the ACID style guarantees traditional straight-to-disk methods provide. Distributed transactions are a proven way to add ACID properties to data movements, however distributed transactions follow 1xN data movement semantics, where our highly parallel HPC environments employ MxN data movement semantics. In this paper we present a novel protocol that extends distributed transaction terminology to include MxN semantics which allows our data staging areas to benefit from ACID properties. We show that with our protocol we can provide resilient data staging with a limited performance penalty over current data staging implementations.
- Research Organization:
- Sandia National Laboratories
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1031296
- Report Number(s):
- SAND2011-8739
- Country of Publication:
- United States
- Language:
- English
Similar Records
Integration of scanning probe microscope with high-performance computing: Fixed-policy and reward-driven workflows implementation
Designing the Cloud-based DOE Systems Biology Knowledgebase
Journal Article
·
Sun Sep 15 20:00:00 EDT 2024
· Review of Scientific Instruments
·
OSTI ID:2571043
Designing the Cloud-based DOE Systems Biology Knowledgebase
Conference
·
Thu Sep 01 00:00:00 EDT 2011
·
OSTI ID:1028554