Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Delta: Data Reduction for Integrated Application Workflows

Technical Report ·
DOI:https://doi.org/10.2172/1193147· OSTI ID:1193147
 [1];  [2];  [2]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Florida International Univ. (FIU), Miami, FL (United States)
  2. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Integrated Application Workflows (IAWs) run multiple simulation workflow components concurrently on an HPC resource connecting these components using compute area resources and compensating for any performance or data processing rate mismatches. These IAWs require high frequency and high volume data transfers between compute nodes and staging area nodes during the lifetime of a large parallel computation. The available network band-width between the two areas may not be enough to efficiently support the data movement. As the processing power available to compute resources increases, the requirements for this data transfer will become more difficult to satisfy and perhaps will not be satisfiable at all since network capabilities are not expanding at a comparable rate. Furthermore, energy consumption in HPC environments is expected to grow by an order of magnitude as exascale systems become a reality. The energy cost of moving large amounts of data frequently will contribute to this issue. It is necessary to reduce the volume of data without reducing the quality of data when it is being processed and analyzed. Delta resolves the issue by addressing the lifetime data transfer operations. Delta removes subsequent identical copies of already transmitted data during transfers and restores those copies once the data has reached the destination. Delta is able to identify duplicated information and determine the most space efficient way to represent it. Initial tests show about 50% reduction in data movement while maintaining the same data quality and transmission frequency.
Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA), Office of Defense Nuclear Security
DOE Contract Number:
AC04-94AL85000
OSTI ID:
1193147
Report Number(s):
SAND--2015-5029; 594733
Country of Publication:
United States
Language:
English

Similar Records

Accelerating Advanced Light Source Science Through Multi-Facility HPC Workflows
Conference · Sat Nov 15 19:00:00 EST 2025 · OSTI ID:3005978

STAR Data Production Workflow on HPC: Lessons Learned & Best Practices
Conference · Sat Mar 09 23:00:00 EST 2019 · OSTI ID:1771139

Autonomic Management of Application Workflows on Hybrid Computing Infrastructure
Journal Article · Fri Dec 31 19:00:00 EST 2010 · Scientific Programming · OSTI ID:1243135

Related Subjects