Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures

Conference ·
OSTI ID:993785

Scaling computations on emerging massive-core supercomputers is a daunting task, which coupled with the significantly lagging system I/O capabilities exacerbates applications end-to-end performance. The I/O bottleneck often negates potential performance benefits of assigning additional compute cores to an application. In this paper, we address this issue via a novel functional partitioning (FP) runtime environment that allocates cores to specific application tasks - checkpointing, de-duplication, and scientific data format transformation - so that the deluge of cores can be brought to bear on the entire gamut of application activities. The focus is on utilizing the extra cores to support HPC application I/O activities and also leverage solid-state disks in this context. For example, our evaluation shows that dedicating 1 core on an oct-core machine for checkpointing and its assist tasks using FP can improve overall execution time of a FLASH benchmark on 80 and 160 cores by 43.95% and 41.34%, respectively.

Research Organization:
Oak Ridge National Laboratory (ORNL)
Sponsoring Organization:
ORNL LDRD Director's R&D
DOE Contract Number:
AC05-00OR22725
OSTI ID:
993785
Country of Publication:
United States
Language:
English

Similar Records

Scalable Incremental Checkpointing using GPU-Accelerated De-Duplication
Conference · Sat Dec 31 23:00:00 EST 2022 · OSTI ID:2229849

Scalable Incremental Checkpointing using GPU-Accelerated De-Duplication
Conference · Tue Aug 01 00:00:00 EDT 2023 · OSTI ID:2000364

PLEXUS: A Pattern-Oriented Runtime System Architecture for Resilient Extreme-Scale High-Performance Computing Systems
Conference · Mon Nov 30 23:00:00 EST 2020 · OSTI ID:1766396