U.S. Department of Energy
Office of Scientific and Technical Information

Exploring Architectural-Aware Affinity Policies in Modern HPC Runtimes

Conference
Modern commodity and High-Performance Computing (HPC) systems are evolving toward increasingly complex CPU architectures, with higher core and NUMA domain counts and features such as hyperthreading. Because hardware configurations, library availability, and hardware-tailored system/software stacks can vary substantially from one system to another, performance portability is hard to achieve. Over the years, this trend has placed an increasingly heavy burden on application developers, who must fine-tune their workloads for each architecture. This work explores how hardware-dependent aspects such as locality and process/thread affinity affect performance on modern CPU architectures. We focus our study on the Global Memory and Threading (GMT) distributed runtime system as a representative of the Partitioned Global Address Space (PGAS) software stacks commonly adopted for productivity. In particular, to assess the performance implications, we evaluate GMT’s thread affinity policies and introduce two new ones that exploit architectural awareness. Finally, we explore alternative NUMA configurations via different process bindings and perform a scalability study on three HPC clusters with varying CPU architectures and NUMA layouts. Our analysis indicates that more complex architectures are more strongly affected by affinity and binding policies, and it highlights the importance of setting proper runtime configurations to achieve superior performance.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
2481707
Report Number(s):
PNNL-SA-197771
Country of Publication:
United States
Language:
English

Similar Records

A Runtime Environment for Supporting Research in Resilient HPC System Software & Tools
Conference · Mon Dec 31 23:00:00 EST 2012 · OSTI ID:1118750

A Runtime Environment for Supporting Research in Resilient HPC System Software & Tools
Conference · Wed Jan 29 23:00:00 EST 2014 · 2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR) · OSTI ID:1567332

Locality-Aware Parallel Process Mapping for Multi-Core HPC Systems
Conference · Fri Dec 31 23:00:00 EST 2010 · OSTI ID:1024716
