U.S. Department of Energy
Office of Scientific and Technical Information

Locality-Aware Parallel Process Mapping for Multi-Core HPC Systems

Conference · OSTI ID:1024716
High Performance Computing (HPC) systems are composed of servers containing an ever-increasing number of cores. With such high processor core counts, non-uniform memory access (NUMA) architectures are almost universally used to reduce inter-processor and memory communication bottlenecks by distributing processors and memory throughout a server-internal networking topology. Application studies have shown that tuning process placement within a server's NUMA networking topology to suit the application can have a dramatic impact on performance. The performance implications are magnified when running a parallel job across multiple server nodes, especially with large-scale HPC applications. This paper presents the Locality-Aware Mapping Algorithm (LAMA) for distributing the individual processes of a parallel application across processing resources in an HPC system, paying particular attention to the internal server NUMA topologies. The algorithm supports both homogeneous and heterogeneous hardware systems, and dynamically adapts to the available hardware and the user-specified process layout at run-time. As implemented in Open MPI, the LAMA provides 362,880 mapping permutations and scales naturally to additional hardware resources as they become available in future architectures.
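As a rough illustration of what a user-specified process layout can mean, the following Python sketch assigns ranks to hardware slots by iterating over resource levels in a chosen order. It is not the LAMA implementation or the Open MPI interface: the function `map_processes`, the three level names, and the small homogeneous system are all hypothetical. The abstract does not say how the figure of 362,880 is derived; the sketch only notes that it equals 9!, the number of orderings of nine items, and treating it as an ordering count is an assumption.

```python
from itertools import product
from math import factorial


def map_processes(num_procs, sizes, order):
    """Assign ranks to hardware slots (hypothetical helper, not the LAMA code).

    sizes: dict of level name -> number of resources at that level
           (assumes a homogeneous system for simplicity).
    order: tuple of level names; the first level varies fastest.
    """
    # itertools.product varies its last argument fastest, so reverse the order.
    levels = list(reversed(order))
    slots = [dict(zip(levels, combo))
             for combo in product(*(range(sizes[lv]) for lv in levels))]
    # Wrap around if there are more processes than slots.
    return [slots[rank % len(slots)] for rank in range(num_procs)]


# Hypothetical system: 2 nodes, 2 sockets per node, 2 cores per socket.
sizes = {"node": 2, "socket": 2, "core": 2}

# Fill cores first: the first four ranks stay on one node.
by_core = map_processes(4, sizes, ("core", "socket", "node"))

# Fill nodes first: consecutive ranks alternate between nodes.
by_node = map_processes(4, sizes, ("node", "socket", "core"))

# 362,880 equals 9!, the number of orderings of nine distinct items.
orderings_of_nine = factorial(9)
```

Changing only the order tuple changes which ranks share a node or socket, which is the kind of placement decision the abstract says can strongly affect performance.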
Research Organization:
Oak Ridge National Laboratory (ORNL); Center for Computational Sciences
Sponsoring Organization:
SC USDOE - Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1024716
Country of Publication:
United States
Language:
English

Similar Records

Critical Path-Based Thread Placement for NUMA Systems
Journal Article · Sat Dec 31 23:00:00 EST 2011 · Performance Evaluation Review · OSTI ID:1048161

Exploring Architectural-Aware Affinity Policies in Modern HPC Runtimes
Conference · Wed Jul 17 00:00:00 EDT 2024 · OSTI ID:2481707

Affinity-aware checkpoint restart
Journal Article · Sun Dec 07 19:00:00 EST 2014 · ACM Digital Library · OSTI ID:1342535