Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Invited Paper: Benchmarking and Optimizing Data Movement on Emerging Heterogeneous Architectures

Conference · · 2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
 [1]
  1. Univ. of New Mexico, Albuquerque, NM (United States)

As supercomputers evolve, nodes are continually increasing in complexity. As a result, each generation of parallel systems brings new performance challenges. For instance, on recent systems inter-node communication has outperformed inter-socket, resulting in poor performance of many node-aware communication optimizations. Communication optimizations are critical for the performance and scalability of parallel applications, but are dependent on the parallel architecture, which varies significantly among recent generations of supercomputers. Furthermore, this paper investigates the performance of various paths of data movement on recent generations of systems, and analyzes the increased complexity of communication, particularly on recent heterogeneous systems. The paper also introduces MPI Advance, a communication library that enables optimizations to be created based on benchmark analysis of each emerging system.

Research Organization:
Univ. of New Mexico, Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
NA0003966
OSTI ID:
2205621
Journal Information:
2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Conference: 2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), St. Petersburg, FL (United States), 15-19 May 2023
Country of Publication:
United States
Language:
English

References (18)

Node-Aware Improvements to Allreduce conference November 2019
MPI Collectives and Datatypes for Hierarchical All-to-all Communication conference September 2014
Designing a Highly-Scalable Operating System: The Blue Gene/L Story conference November 2006
A Locality-Aware Bruck Allgather conference September 2022
Hierarchical Collectives in MPICH2 book January 2009
Optimization of Collective Reduction Operations book January 2004
Improving Performance Models for Irregular Point-to-Point Communication
  • Bienz, Amanda; Gropp, William D.; Olson, Luke N.
  • EuroMPI'18: 25th European MPI Users' Group Meeting, Proceedings of the 25th European MPI Users' Group Meeting https://doi.org/10.1145/3236367.3236368
conference September 2018
Exascale science: the next frontier in high performance computing conference January 2010
The IBM Blue Gene/Q Compute Chip journal March 2012
Kernel-assisted and topology-aware MPI collective communications on multicore/many-core platforms journal July 2013
Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with Scatter and Gather conference April 2010
Decomposing MPI Collectives for Exploiting Multi-lane Communication conference September 2020
Stepping up to Summit journal March 2018
Efficient algorithms for all-to-all communications in multiport message-passing systems journal January 1997
The Exascale Era is Upon Us: The Frontier supercomputer may be the first to reach 1,000,000,000,000,000,000 operations per second journal January 2022
Optimization of Collective Communication Operations in MPICH journal February 2005
MPI collective communications on the blue gene/p supercomputer: algorithms and optimizations conference January 2009
Modeling MPI Communication Performance on SMP Nodes: Is it Time to Retire the Ping Pong Test conference January 2016