Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Scalable High Performance Message Passing over InfiniBand for Open MPI

Conference ·
OSTI ID:923113

InfiniBand (IB) is a popular network technology for modern high-performance computing systems. MPI implementations traditionally support IB using a reliable, connection-oriented (RC) transport. However, per-process resource usage that grows linearly with the number of processes, makes this approach prohibitive for large-scale systems. IB provides an alternative in the form of a connectionless unreliable datagram transport (UD), which allows for near-constant resource usage and initialization overhead as the process count increases. This paper describes a UD-based implementation for IB in Open MPI as a scalable alternative to existing RC-based schemes. We use the software reliability capabilities of Open MPI to provide the guaranteed delivery semantics required by MPI. Results show that UD not only requires fewer resources at scale, but also allows for shorter MPI startup times. A connectionless model also improves performance for applications that tend to send small messages to many different processes.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA
Sponsoring Organization:
USDOE
DOE Contract Number:
W-7405-ENG-48
OSTI ID:
923113
Report Number(s):
UCRL-CONF-235949
Country of Publication:
United States
Language:
English

Similar Records

Efficient On-demand Connection Management Mechanisms with PGAS Models on InfiniBand
Conference · Mon May 17 00:00:00 EDT 2010 · OSTI ID:986276

Dynamic Time-Variant Connection Management for PGAS Models on InfiniBand
Conference · Thu Sep 01 00:00:00 EDT 2011 · OSTI ID:1024543

X-SRQ - Improving Scalability and Performance of Multi-Core InfiniBand Clusters
Conference · Mon Dec 31 23:00:00 EST 2007 · OSTI ID:1016037