Summary: High Performance RDMA-based Multi-port All-gather on Multi-rail QsNetII
Ying Qian Ahmad Afsahi
Department of Electrical and Computer Engineering
Queen's University, Kingston, ON, Canada K7L 3N6
ying.qian@ece.queensu.ca ahmad.afsahi@queensu.ca
Abstract
Scientific applications written in MPI use collective
communications intensively. Efficient and scalable
implementation of such collective operations is
therefore crucial to the performance of MPI
applications running on clusters. Quadrics QsNetII is
a high-performance network that implements some
collectives in its Elan user-level library. Its MPI
implementation uses these primitives directly.
Quadrics communication software supports
point-to-point message striping over multi-rail QsNetII
networks. However, multi-rail collectives, other than
broadcast, are not supported. In this work, we
propose, design and implement a number of RDMA-