| | |
Summary: Scalable Collective Communication on the ASCI Q Machine
Fabrizio Petrini Juan Fernandez Eitan Frachtenberg
Salvador Coll
CCS-3 Modeling, Algorithms, and Informatics Group
Computer and Computational Sciences (CCS) Division
Los Alamos National Laboratory, Los Alamos, NM 87545 USA
¡
fabrizio,juanf,eitanf,scoll¢ @lanl.gov
Abstract
Scientific codes spend a considerable part of their run
time executing collective communication operations. Such
operations can also be critical for efficient resource man-
agement in large-scale machines. Therefore, scalable col-
lective communication is a key factor to achieve good per-
formance in large-scale parallel computers.
In this paper we describe the performance and scala-
bility of some common collective communication patterns
on the ASCI Q machine. Experimental results conducted
on a 1024-node/4096-processor segment show that the net-
|