Scalable NIC-based reduction on large-scale clusters

Moody, A; Fernández, J C; Petrini, F; Panda, Dhabaleswar K

doi:10.1145/1048935.1050209

Title: Scalable NIC-based reduction on large-scale clusters

Conference · Wed Jan 01 00:00:00 EST 2003

DOI:https://doi.org/10.1145/1048935.1050209· OSTI ID:976664

Moody, A ^[1]; Fernández, J C ^[2]; Petrini, F ^[3]; Panda, Dhabaleswar K

Adam
Juan C.
Fabrizio

Many parallel algorithms require effiaent support for reduction mllectives. Over the years, researchers have developed optimal reduction algonduns by taking inm account system size, dam size, and complexities of reduction operations. However, all of these algorithm have assumed the faa that the reduction precessing takes place on the host CPU. Modem Network Interface Cards (NICs) sport programmable processors with substantial memory and thus introduce a fresh variable into the equation This raises the following intersting challenge: Can we take advantage of modern NICs to implementJost redudion operations? In this paper, we take on this challenge in the context of large-scale clusters. Through experiments on the 960-node, 1920-processor or ASCI Linux Cluster (ALC) located at the Lawrence Livermore National Laboratory, we show that NIC-based reductions indeed perform with reduced latency and immed consistency over host-based aleorithms for the wmmon case and that these benefits scale as the system grows. In the largest configuration tested--1812 processors-- our NIC-based algorithm can sum a single element vector in 73 ps with 32-bi integers and in 118 with Mbit floating-point numnbers. These results represent an improvement, respeaively, of 121% and 39% with resvect w the {approx}roductionle vel MPI library

View Conference

Cite

Export

Save

Research Organization:: Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)

Sponsoring Organization:: USDOE

OSTI ID:: 976664

Report Number(s):: LA-UR-03-3208; TRN: US201017%%808

Resource Relation:: Conference: Submitted to: Supercomputing 2003, Phoenix, AZ, November 2003

Country of Publication:: United States

Language:: English

Similar Records

NIC-based Reduction Algorithms for Large-scale Clusters

Journal Article · Fri Jul 30 00:00:00 EDT 2004 · International Journal of High Performance Computing and Networking, vol. 4, no. 3-2, June 1, 2006, pp. 122-136 · OSTI ID:976664

Petrini, F; Moody, A T; Fernandez, J; +2 more

Bringing large-scale multiple genome analysis one step closer: ScalaBLAST and beyond

Technical Report · Fri Jun 01 00:00:00 EDT 2007 · OSTI ID:976664

Oehmen, Christopher S; Sofia, Heidi J; Baxter, Douglas; +5 more

Software-Driven Network Architecture for Synchronous Data Acquisition

Technical Report · Fri Jul 10 00:00:00 EDT 2020 · OSTI ID:976664

McMillian, Gary; McMillian, Brett; DeWitt, Michael

Related Subjects

99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
ALGORITHMS
CONFIGURATION
LAWRENCE LIVERMORE NATIONAL LABORATORY
VECTORS
SUPERCOMPUTERS
PARALLEL PROCESSING

Title: Scalable NIC-based reduction on large-scale clusters

Citation Formats

Similar Records

Related Subjects