OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters

Abstract

We present an optimized Floyd-Warshall (FW) algorithm that computes all-pairs shortest paths (APSP) on GPU-accelerated clusters. Owing to its structural similarity to matrix multiplication, the Floyd-Warshall algorithm is well suited to highly parallel GPU architectures. To achieve high parallel efficiency, we address two key algorithmic challenges: high communication overhead and limited GPU memory. To reduce communication costs, we redesign the parallel algorithm to (a) expose more parallelism, (b) aggressively overlap communication and computation via pipelined, asynchronous scheduling of operations, and (c) use tailored MPI collectives. To cope with limited GPU memory, we employ an offload model in which the data resides on the host and is transferred to the GPU on demand. The proposed optimizations are supported by detailed performance models for tuning. Our optimized parallel Floyd-Warshall implementation is up to 5x faster than a strong baseline and achieves 8.1 PetaFLOP/s on 256 nodes of the Summit supercomputer at Oak Ridge National Laboratory. This performance represents 70% of the theoretical peak and 80% parallel efficiency. The offload algorithm can handle 2.5x larger graphs with a 20% increase in overall running time.
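The abstract notes the structural similarity between Floyd-Warshall and matrix multiplication. Below is a minimal single-node C sketch of the blocked (min, +) formulation that underlies such GPU implementations: one tile kernel is applied in three phases per pivot step, and the phase-3 bulk is exactly a min-plus analogue of tiled matrix multiplication. The tile size, toy graph, and all names are illustrative assumptions; this is not the authors' distributed multi-GPU code, which additionally partitions tiles across GPUs and overlaps broadcasts of the pivot tile row and column with the phase-3 work.

/* Blocked Floyd-Warshall over the (min, +) semiring.
 * Illustrative sketch only; sizes and names are assumptions. */
#include <stdio.h>

#define N 8           /* number of vertices (toy size) */
#define B 4           /* tile width; N must be divisible by B */

static float d[N][N]; /* distance matrix, row-major, 1e30f = "infinity" */

/* FW-style min-plus update of tile C using tiles A and B:
 *   c[i][j] = min(c[i][j], a[i][k] + b[k][j]) for k = 0..B-1.
 * The k loop is outermost so in-place updates (when C aliases A or B,
 * as in phases 1 and 2) see the freshest values; when all three tiles
 * are distinct (phase 3), this is exactly a tiled min-plus GEMM. */
static void fw_tile(float *c, const float *a, const float *b) {
    for (int k = 0; k < B; ++k)
        for (int i = 0; i < B; ++i)
            for (int j = 0; j < B; ++j) {
                float via = a[i * N + k] + b[k * N + j];
                if (via < c[i * N + j]) c[i * N + j] = via;
            }
}

/* Pointer to the top-left element of tile (I, J) inside d. */
#define TILE(I, J) (&d[(I) * B][(J) * B])

int main(void) {
    /* Toy graph: 0 on the diagonal, "infinity" elsewhere, a few edges. */
    for (int i = 0; i < N; ++i)
        for (int j = 0; j < N; ++j)
            d[i][j] = (i == j) ? 0.0f : 1e30f;
    d[0][3] = 2; d[3][5] = 1; d[5][7] = 4; d[0][1] = 9; d[1][7] = 10;

    int T = N / B;                        /* tiles per dimension */
    for (int K = 0; K < T; ++K) {
        /* Phase 1: pivot (diagonal) tile updates itself. */
        fw_tile(TILE(K, K), TILE(K, K), TILE(K, K));
        /* Phase 2: pivot row and pivot column tiles. */
        for (int J = 0; J < T; ++J) if (J != K)
            fw_tile(TILE(K, J), TILE(K, K), TILE(K, J));
        for (int I = 0; I < T; ++I) if (I != K)
            fw_tile(TILE(I, K), TILE(I, K), TILE(K, K));
        /* Phase 3: all remaining tiles; this GEMM-like bulk is what a
         * multi-GPU version distributes and overlaps with the
         * broadcasts of the K-th tile row and column. */
        for (int I = 0; I < T; ++I) if (I != K)
            for (int J = 0; J < T; ++J) if (J != K)
                fw_tile(TILE(I, J), TILE(I, K), TILE(K, J));
    }
    printf("d(0,7) = %g\n", d[0][7]);     /* expect 7 via 0 -> 3 -> 5 -> 7 */
    return 0;
}

Compiled with a plain "cc fw.c && ./a.out", the sketch prints d(0,7) = 7, the length of the path 0 -> 3 -> 5 -> 7 in the toy graph; the shorter two-hop alternative through vertex 1 costs 19 and is correctly discarded.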

Authors:
Sao, Piyush [1]; Lu, Hao [1]; Kannan, Ramakrishnan [1]; Thakkar, Vijay [2]; Vuduc, Richard [3]; Potok, Thomas [1]
  1. ORNL
  2. Georgia Institute of Technology
  3. Georgia Institute of Technology, Atlanta
Publication Date:
June 2020
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
OSTI Identifier:
1814306
DOE Contract Number:  
AC05-00OR22725
Resource Type:
Conference
Resource Relation:
Conference: 30th International Symposium on High-Performance Parallel and Distributed Computing (HPDC '21), Stockholm, Sweden, June 21-24, 2021
Country of Publication:
United States
Language:
English

Citation Formats

Sao, Piyush, Lu, Hao, Kannan, Ramakrishnan, Thakkar, Vijay, Vuduc, Richard, and Potok, Thomas. Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters. United States: N. p., 2020. Web.
Sao, Piyush, Lu, Hao, Kannan, Ramakrishnan, Thakkar, Vijay, Vuduc, Richard, & Potok, Thomas. Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters. United States.
Sao, Piyush, Lu, Hao, Kannan, Ramakrishnan, Thakkar, Vijay, Vuduc, Richard, and Potok, Thomas. 2020. "Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters". United States. https://www.osti.gov/servlets/purl/1814306.
@inproceedings{osti_1814306,
title = {Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters},
author = {Sao, Piyush and Lu, Hao and Kannan, Ramakrishnan and Thakkar, Vijay and Vuduc, Richard and Potok, Thomas},
abstractNote = {We present an optimized Floyd-Warshall (FW) algorithm that computes all-pairs shortest paths (APSP) on GPU-accelerated clusters. Owing to its structural similarity to matrix multiplication, the Floyd-Warshall algorithm is well suited to highly parallel GPU architectures. To achieve high parallel efficiency, we address two key algorithmic challenges: high communication overhead and limited GPU memory. To reduce communication costs, we redesign the parallel algorithm to (a) expose more parallelism, (b) aggressively overlap communication and computation via pipelined, asynchronous scheduling of operations, and (c) use tailored MPI collectives. To cope with limited GPU memory, we employ an offload model in which the data resides on the host and is transferred to the GPU on demand. The proposed optimizations are supported by detailed performance models for tuning. Our optimized parallel Floyd-Warshall implementation is up to 5x faster than a strong baseline and achieves 8.1 PetaFLOP/s on 256 nodes of the Summit supercomputer at Oak Ridge National Laboratory. This performance represents 70% of the theoretical peak and 80% parallel efficiency. The offload algorithm can handle 2.5x larger graphs with a 20% increase in overall running time.},
url = {https://www.osti.gov/biblio/1814306},
place = {United States},
year = {2020},
month = {6}
}

Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.
