Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Applying graph partitioning methods in measurement-based dynamic load balancing

Technical Report ·
DOI:https://doi.org/10.2172/1114706· OSTI ID:1114706
 [1];  [2];  [3];  [3];  [4]
  1. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
  2. Univ. of Bordeaux (France). Bordeaux Lab. for Research in Computer Science
  3. Univ. of Illinois, Urbana-Champaign, IL (United States)
  4. Univ. of Bordeaux (France). Bordeaux Lab. for Research in Computer Science
Load imbalance leads to an increasing waste of resources as an application is scaled to more and more processors. Achieving the best parallel efficiency for a program requires optimal load balancing which is a NP-hard problem. However, finding near-optimal solutions to this problem for complex computational science and engineering applications is becoming increasingly important. Charm++, a migratable objects based programming model, provides a measurement-based dynamic load balancing framework. This framework instruments and then migrates over-decomposed objects to balance computational load and communication at runtime. This paper explores the use of graph partitioning algorithms, traditionally used for partitioning physical domains/meshes, for measurement-based dynamic load balancing of parallel applications. In particular, we present repartitioning methods developed in a graph partitioning toolbox called SCOTCH that consider the previous mapping to minimize migration costs. We also discuss a new imbalance reduction algorithm for graphs with irregular load distributions. We compare several load balancing algorithms using microbenchmarks on Intrepid and Ranger and evaluate the effect of communication, number of cores and number of objects on the benefit achieved from load balancing. New algorithms developed in SCOTCH lead to better performance compared to the METIS partitioners for several cases, both in terms of the application execution time and fewer number of objects migrated.
Research Organization:
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
W-7405-ENG-48
OSTI ID:
1114706
Report Number(s):
LLNL-TR--501974
Country of Publication:
United States
Language:
English

Similar Records

Applying Graph Partitioning Methods in Measurement-Based Dynamic Load Balancing
Technical Report · Sun Feb 26 23:00:00 EST 2012 · OSTI ID:1093410

Parallel algorithms for dynamically partitioning unstructured grids
Conference · Sat Oct 01 00:00:00 EDT 1994 · OSTI ID:10189870

Fast shared-memory streaming multilevel graph partitioning
Journal Article · Fri Sep 11 20:00:00 EDT 2020 · Journal of Parallel and Distributed Computing · OSTI ID:1844373