Applying graph partitioning methods in measurement-based dynamic load balancing
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
- Univ. of Bordeaux (France). Bordeaux Lab. for Research in Computer Science
- Univ. of Illinois, Urbana-Champaign, IL (United States)
- Univ. of Bordeaux (France). Bordeaux Lab. for Research in Computer Science
Load imbalance leads to an increasing waste of resources as an application is scaled to more and more processors. Achieving the best parallel efficiency for a program requires optimal load balancing which is a NP-hard problem. However, finding near-optimal solutions to this problem for complex computational science and engineering applications is becoming increasingly important. Charm++, a migratable objects based programming model, provides a measurement-based dynamic load balancing framework. This framework instruments and then migrates over-decomposed objects to balance computational load and communication at runtime. This paper explores the use of graph partitioning algorithms, traditionally used for partitioning physical domains/meshes, for measurement-based dynamic load balancing of parallel applications. In particular, we present repartitioning methods developed in a graph partitioning toolbox called SCOTCH that consider the previous mapping to minimize migration costs. We also discuss a new imbalance reduction algorithm for graphs with irregular load distributions. We compare several load balancing algorithms using microbenchmarks on Intrepid and Ranger and evaluate the effect of communication, number of cores and number of objects on the benefit achieved from load balancing. New algorithms developed in SCOTCH lead to better performance compared to the METIS partitioners for several cases, both in terms of the application execution time and fewer number of objects migrated.
- Research Organization:
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- W-7405-ENG-48
- OSTI ID:
- 1114706
- Report Number(s):
- LLNL-TR--501974
- Country of Publication:
- United States
- Language:
- English
Similar Records
Applying Graph Partitioning Methods in Measurement-Based Dynamic Load Balancing
Parallel algorithms for dynamically partitioning unstructured grids
Fast shared-memory streaming multilevel graph partitioning
Technical Report
·
Sun Feb 26 23:00:00 EST 2012
·
OSTI ID:1093410
Parallel algorithms for dynamically partitioning unstructured grids
Conference
·
Sat Oct 01 00:00:00 EDT 1994
·
OSTI ID:10189870
Fast shared-memory streaming multilevel graph partitioning
Journal Article
·
Fri Sep 11 20:00:00 EDT 2020
· Journal of Parallel and Distributed Computing
·
OSTI ID:1844373