A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs
Abstract
We describe parallel algorithms for computing maximal cardinality matching in a bipartite graph on distributed-memory systems. Unlike traditional algorithms that match one vertex at a time, our algorithms process many unmatched vertices simultaneously using a matrix-algebraic formulation of maximal matching. This generic matrix-algebraic framework is used to develop three efficient maximal matching algorithms with minimal changes. The newly developed algorithms have two benefits over existing graph-based algorithms. First, unlike existing parallel algorithms, cardinality of matching obtained by the new algorithms stays constant with increasing processor counts, which is important for predictable and reproducible performance. Second, relying on bulk-synchronous matrix operations, these algorithms expose a higher degree of parallelism on distributed-memory platforms than existing graph-based algorithms. We report high-performance implementations of three maximal matching algorithms using hybrid OpenMP-MPI and evaluate the performance of these algorithm using more than 35 real and randomly generated graphs. On real instances, our algorithms achieve up to 200 × speedup on 2048 cores of a Cray XC30 supercomputer. Even higher speedups are obtained on larger synthetically generated graphs where our algorithms show good scaling on up to 16,384 cores.
- Authors:
-
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division
- Publication Date:
- Research Org.:
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- OSTI Identifier:
- 1377506
- Alternate Identifier(s):
- OSTI ID: 1397980
- Grant/Contract Number:
- AC02-05CH11231
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Parallel Computing
- Additional Journal Information:
- Journal Volume: 58; Journal Issue: C; Journal ID: ISSN 0167-8191
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING; cardinality matching; bipartite graph; parallel algorithm; matrix-algebra
Citation Formats
Azad, Ariful, and Buluç, Aydın. A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs. United States: N. p., 2016.
Web. doi:10.1016/j.parco.2016.05.007.
Azad, Ariful, & Buluç, Aydın. A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs. United States. https://doi.org/10.1016/j.parco.2016.05.007
Azad, Ariful, and Buluç, Aydın. Mon .
"A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs". United States. https://doi.org/10.1016/j.parco.2016.05.007. https://www.osti.gov/servlets/purl/1377506.
@article{osti_1377506,
title = {A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs},
author = {Azad, Ariful and Buluç, Aydın},
abstractNote = {We describe parallel algorithms for computing maximal cardinality matching in a bipartite graph on distributed-memory systems. Unlike traditional algorithms that match one vertex at a time, our algorithms process many unmatched vertices simultaneously using a matrix-algebraic formulation of maximal matching. This generic matrix-algebraic framework is used to develop three efficient maximal matching algorithms with minimal changes. The newly developed algorithms have two benefits over existing graph-based algorithms. First, unlike existing parallel algorithms, cardinality of matching obtained by the new algorithms stays constant with increasing processor counts, which is important for predictable and reproducible performance. Second, relying on bulk-synchronous matrix operations, these algorithms expose a higher degree of parallelism on distributed-memory platforms than existing graph-based algorithms. We report high-performance implementations of three maximal matching algorithms using hybrid OpenMP-MPI and evaluate the performance of these algorithm using more than 35 real and randomly generated graphs. On real instances, our algorithms achieve up to 200 × speedup on 2048 cores of a Cray XC30 supercomputer. Even higher speedups are obtained on larger synthetically generated graphs where our algorithms show good scaling on up to 16,384 cores.},
doi = {10.1016/j.parco.2016.05.007},
journal = {Parallel Computing},
number = C,
volume = 58,
place = {United States},
year = {Mon May 16 00:00:00 EDT 2016},
month = {Mon May 16 00:00:00 EDT 2016}
}
Web of Science