Overlapping clusters for distributed computation.
Conference
·
OSTI ID:1030279
- Google Research, New York, NY
- Microsoft Corporation, Redmond, WA
Scalable, distributed algorithms must address communication problems. We investigate overlapping clusters, or vertex partitions that intersect, for graph computations. This setup stores more of the graph than required but then affords the ease of implementation of vertex partitioned algorithms. Our hope is that this technique allows us to reduce communication in a computation on a distributed graph. The motivation above draws on recent work in communication avoiding algorithms. Mohiyuddin et al. (SC09) design a matrix-powers kernel that gives rise to an overlapping partition. Fritzsche et al. (CSC2009) develop an overlapping clustering for a Schwarz method. Both techniques extend an initial partitioning with overlap. Our procedure generates overlap directly. Indeed, Schwarz methods are commonly used to capitalize on overlap. Elsewhere, overlapping communities (Ahn et al, Nature 2009; Mishra et al. WAW2007) are now a popular model of structure in social networks. These have long been studied in statistics (Cole and Wishart, CompJ 1970). We present two types of results: (i) an estimated swapping probability {rho}{infinity}; and (ii) the communication volume of a parallel PageRank solution (link-following {alpha} = 0.85) using an additive Schwarz method. The volume ratio is the amount of extra storage for the overlap (2 means we store the graph twice). Below, as the ratio increases, the swapping probability and PageRank communication volume decreases.
- Research Organization:
- Sandia National Laboratories
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1030279
- Report Number(s):
- SAND2010-7728C
- Country of Publication:
- United States
- Language:
- English
Similar Records
A Partitioning Algorithm for Block-Diagonal Matrices With Overlap
GoFFish: A Sub-Graph Centric Framework for Large-Scale Graph Analytics1
A nested dissection approach to sparse matrix partitioning for parallel computations.
Journal Article
·
Fri Feb 01 23:00:00 EST 2008
· Parallel Computing
·
OSTI ID:941101
GoFFish: A Sub-Graph Centric Framework for Large-Scale Graph Analytics1
Conference
·
Mon Aug 25 00:00:00 EDT 2014
·
OSTI ID:1332680
A nested dissection approach to sparse matrix partitioning for parallel computations.
Journal Article
·
Fri Aug 01 00:00:00 EDT 2008
· Proposed for publication in SIAM Journal on Scientific Computing.
·
OSTI ID:947836