OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Parallel Heuristics for Scalable Community Detection

Conference · OSTI ID: 1140103

Community detection has become a fundamental operation in numerous graph-theoretic applications. It is used to reveal natural divisions that exist within real-world networks without imposing prior size or cardinality constraints on the set of communities. Despite its potential for application, there is only limited support for community detection on large-scale parallel computers, largely owing to the irregular and inherently sequential nature of the underlying heuristics. In this paper, we present parallelization heuristics for fast community detection using the Louvain method as the serial template. The Louvain method is an iterative heuristic for modularity optimization. Originally developed by Blondel et al. in 2008, the method has become increasingly popular owing to its ability to detect high-modularity community partitions in a fast and memory-efficient manner. However, the method is also inherently sequential, thereby limiting its scalability to problems that can be solved on desktops. Here, we observe certain key properties of this method that present challenges for its parallelization, and consequently propose multiple heuristics that are designed to break the sequential barrier. Our heuristics are agnostic to the underlying parallel architecture. For evaluation purposes, we implemented our heuristics on shared-memory (OpenMP) and distributed-memory (MapReduce-MPI) machines, and tested them on real-world graphs derived from multiple application domains (internet, biological, natural language processing). Experimental results demonstrate the ability of our heuristics to converge to high-modularity solutions comparable to those output by the serial algorithm in nearly the same number of iterations, while also drastically reducing time to solution.
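
To make the abstract's description of the serial template concrete, the following is a minimal Python sketch of modularity and of one local-moving phase in the style of the Louvain method. It is an illustration under stated assumptions, not the authors' implementation: the function names `modularity` and `local_move_phase`, the adjacency-dictionary graph format, and the `max_sweeps` parameter are all hypothetical, the graph is assumed undirected with symmetric weights and no self-loops, and the coarsening phase of the full method is omitted. The vertex loop shows why the heuristic is sequential: each move decision reads community totals already changed by earlier moves in the same sweep.

```python
from collections import defaultdict


def modularity(adj, community):
    """Modularity Q of a partition of an undirected weighted graph.

    adj: dict node -> dict neighbor -> weight (symmetric, no self-loops).
    community: dict node -> community label.
    """
    two_m = sum(w for nbrs in adj.values() for w in nbrs.values())
    if two_m == 0:
        return 0.0
    internal = defaultdict(float)  # intra-community weight, each edge seen twice
    total = defaultdict(float)     # sum of vertex degrees per community
    for u, nbrs in adj.items():
        for v, w in nbrs.items():
            total[community[u]] += w
            if community[u] == community[v]:
                internal[community[u]] += w
    return sum(internal[c] / two_m - (total[c] / two_m) ** 2 for c in total)


def local_move_phase(adj, max_sweeps=100):
    """Greedy vertex moves until no vertex changes community (illustrative)."""
    community = {u: u for u in adj}  # start with singleton communities
    degree = {u: sum(adj[u].values()) for u in adj}
    total = dict(degree)             # total degree of each community
    two_m = sum(degree.values())
    for _ in range(max_sweeps):
        moved = False
        for u in adj:  # sequential sweep: later vertices see earlier moves
            old = community[u]
            links = defaultdict(float)  # weight from u to each adjacent community
            for v, w in adj[u].items():
                links[community[v]] += w
            total[old] -= degree[u]  # take u out of its community
            # Gain of placing u in c, up to a constant positive factor.
            best = old
            best_gain = links[old] - total[old] * degree[u] / two_m
            for c, k_in in links.items():
                gain = k_in - total[c] * degree[u] / two_m
                if gain > best_gain:
                    best, best_gain = c, gain
            total[best] += degree[u]
            community[u] = best
            moved = moved or best != old
        if not moved:
            break
    return community
```

On a toy graph of two triangles joined by a single edge, this sketch places each triangle in its own community. A parallel variant that processes vertices concurrently would see stale values in `total` and `community`, which is the kind of challenge the abstract says the proposed heuristics are designed to address.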

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
Report Number(s):
PNNL-SA-99348; 400470000
Resource Relation:
Conference: 28th IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPS 2014), May 19-23, 2014, Phoenix, Arizona, pp. 1374-1385
Country of Publication:
United States
Language:
English

Similar Records

Grappolo
Software · Feb 12, 2014 · OSTI ID: 1140103

Parallel heuristics for scalable community detection
Journal Article · Aug 1, 2015 · Parallel Computing · OSTI ID: 1140103

Distributed Louvain Algorithm for Graph Community Detection
Conference · May 22, 2018 · OSTI ID: 1140103
