DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Community detection in hypergraphs via mutual information maximization

Journal Article · Scientific Reports

Abstract: The hypergraph community detection problem seeks to identify groups of related vertices in hypergraph data. We propose an information-theoretic hypergraph community detection algorithm which compresses the observed data in terms of community labels and community-edge intersections. This algorithm can also be viewed as maximum-likelihood inference in a degree-corrected microcanonical stochastic blockmodel. We perform the compression/inference step via simulated annealing. Unlike several recent algorithms based on canonical models, our microcanonical algorithm does not require inference of statistical parameters such as vertex degrees or pairwise group connection rates. Through synthetic experiments, we find that our algorithm succeeds down to recently conjectured thresholds for sparse random hypergraphs. We also find competitive performance in cluster recovery tasks on several hypergraph data sets.
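
The abstract names simulated annealing as the optimization step but does not spell out the objective. The following is a minimal sketch of simulated annealing over community labels, not the authors' implementation. It assumes a stand-in objective (the number of hyperedges that span more than one community) in place of the paper's compression/likelihood objective, and it assumes label-swap moves so that community sizes stay fixed. The names `cut_cost` and `anneal`, the cooling schedule, and the toy hypergraph are all hypothetical.

```python
import math
import random


def cut_cost(edges, labels):
    """Stand-in objective (an assumption, not the paper's objective):
    the number of hyperedges whose vertices fall in more than one community."""
    return sum(1 for e in edges if len({labels[v] for v in e}) > 1)


def anneal(edges, labels, steps=2000, t_start=2.0, t_end=0.01, seed=0):
    """Simulated annealing over label assignments using swap moves.

    Swapping the labels of two vertices keeps every community's size fixed,
    which prevents the trivial solution of one giant community.
    Returns the best labelling seen and its cost.
    """
    rng = random.Random(seed)
    current = list(labels)
    cost = cut_cost(edges, current)
    best, best_cost = list(current), cost
    n = len(current)
    for step in range(steps):
        # Geometric cooling from t_start down to t_end.
        temp = t_start * (t_end / t_start) ** (step / max(steps - 1, 1))
        i, j = rng.randrange(n), rng.randrange(n)
        if current[i] == current[j]:
            continue  # swapping equal labels changes nothing
        current[i], current[j] = current[j], current[i]
        new_cost = cut_cost(edges, current)
        delta = new_cost - cost
        # Metropolis rule: always accept improvements, sometimes accept worse moves.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            cost = new_cost
            if cost < best_cost:
                best, best_cost = list(current), cost
        else:
            current[i], current[j] = current[j], current[i]  # reject: undo the swap
    return best, best_cost


# Toy hypergraph (hypothetical data): two planted groups {0,1,2,3} and {4,5,6,7},
# joined by a single crossing edge (3, 4).
edges = [(0, 1, 2), (1, 2, 3), (0, 2, 3), (4, 5, 6), (5, 6, 7), (4, 6, 7), (3, 4)]
start = [0, 1, 0, 1, 0, 1, 0, 1]  # deliberately poor initial labelling
labels, cost = anneal(edges, start)
print(labels, cost)
```

This sketch recomputes the full cost after every proposed swap, which is simple but costs time proportional to the number of hyperedges per step. A practical implementation would update only the hyperedges incident to the two swapped vertices.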

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
2327054
Alternate ID(s):
OSTI ID: 2438439
Journal Information:
Scientific Reports, Vol. 14, Issue 1; ISSN 2045-2322
Publisher:
Nature Publishing Group
Country of Publication:
United Kingdom
Language:
English
