Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

MCS+: An Efficient Algorithm for Crawling the Community Structure in Multiplex Networks

Journal Article · · ACM Transactions on Knowledge Discovery from Data
DOI:https://doi.org/10.1145/3451527· OSTI ID:1810757
 [1];  [2];  [1]
  1. Syracuse Univ., NY (United States)
  2. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

In this article, we consider the problem of crawling a multiplex network to identify the community structure of a layer-of-interest. A multiplex network is one where there are multiple types of relationships between the nodes. In many multiplex networks, some layers might be easier to explore (in terms of time, money etc.). We propose MCS+, an algorithm that can use the information from the easier to explore layers to help in the exploration of a layer-of-interest that is expensive to explore. We consider the goal of exploration to be generating a sample that is representative of the communities in the complete layer-of-interest. This work has practical applications in areas such as exploration of dark (e.g., criminal) networks, online social networks, biological networks, and so on. For example, in a terrorist network, relationships such as phone records, e-mail records, and so on are easier to collect; in contrast, data on the face-to-face communications are much harder to collect, but also potentially more valuable. We perform extensive experimental evaluations on real-world networks, and we observe that MCS+ consistently outperforms the best baseline—the similarity of the sample that MCS+ generates to the real network is up to three times that of the best baseline in some networks. We also perform theoretical and experimental evaluations on the scalability of MCS+ to network properties, and find that it scales well with the budget, number of layers in the multiplex network, and the average degree in the original network.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC04-94AL85000; NA0003525
OSTI ID:
1810757
Report Number(s):
SAND--2020-14338J; 693137
Journal Information:
ACM Transactions on Knowledge Discovery from Data, Journal Name: ACM Transactions on Knowledge Discovery from Data Journal Issue: 1 Vol. 16; ISSN 1556-4681
Publisher:
Association for Computing Machinery (ACM)Copyright Statement
Country of Publication:
United States
Language:
English

References (48)

Value-Difference Based Exploration: Adaptive Control between Epsilon-Greedy and Softmax book January 2011
ABACUS: frequent pAttern mining-BAsed Community discovery in mUltidimensional networkS journal July 2013
The anatomy of a large-scale hypertextual Web search engine journal April 1998
Identifying the community structure of the international-trade multi-network journal June 2011
Social Network Analysis book January 1994
Finite-time Analysis of the Multiarmed Bandit Problem journal May 2002
Network neuroscience journal February 2017
Modularity and community structure in networks journal May 2006
Maps of random walks on complex networks reveal community structure journal January 2008
Multirelational organization of large-scale social networks in an online world journal July 2010
Community structure in social and biological networks journal June 2002
Navigability of interconnected networks under random failures journal May 2014
Building Protein-Protein Interaction Networks with Proteomics and Informatics Tools journal July 2011
Detecting the overlapping and hierarchical community structure in complex networks journal March 2009
Comparing community structure identification journal September 2005
Fast unfolding of communities in large networks journal October 2008
Evaluating accuracy of community detection using the relative normalized mutual information journal November 2015
Multilayer networks journal July 2014
BioGRID: a general repository for interaction datasets journal January 2006
Link prediction in multiplex online social networks journal February 2017
Finding and evaluating community structure in networks journal February 2004
Finding community structure in very large networks journal December 2004
Stochastic blockmodels and community structure in networks journal January 2011
Limits of modularity maximization in community detection journal December 2011
Metropolis Algorithms for Representative Subgraph Sampling conference December 2008
Uncoverning Groups via Heterogeneous Interaction Analysis conference December 2009
Community Detection in Networks with Node Attributes
  • Yang, Jaewon; McAuley, Julian; Leskovec, Jure
  • 2013 IEEE International Conference on Data Mining (ICDM), 2013 IEEE 13th International Conference on Data Mining https://doi.org/10.1109/ICDM.2013.167
conference December 2013
Predicted max degree sampling: Sampling in directed networks to maximize node coverage through crawling
  • Laishram, Ricky; Areekijseree, Katchaguy; Soundarajan, Sucheta
  • 2017 IEEE Conference on Computer Communications: Workshops (INFOCOM WKSHPS), 2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) https://doi.org/10.1109/INFCOMW.2017.8116502
conference May 2017
Overlapping Multi-Bandit Best Arm Identification conference July 2019
Integrating Omics Data With a Multiplex Network-Based Approach for the Identification of Cancer Subtypes journal June 2016
The Distribution of the Flora in the Alpine Zone.1 journal February 1912
Towards real-world complexity: an introduction to multiplex networks journal February 2015
A multilayer approach to multiplexity and link prediction in online geo-social networks journal July 2016
Modeling the multi-layer nature of the European Air Transport Network: Resilience and passengers re-scheduling under random failures journal January 2013
Sampling from large graphs conference January 2006
Sampling community structure conference January 2010
RolX: structural role extraction & mining in large graphs
  • Henderson, Keith; Gallagher, Brian; Eliassi-Rad, Tina
  • Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12 https://doi.org/10.1145/2339530.2339723
conference January 2012
Overlapping community detection at scale: a nonnegative matrix factorization approach conference January 2013
node2vec: Scalable Feature Learning for Networks conference January 2016
Guidelines for Online Network Crawling: A Study of Data Collection Approaches and Network Properties
  • Areekijseree, Katchaguy; Laishram, Ricky; Soundarajan, Sucheta
  • WebSci '18: 10th ACM Conference on Web Science, Proceedings of the 10th ACM Conference on Web Science https://doi.org/10.1145/3201064.3201066
conference May 2018
Adaptive on-line page importance computation conference January 2003
Birds of a Feather: Homophily in Social Networks journal August 2001
A two-step framework for inferring direct protein-protein interaction network from AP-MS data journal September 2017
HIPPIE: Integrating Protein Interaction Networks with Experiment Based Quality Scores journal February 2012
Crawling the Community Structure of Multiplex Networks journal July 2019
Multiplexity in Adult Friendships journal June 1979
Characterizing interactions in online social networks during exceptional events journal August 2015
Computing Communities in Large Networks Using Random Walks journal January 2006

Similar Records

Efficient Sampling of Complex Interdependent and Multiplex Networks
Journal Article · Fri Oct 01 00:00:00 EDT 2021 · Journal of Complex Networks · OSTI ID:1829811

Technical services for mine communications research. Task D Applicability of available multiplex carrier equipment for mine telephone systems. Final report, May 1974--Jun 1975
Technical Report · Sun Jun 01 00:00:00 EDT 1975 · OSTI ID:7138744

Multiplexed communication over a high-speed quantum channel
Journal Article · Mon Mar 15 00:00:00 EDT 2010 · Physical Review. A · OSTI ID:21408399