OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: MCS+: An Efficient Algorithm for Crawling the Community Structure in Multiplex Networks

Journal Article · ACM Transactions on Knowledge Discovery from Data
DOI: https://doi.org/10.1145/3451527 · OSTI ID: 1810757
 [1];  [2];  [1]
  1. Syracuse Univ., NY (United States)
  2. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

In this article, we consider the problem of crawling a multiplex network to identify the community structure of a layer-of-interest. A multiplex network is one in which there are multiple types of relationships between the nodes. In many multiplex networks, some layers may be easier to explore than others (in terms of time, money, etc.). We propose MCS+, an algorithm that uses information from the easier-to-explore layers to help explore a layer-of-interest that is expensive to explore. We take the goal of exploration to be generating a sample that is representative of the communities in the complete layer-of-interest. This work has practical applications in areas such as the exploration of dark (e.g., criminal) networks, online social networks, and biological networks. For example, in a terrorist network, relationships such as phone and e-mail records are easier to collect; in contrast, data on face-to-face communications are much harder to collect, but also potentially more valuable. We perform extensive experimental evaluations on real-world networks and observe that MCS+ consistently outperforms the best baseline: in some networks, the similarity of the sample that MCS+ generates to the real network is up to three times that of the best baseline. We also evaluate, theoretically and experimentally, how MCS+ scales with network properties, and find that it scales well with the budget, the number of layers in the multiplex network, and the average degree of the original network.

Research Organization:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC04-94AL85000; NA0003525
OSTI ID:
1810757
Report Number(s):
SAND-2020-14338J; 693137
Journal Information:
ACM Transactions on Knowledge Discovery from Data, Vol. 16, Issue 1; ISSN 1556-4681
Publisher:
Association for Computing Machinery (ACM)
Country of Publication:
United States
Language:
English
