Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Fast Katz and commuters : efficient estimation of social relatedness.

Conference ·
OSTI ID:1039022

Motivated by social network data mining problems such as link prediction and collaborative filtering, significant research effort has been devoted to computing topological measures including the Katz score and the commute time. Existing approaches typically approximate all pairwise relationships simultaneously. In this paper, we are interested in computing: the score for a single pair of nodes, and the top-k nodes with the best scores from a given source node. For the pairwise problem, we apply an iterative algorithm that computes upper and lower bounds for the measures we seek. This algorithm exploits a relationship between the Lanczos process and a quadrature rule. For the top-k problem, we propose an algorithm that only accesses a small portion of the graph and is related to techniques used in personalized PageRank computing. To test the scalability and accuracy of our algorithms we experiment with three real-world networks and find that these algorithms run in milliseconds to seconds without any preprocessing.

Research Organization:
Sandia National Laboratories
Sponsoring Organization:
USDOE
DOE Contract Number:
AC04-94AL85000
OSTI ID:
1039022
Report Number(s):
SAND2010-8829C
Country of Publication:
United States
Language:
English

Similar Records

Fast katz and commuters : efficient estimation of social relatedness in large networks.
Conference · Wed Sep 01 00:00:00 EDT 2010 · OSTI ID:1030228

Making social networks more human: A topological approach
Journal Article · Wed Jul 24 00:00:00 EDT 2019 · Statistical Analysis and Data Mining · OSTI ID:1559509