Numerically approximating centrality for graph ranking guarantees
- Georgia Inst. of Technology, Atlanta, GA (United States)
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Many real-world datasets can be represented as graphs. Using iterative solvers to approximate graph centrality measures allows us to obtain a ranking vector on the nodes of the graph, consisting of a number for each vertex in the graph identifying its relative importance. In this study the centrality measures we use are Katz Centrality and PageRank. Given an approximate solution, we use the residual to accurately estimate how much of the ranking matches the ranking given by the exact solution. Using probabilistic matrix norms, we obtain bounds on the accuracy of the approximation compared to the exact solution with respect to the highly ranked nodes and apply numerical analysis to the computation of centrality with iterative methods. This relates the numerical accuracy of the linear solver to the data analysis accuracy of finding the correct ranking. In particular, we answer the question of which pairwise rankings are reliable given an approximate solution to the linear system. Experiments on many real-world undirected and directed networks up to several million vertices and several hundred million edges validate our theory and show that we are able to accurately estimate large portions of the approximation. We also analyze the difference between global centrality scores and personalized scores (w.r.t. specific seed vertices). By analyzing convergence error, we develop confidence in the ranking schemes of data mining. We show we are able to accurately guarantee ranking of vertices with an approximation to centrality metrics faster than current methods.
- Research Organization:
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA); National Science Foundation (NSF)
- Grant/Contract Number:
- AC52-07NA27344; 1339745; LLNL-JRNL-739840
- OSTI ID:
- 1872304
- Alternate ID(s):
- OSTI ID: 1703019
- Report Number(s):
- LLNL-JRNL-739840; 893684
- Journal Information:
- Journal of Computational Science, Vol. 26; ISSN 1877-7503
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Designing an Optimized Water Quality Monitoring Network with Reserved Monitoring Locations
|
journal | April 2019 |
Similar Records
Fast katz and commuters : efficient estimation of social relatedness in large networks.
Active Betweenness Cardinality: Algorithms and Applications