Random Walk with Restart over Dynamic Graphs

Weiren Yu, Julie McCann
2016 2016 IEEE 16th International Conference on Data Mining (ICDM)  
Random Walk with Restart (RWR) is an appealing measure of proximity between nodes based on graph structures. Since real graphs are often large and subject to minor changes, it is prohibitively expensive to recompute proximities from scratch. Previous methods use LU decomposition and degree reordering heuristics, entailing O(|V | 3 ) time and O(|V | 2 ) memory to compute all (|V | 2 ) pairs of node proximities in a static graph. In this paper, a dynamic scheme to assess RWR proximities is
more » ... d: (1) For unit update, we characterize the changes to all-pairs proximities as the outer product of two vectors. We notice that the multiplication of an RWR matrix and its transition matrix, unlike traditional matrix multiplications, is commutative. This can greatly reduce the computation of all-pairs proximities from O(|V | 3 ) to O(|∆|) time for each update without loss of accuracy, where |∆| (≪ |V | 2 ) is the number of affected proximities. (2) To avoid O(|V | 2 ) memory for all pairs of outputs, we also devise efficient partitioning techniques for our dynamic model, which can compute all pairs of proximities segment-wisely within O(l|V |) memory and O(⌈ |V | l ⌉) I/O costs, where 1 ≤ l ≤ |V | is a user-controlled trade-off between memory and I/O costs. (3) For bulk updates, we also devise aggregation and hashing methods, which can discard many unnecessary updates further and handle chunks of unit updates simultaneously. Our experimental results on various datasets demonstrate that our methods can be 1-2 orders of magnitude faster than other competitors while securing scalability and exactness.
doi:10.1109/icdm.2016.0070 dblp:conf/icdm/YuM16 fatcat:4r75e335bnfffnkv73ptdlmx5m