Distributed PageRank computation with improved round complexities

PageRank is a classic measure that effectively evaluates the importance of nodes in large graphs. It has been applied in numerous applications spanning data mining, Web algorithms, recommendation systems, load balancing, search and connectivity structures identification. Computing PageRank for large...

Full description

Saved in:
Bibliographic Details
Main Authors: Luo, Siqiang, Wu, Xiaowei, Kao, Ben
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/161776
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-161776
record_format dspace
spelling sg-ntu-dr.10356-1617762022-09-20T00:53:44Z Distributed PageRank computation with improved round complexities Luo, Siqiang Wu, Xiaowei Kao, Ben School of Computer Science and Engineering Engineering::Computer science and engineering PageRank Distributed Computation PageRank is a classic measure that effectively evaluates the importance of nodes in large graphs. It has been applied in numerous applications spanning data mining, Web algorithms, recommendation systems, load balancing, search and connectivity structures identification. Computing PageRank for large graphs is challenging and this has motivated the studies of distributed algorithms to compute PageRank. Previously, little works have been spent on the distributed PageRank algorithms with strong guarantees on both complexity and accuracy. In this paper, we focus on the theoretical aspect and study the complexity of distributed PageRank computation based on the well-known congested-clique model with a bandwidth generalization. An existing algorithm proposed by Sarma et al. (2015) can be applied in this model, which estimates PageRanks in an n-node graph using, with high probability, O(logn) communication rounds and a bandwidth of O((logn)3) bits. We present Radar-Push (RP), which is a distributed PageRank algorithm that is strictly better in round complexities. Specifically, Radar-Push uses O(loglogn) communication rounds and an edge bandwidth of O((logn)3) bits. We further show that Radar-Push can be adapted to efficiently compute an important variant of PageRank, namely, the batch one-hop personalized PageRank, in O(loglogn) communication rounds. Ministry of Education (MOE) Nanyang Technological University This research is supported by the Ministry of Education, Singapore, under its Academic Research Fund Tier 1 (RG18/21) and Tier 1 Seed Funding (RS05/21). This research is also supported by NTU startup grant. Xiaowei Wu is funded by FDCT (0143/2020/A3, SKL-IOTSC-2021–2023), the SRG of University of Macau (SRG2020-00020-IOTSC) and GDST (2020B1212030003). 2022-09-20T00:53:44Z 2022-09-20T00:53:44Z 2022 Journal Article Luo, S., Wu, X. & Kao, B. (2022). Distributed PageRank computation with improved round complexities. Information Sciences, 607, 109-125. https://dx.doi.org/10.1016/j.ins.2022.05.108 0020-0255 https://hdl.handle.net/10356/161776 10.1016/j.ins.2022.05.108 2-s2.0-85131464446 607 109 125 en RG18/21 RS05/21 Information Sciences © 2022 Elsevier Inc. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
PageRank
Distributed Computation
spellingShingle Engineering::Computer science and engineering
PageRank
Distributed Computation
Luo, Siqiang
Wu, Xiaowei
Kao, Ben
Distributed PageRank computation with improved round complexities
description PageRank is a classic measure that effectively evaluates the importance of nodes in large graphs. It has been applied in numerous applications spanning data mining, Web algorithms, recommendation systems, load balancing, search and connectivity structures identification. Computing PageRank for large graphs is challenging and this has motivated the studies of distributed algorithms to compute PageRank. Previously, little works have been spent on the distributed PageRank algorithms with strong guarantees on both complexity and accuracy. In this paper, we focus on the theoretical aspect and study the complexity of distributed PageRank computation based on the well-known congested-clique model with a bandwidth generalization. An existing algorithm proposed by Sarma et al. (2015) can be applied in this model, which estimates PageRanks in an n-node graph using, with high probability, O(logn) communication rounds and a bandwidth of O((logn)3) bits. We present Radar-Push (RP), which is a distributed PageRank algorithm that is strictly better in round complexities. Specifically, Radar-Push uses O(loglogn) communication rounds and an edge bandwidth of O((logn)3) bits. We further show that Radar-Push can be adapted to efficiently compute an important variant of PageRank, namely, the batch one-hop personalized PageRank, in O(loglogn) communication rounds.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Luo, Siqiang
Wu, Xiaowei
Kao, Ben
format Article
author Luo, Siqiang
Wu, Xiaowei
Kao, Ben
author_sort Luo, Siqiang
title Distributed PageRank computation with improved round complexities
title_short Distributed PageRank computation with improved round complexities
title_full Distributed PageRank computation with improved round complexities
title_fullStr Distributed PageRank computation with improved round complexities
title_full_unstemmed Distributed PageRank computation with improved round complexities
title_sort distributed pagerank computation with improved round complexities
publishDate 2022
url https://hdl.handle.net/10356/161776
_version_ 1745574630833258496