Distributed PageRank computation with improved round complexities
PageRank is a classic measure that effectively evaluates the importance of nodes in large graphs. It has been applied in numerous applications spanning data mining, Web algorithms, recommendation systems, load balancing, search and connectivity structures identification. Computing PageRank for large...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/161776 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-161776 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1617762022-09-20T00:53:44Z Distributed PageRank computation with improved round complexities Luo, Siqiang Wu, Xiaowei Kao, Ben School of Computer Science and Engineering Engineering::Computer science and engineering PageRank Distributed Computation PageRank is a classic measure that effectively evaluates the importance of nodes in large graphs. It has been applied in numerous applications spanning data mining, Web algorithms, recommendation systems, load balancing, search and connectivity structures identification. Computing PageRank for large graphs is challenging and this has motivated the studies of distributed algorithms to compute PageRank. Previously, little works have been spent on the distributed PageRank algorithms with strong guarantees on both complexity and accuracy. In this paper, we focus on the theoretical aspect and study the complexity of distributed PageRank computation based on the well-known congested-clique model with a bandwidth generalization. An existing algorithm proposed by Sarma et al. (2015) can be applied in this model, which estimates PageRanks in an n-node graph using, with high probability, O(logn) communication rounds and a bandwidth of O((logn)3) bits. We present Radar-Push (RP), which is a distributed PageRank algorithm that is strictly better in round complexities. Specifically, Radar-Push uses O(loglogn) communication rounds and an edge bandwidth of O((logn)3) bits. We further show that Radar-Push can be adapted to efficiently compute an important variant of PageRank, namely, the batch one-hop personalized PageRank, in O(loglogn) communication rounds. Ministry of Education (MOE) Nanyang Technological University This research is supported by the Ministry of Education, Singapore, under its Academic Research Fund Tier 1 (RG18/21) and Tier 1 Seed Funding (RS05/21). This research is also supported by NTU startup grant. Xiaowei Wu is funded by FDCT (0143/2020/A3, SKL-IOTSC-2021–2023), the SRG of University of Macau (SRG2020-00020-IOTSC) and GDST (2020B1212030003). 2022-09-20T00:53:44Z 2022-09-20T00:53:44Z 2022 Journal Article Luo, S., Wu, X. & Kao, B. (2022). Distributed PageRank computation with improved round complexities. Information Sciences, 607, 109-125. https://dx.doi.org/10.1016/j.ins.2022.05.108 0020-0255 https://hdl.handle.net/10356/161776 10.1016/j.ins.2022.05.108 2-s2.0-85131464446 607 109 125 en RG18/21 RS05/21 Information Sciences © 2022 Elsevier Inc. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering PageRank Distributed Computation |
spellingShingle |
Engineering::Computer science and engineering PageRank Distributed Computation Luo, Siqiang Wu, Xiaowei Kao, Ben Distributed PageRank computation with improved round complexities |
description |
PageRank is a classic measure that effectively evaluates the importance of nodes in large graphs. It has been applied in numerous applications spanning data mining, Web algorithms, recommendation systems, load balancing, search and connectivity structures identification. Computing PageRank for large graphs is challenging and this has motivated the studies of distributed algorithms to compute PageRank. Previously, little works have been spent on the distributed PageRank algorithms with strong guarantees on both complexity and accuracy. In this paper, we focus on the theoretical aspect and study the complexity of distributed PageRank computation based on the well-known congested-clique model with a bandwidth generalization. An existing algorithm proposed by Sarma et al. (2015) can be applied in this model, which estimates PageRanks in an n-node graph using, with high probability, O(logn) communication rounds and a bandwidth of O((logn)3) bits. We present Radar-Push (RP), which is a distributed PageRank algorithm that is strictly better in round complexities. Specifically, Radar-Push uses O(loglogn) communication rounds and an edge bandwidth of O((logn)3) bits. We further show that Radar-Push can be adapted to efficiently compute an important variant of PageRank, namely, the batch one-hop personalized PageRank, in O(loglogn) communication rounds. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Luo, Siqiang Wu, Xiaowei Kao, Ben |
format |
Article |
author |
Luo, Siqiang Wu, Xiaowei Kao, Ben |
author_sort |
Luo, Siqiang |
title |
Distributed PageRank computation with improved round complexities |
title_short |
Distributed PageRank computation with improved round complexities |
title_full |
Distributed PageRank computation with improved round complexities |
title_fullStr |
Distributed PageRank computation with improved round complexities |
title_full_unstemmed |
Distributed PageRank computation with improved round complexities |
title_sort |
distributed pagerank computation with improved round complexities |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/161776 |
_version_ |
1745574630833258496 |