Large-scale graph label propagation on GPUs

Graph label propagation (LP) is a core component in many downstream applications such as fraud detection, recommendation and image segmentation. In this paper, we propose GLP, a GPU-based framework to enable efficient LP processing on large-scale graphs. By investigating the data processing pipeline...

Full description

Saved in:
Bibliographic Details
Main Authors: YE, Chang, LI, Yuchen, HE, Bingsheng, LI, Zhao, SUN, Jianling
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8468
https://ink.library.smu.edu.sg/context/sis_research/article/9471/viewcontent/Large_scaleGraphLabel_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9471
record_format dspace
spelling sg-smu-ink.sis_research-94712024-01-04T09:38:30Z Large-scale graph label propagation on GPUs YE, Chang LI, Yuchen HE, Bingsheng LI, Zhao SUN, Jianling Graph label propagation (LP) is a core component in many downstream applications such as fraud detection, recommendation and image segmentation. In this paper, we propose GLP, a GPU-based framework to enable efficient LP processing on large-scale graphs. By investigating the data processing pipeline in a large e-commerce platform, we have identified two key challenges on integrating GPU-accelerated LP processing to the pipeline: (1) programmability for evolving application logics; (2) demand for real-time performance. Motivated by these challenges, we offer a set of expressive APIs that data engineers can customize and deploy efficient LP algorithms on GPUs with ease. To achieve better performance, we propose novel GPU-centric optimizations by leveraging the community as well as power-law properties of large graphs. Further, we significantly reduce the expensive data transfer cost between CPUs and GPUs by enabling LP processing on compressed graphs. Extensive experiments have confirmed the effectiveness of our proposed approaches over the state-of-the-art GPU methods. Furthermore, our proposed solution supports a real billion-scale graph workload for fraud detection and achieves 13.2× speedup to the current in-house solution running on a high-end multicore machine with compressed graphs. 2023-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8468 info:doi/10.1109/TKDE.2023.3336329 https://ink.library.smu.edu.sg/context/sis_research/article/9471/viewcontent/Large_scaleGraphLabel_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University GPU computing graph label propagation Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic GPU computing
graph
label propagation
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle GPU computing
graph
label propagation
Databases and Information Systems
Numerical Analysis and Scientific Computing
YE, Chang
LI, Yuchen
HE, Bingsheng
LI, Zhao
SUN, Jianling
Large-scale graph label propagation on GPUs
description Graph label propagation (LP) is a core component in many downstream applications such as fraud detection, recommendation and image segmentation. In this paper, we propose GLP, a GPU-based framework to enable efficient LP processing on large-scale graphs. By investigating the data processing pipeline in a large e-commerce platform, we have identified two key challenges on integrating GPU-accelerated LP processing to the pipeline: (1) programmability for evolving application logics; (2) demand for real-time performance. Motivated by these challenges, we offer a set of expressive APIs that data engineers can customize and deploy efficient LP algorithms on GPUs with ease. To achieve better performance, we propose novel GPU-centric optimizations by leveraging the community as well as power-law properties of large graphs. Further, we significantly reduce the expensive data transfer cost between CPUs and GPUs by enabling LP processing on compressed graphs. Extensive experiments have confirmed the effectiveness of our proposed approaches over the state-of-the-art GPU methods. Furthermore, our proposed solution supports a real billion-scale graph workload for fraud detection and achieves 13.2× speedup to the current in-house solution running on a high-end multicore machine with compressed graphs.
format text
author YE, Chang
LI, Yuchen
HE, Bingsheng
LI, Zhao
SUN, Jianling
author_facet YE, Chang
LI, Yuchen
HE, Bingsheng
LI, Zhao
SUN, Jianling
author_sort YE, Chang
title Large-scale graph label propagation on GPUs
title_short Large-scale graph label propagation on GPUs
title_full Large-scale graph label propagation on GPUs
title_fullStr Large-scale graph label propagation on GPUs
title_full_unstemmed Large-scale graph label propagation on GPUs
title_sort large-scale graph label propagation on gpus
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8468
https://ink.library.smu.edu.sg/context/sis_research/article/9471/viewcontent/Large_scaleGraphLabel_av.pdf
_version_ 1787590775144448000