GPU-accelerated graph label propagation for real-time fraud detection

Fraud detection is a pressing challenge for most financial and commercial platforms. In this paper, we study the processing pipeline of fraud detection in a large e-commerce platform of TaoBao. Graph label propagation (LP) is a core component in this pipeline to detect suspicious clusters from the u...

Full description

Saved in:
Bibliographic Details
Main Authors: YE, Chang, LI, Yuchen, HE, Bingsheng, LI, Zhao, SUN, Jianling
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6135
https://ink.library.smu.edu.sg/context/sis_research/article/7138/viewcontent/3448016.3452774.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7138
record_format dspace
spelling sg-smu-ink.sis_research-71382021-09-29T12:12:45Z GPU-accelerated graph label propagation for real-time fraud detection YE, Chang LI, Yuchen HE, Bingsheng LI, Zhao SUN, Jianling Fraud detection is a pressing challenge for most financial and commercial platforms. In this paper, we study the processing pipeline of fraud detection in a large e-commerce platform of TaoBao. Graph label propagation (LP) is a core component in this pipeline to detect suspicious clusters from the user-interaction graph. Furthermore, the run-time of the LP component occupies 75% overhead of TaoBao’s automated detection pipeline. To enable real-time fraud detection, we propose a GPU-based framework, called GLP, to support large-scale LP workloads in enterprises. We have identified two key challenges when integrating GPU acceleration into TaoBao’s data processing pipeline: (1) programmability for evolving fraud detection logics; (2) demand for real-time performance. Motivated by these challenges, we offer a set of expressive APIs that data engineers can customize and deploy efficient LP algorithms on GPUs with ease. We propose novel GPU-centric optimizations by leveraging the community as well as power-law properties of large graphs. Extensive experiments have confirmed the effectiveness of our proposed optimizations. With a single GPU, GLP supports a real billion-scale graph workload from the fraud detection pipeline of TaoBao and achieves 8.2x speedup to the current in-house distributed solution running on high-end multicore machines. 2021-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6135 info:doi/10.1145/3448016.3452774 https://ink.library.smu.edu.sg/context/sis_research/article/7138/viewcontent/3448016.3452774.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
YE, Chang
LI, Yuchen
HE, Bingsheng
LI, Zhao
SUN, Jianling
GPU-accelerated graph label propagation for real-time fraud detection
description Fraud detection is a pressing challenge for most financial and commercial platforms. In this paper, we study the processing pipeline of fraud detection in a large e-commerce platform of TaoBao. Graph label propagation (LP) is a core component in this pipeline to detect suspicious clusters from the user-interaction graph. Furthermore, the run-time of the LP component occupies 75% overhead of TaoBao’s automated detection pipeline. To enable real-time fraud detection, we propose a GPU-based framework, called GLP, to support large-scale LP workloads in enterprises. We have identified two key challenges when integrating GPU acceleration into TaoBao’s data processing pipeline: (1) programmability for evolving fraud detection logics; (2) demand for real-time performance. Motivated by these challenges, we offer a set of expressive APIs that data engineers can customize and deploy efficient LP algorithms on GPUs with ease. We propose novel GPU-centric optimizations by leveraging the community as well as power-law properties of large graphs. Extensive experiments have confirmed the effectiveness of our proposed optimizations. With a single GPU, GLP supports a real billion-scale graph workload from the fraud detection pipeline of TaoBao and achieves 8.2x speedup to the current in-house distributed solution running on high-end multicore machines.
format text
author YE, Chang
LI, Yuchen
HE, Bingsheng
LI, Zhao
SUN, Jianling
author_facet YE, Chang
LI, Yuchen
HE, Bingsheng
LI, Zhao
SUN, Jianling
author_sort YE, Chang
title GPU-accelerated graph label propagation for real-time fraud detection
title_short GPU-accelerated graph label propagation for real-time fraud detection
title_full GPU-accelerated graph label propagation for real-time fraud detection
title_fullStr GPU-accelerated graph label propagation for real-time fraud detection
title_full_unstemmed GPU-accelerated graph label propagation for real-time fraud detection
title_sort gpu-accelerated graph label propagation for real-time fraud detection
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6135
https://ink.library.smu.edu.sg/context/sis_research/article/7138/viewcontent/3448016.3452774.pdf
_version_ 1770575834727317504