Exploiting reuse for GPU subgraph enumeration

Subgraph enumeration is important for many applications such as network motif discovery, community detection, and frequent subgraph mining. To accelerate the execution, recent works utilize graphics processing units (GPUs) to parallelize subgraph enumeration. The performances of these parallel schem...

Full description

Saved in:
Bibliographic Details
Main Authors: GUO, Wentiao, LI, Yuchen, TAN, Kian-Lee
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
GPU
Online Access:https://ink.library.smu.edu.sg/sis_research/7130
https://ink.library.smu.edu.sg/context/sis_research/article/8133/viewcontent/09247538.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8133
record_format dspace
spelling sg-smu-ink.sis_research-81332024-03-04T07:50:37Z Exploiting reuse for GPU subgraph enumeration GUO, Wentiao LI, Yuchen TAN, Kian-Lee Subgraph enumeration is important for many applications such as network motif discovery, community detection, and frequent subgraph mining. To accelerate the execution, recent works utilize graphics processing units (GPUs) to parallelize subgraph enumeration. The performances of these parallel schemes are dominated by the set intersection operations which account for up to $95\%$ of the total processing time. (Un)surprisingly, a significant portion (as high as $99\%$) of these operations is actually redundant, i.e., the same set of vertices is repeatedly encountered and evaluated. Therefore, in this paper, we seek to salvage and recycle the results of such operations to avoid repeated computation. Our solution consists of two phases. In the first phase, we generate a reusable plan that determines the opportunity for reuse. The plan is based on a novel reuse discovery mechanism that can identify available results to prevent redundant computation. In the second phase, the plan is executed to produce the subgraph enumeration results. This processing is based on a newly designed reusable parallel search strategy that can efficiently maintain and retrieve the results of set intersection operations. Our implementation on GPUs shows that our approach can achieve up to $5$ times speedups compared with the state-of-the-art GPU solutions. 2022-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7130 info:doi/10.1109/TKDE.2020.3035564 https://ink.library.smu.edu.sg/context/sis_research/article/8133/viewcontent/09247538.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Graphics processing units Acceleration Pattern matching Data structures Instruction sets Runtime Data mining Subgraph enumeration GPU Reuse Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Graphics processing units
Acceleration
Pattern matching
Data structures
Instruction sets
Runtime
Data mining
Subgraph enumeration
GPU
Reuse
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Graphics processing units
Acceleration
Pattern matching
Data structures
Instruction sets
Runtime
Data mining
Subgraph enumeration
GPU
Reuse
Databases and Information Systems
Numerical Analysis and Scientific Computing
GUO, Wentiao
LI, Yuchen
TAN, Kian-Lee
Exploiting reuse for GPU subgraph enumeration
description Subgraph enumeration is important for many applications such as network motif discovery, community detection, and frequent subgraph mining. To accelerate the execution, recent works utilize graphics processing units (GPUs) to parallelize subgraph enumeration. The performances of these parallel schemes are dominated by the set intersection operations which account for up to $95\%$ of the total processing time. (Un)surprisingly, a significant portion (as high as $99\%$) of these operations is actually redundant, i.e., the same set of vertices is repeatedly encountered and evaluated. Therefore, in this paper, we seek to salvage and recycle the results of such operations to avoid repeated computation. Our solution consists of two phases. In the first phase, we generate a reusable plan that determines the opportunity for reuse. The plan is based on a novel reuse discovery mechanism that can identify available results to prevent redundant computation. In the second phase, the plan is executed to produce the subgraph enumeration results. This processing is based on a newly designed reusable parallel search strategy that can efficiently maintain and retrieve the results of set intersection operations. Our implementation on GPUs shows that our approach can achieve up to $5$ times speedups compared with the state-of-the-art GPU solutions.
format text
author GUO, Wentiao
LI, Yuchen
TAN, Kian-Lee
author_facet GUO, Wentiao
LI, Yuchen
TAN, Kian-Lee
author_sort GUO, Wentiao
title Exploiting reuse for GPU subgraph enumeration
title_short Exploiting reuse for GPU subgraph enumeration
title_full Exploiting reuse for GPU subgraph enumeration
title_fullStr Exploiting reuse for GPU subgraph enumeration
title_full_unstemmed Exploiting reuse for GPU subgraph enumeration
title_sort exploiting reuse for gpu subgraph enumeration
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7130
https://ink.library.smu.edu.sg/context/sis_research/article/8133/viewcontent/09247538.pdf
_version_ 1794549749355905024