GraphMP: I/O-Efficient big graph analytics on a single commodity machine

Recent studies showed that single-machine graph processing systems can be as highly competitive as cluster-based approaches on large-scale problems. While several out-of-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce per...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Peng, WEN, Yonggang, TA, Nguyen Binh Duong, XIAO, Xiaokui
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4847
https://ink.library.smu.edu.sg/context/sis_research/article/5850/viewcontent/GraphMP__PV.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5850
record_format dspace
spelling sg-smu-ink.sis_research-58502022-07-26T09:19:07Z GraphMP: I/O-Efficient big graph analytics on a single commodity machine SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIAO, Xiaokui Recent studies showed that single-machine graph processing systems can be as highly competitive as cluster-based approaches on large-scale problems. While several out-of-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on a single machine. GraphMP achieves low disk I/O overhead with three techniques. First, we design a vertex-centric sliding window (VSW) computation model to avoid reading and writing vertices on disk. Second, we propose a selective scheduling method to skip loading and processing unnecessary edge shards on disk. Third, we use a compressed edge cache mechanism to fully utilize the available memory of a machine to reduce the amount of disk accesses for edges. Extensive evaluations have shown that GraphMP could outperform existing single-machine out-of-core systems such as GraphChi, X-Stream and GridGraph by up to 30, and can be as highly competitive as distributed graph engines like Pregel+, PowerGraph and Chaos. 2020-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4847 info:doi/10.1109/TBDATA.2019.2908384 https://ink.library.smu.edu.sg/context/sis_research/article/5850/viewcontent/GraphMP__PV.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Graph Processing Big Data Parallel Computing Vertex-Centric Programming Model Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Graph Processing
Big Data
Parallel Computing
Vertex-Centric Programming Model
Software Engineering
spellingShingle Graph Processing
Big Data
Parallel Computing
Vertex-Centric Programming Model
Software Engineering
SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIAO, Xiaokui
GraphMP: I/O-Efficient big graph analytics on a single commodity machine
description Recent studies showed that single-machine graph processing systems can be as highly competitive as cluster-based approaches on large-scale problems. While several out-of-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on a single machine. GraphMP achieves low disk I/O overhead with three techniques. First, we design a vertex-centric sliding window (VSW) computation model to avoid reading and writing vertices on disk. Second, we propose a selective scheduling method to skip loading and processing unnecessary edge shards on disk. Third, we use a compressed edge cache mechanism to fully utilize the available memory of a machine to reduce the amount of disk accesses for edges. Extensive evaluations have shown that GraphMP could outperform existing single-machine out-of-core systems such as GraphChi, X-Stream and GridGraph by up to 30, and can be as highly competitive as distributed graph engines like Pregel+, PowerGraph and Chaos.
format text
author SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIAO, Xiaokui
author_facet SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIAO, Xiaokui
author_sort SUN, Peng
title GraphMP: I/O-Efficient big graph analytics on a single commodity machine
title_short GraphMP: I/O-Efficient big graph analytics on a single commodity machine
title_full GraphMP: I/O-Efficient big graph analytics on a single commodity machine
title_fullStr GraphMP: I/O-Efficient big graph analytics on a single commodity machine
title_full_unstemmed GraphMP: I/O-Efficient big graph analytics on a single commodity machine
title_sort graphmp: i/o-efficient big graph analytics on a single commodity machine
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/4847
https://ink.library.smu.edu.sg/context/sis_research/article/5850/viewcontent/GraphMP__PV.pdf
_version_ 1770575062174269440