GraphMP: an efficient semi-external-memory big graph processing system on a single machine

Recent studies showed that single-machine graph processing systems can be as highly competitive as clusterbased approaches on large-scale problems. While several outof-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce perfo...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Peng, WEN, Yonggang, TA, Nguyen Binh Duong, XIAO, Xiaokui
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4764
https://ink.library.smu.edu.sg/context/sis_research/article/5767/viewcontent/1707.02557.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5767
record_format dspace
spelling sg-smu-ink.sis_research-57672020-01-16T10:27:44Z GraphMP: an efficient semi-external-memory big graph processing system on a single machine SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIAO, Xiaokui Recent studies showed that single-machine graph processing systems can be as highly competitive as clusterbased approaches on large-scale problems. While several outof-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on a single machine. GraphMP achieves low disk I/O overhead with three techniques. First, we design a vertex-centric sliding window (VSW) computation model to avoid reading and writing vertices on disk. Second, we propose a selective scheduling method to skip loading and processing unnecessary edge shards on disk. Third, we use a compressed edge cache mechanism to fully utilize the available memory of a machine to reduce the amount of disk accesses for edges. Extensive evaluations have shown that GraphMP could outperform state-of-the-art systems such as GraphChi, X-Stream and GridGraph by 31.6x, 54.5x and 23.1x respectively, when running popular graph applications on a billion-vertex graph. 2017-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4764 info:doi/10.1109/ICPADS.2017.00045 https://ink.library.smu.edu.sg/context/sis_research/article/5767/viewcontent/1707.02557.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Graph Processing Big Data Parallel Computing Computer and Systems Architecture Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Graph Processing
Big Data
Parallel Computing
Computer and Systems Architecture
Software Engineering
spellingShingle Graph Processing
Big Data
Parallel Computing
Computer and Systems Architecture
Software Engineering
SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIAO, Xiaokui
GraphMP: an efficient semi-external-memory big graph processing system on a single machine
description Recent studies showed that single-machine graph processing systems can be as highly competitive as clusterbased approaches on large-scale problems. While several outof-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on a single machine. GraphMP achieves low disk I/O overhead with three techniques. First, we design a vertex-centric sliding window (VSW) computation model to avoid reading and writing vertices on disk. Second, we propose a selective scheduling method to skip loading and processing unnecessary edge shards on disk. Third, we use a compressed edge cache mechanism to fully utilize the available memory of a machine to reduce the amount of disk accesses for edges. Extensive evaluations have shown that GraphMP could outperform state-of-the-art systems such as GraphChi, X-Stream and GridGraph by 31.6x, 54.5x and 23.1x respectively, when running popular graph applications on a billion-vertex graph.
format text
author SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIAO, Xiaokui
author_facet SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIAO, Xiaokui
author_sort SUN, Peng
title GraphMP: an efficient semi-external-memory big graph processing system on a single machine
title_short GraphMP: an efficient semi-external-memory big graph processing system on a single machine
title_full GraphMP: an efficient semi-external-memory big graph processing system on a single machine
title_fullStr GraphMP: an efficient semi-external-memory big graph processing system on a single machine
title_full_unstemmed GraphMP: an efficient semi-external-memory big graph processing system on a single machine
title_sort graphmp: an efficient semi-external-memory big graph processing system on a single machine
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/4764
https://ink.library.smu.edu.sg/context/sis_research/article/5767/viewcontent/1707.02557.pdf
_version_ 1770575024809312256