GraphMP: an efficient semi-external-memory big graph processing system on a single machine
Recent studies showed that single-machine graph processing systems can be as highly competitive as clusterbased approaches on large-scale problems. While several outof-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce perfo...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2017
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4764 https://ink.library.smu.edu.sg/context/sis_research/article/5767/viewcontent/1707.02557.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5767 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-57672020-01-16T10:27:44Z GraphMP: an efficient semi-external-memory big graph processing system on a single machine SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIAO, Xiaokui Recent studies showed that single-machine graph processing systems can be as highly competitive as clusterbased approaches on large-scale problems. While several outof-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on a single machine. GraphMP achieves low disk I/O overhead with three techniques. First, we design a vertex-centric sliding window (VSW) computation model to avoid reading and writing vertices on disk. Second, we propose a selective scheduling method to skip loading and processing unnecessary edge shards on disk. Third, we use a compressed edge cache mechanism to fully utilize the available memory of a machine to reduce the amount of disk accesses for edges. Extensive evaluations have shown that GraphMP could outperform state-of-the-art systems such as GraphChi, X-Stream and GridGraph by 31.6x, 54.5x and 23.1x respectively, when running popular graph applications on a billion-vertex graph. 2017-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4764 info:doi/10.1109/ICPADS.2017.00045 https://ink.library.smu.edu.sg/context/sis_research/article/5767/viewcontent/1707.02557.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Graph Processing Big Data Parallel Computing Computer and Systems Architecture Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Graph Processing Big Data Parallel Computing Computer and Systems Architecture Software Engineering |
spellingShingle |
Graph Processing Big Data Parallel Computing Computer and Systems Architecture Software Engineering SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIAO, Xiaokui GraphMP: an efficient semi-external-memory big graph processing system on a single machine |
description |
Recent studies showed that single-machine graph processing systems can be as highly competitive as clusterbased approaches on large-scale problems. While several outof-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on a single machine. GraphMP achieves low disk I/O overhead with three techniques. First, we design a vertex-centric sliding window (VSW) computation model to avoid reading and writing vertices on disk. Second, we propose a selective scheduling method to skip loading and processing unnecessary edge shards on disk. Third, we use a compressed edge cache mechanism to fully utilize the available memory of a machine to reduce the amount of disk accesses for edges. Extensive evaluations have shown that GraphMP could outperform state-of-the-art systems such as GraphChi, X-Stream and GridGraph by 31.6x, 54.5x and 23.1x respectively, when running popular graph applications on a billion-vertex graph. |
format |
text |
author |
SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIAO, Xiaokui |
author_facet |
SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIAO, Xiaokui |
author_sort |
SUN, Peng |
title |
GraphMP: an efficient semi-external-memory big graph processing system on a single machine |
title_short |
GraphMP: an efficient semi-external-memory big graph processing system on a single machine |
title_full |
GraphMP: an efficient semi-external-memory big graph processing system on a single machine |
title_fullStr |
GraphMP: an efficient semi-external-memory big graph processing system on a single machine |
title_full_unstemmed |
GraphMP: an efficient semi-external-memory big graph processing system on a single machine |
title_sort |
graphmp: an efficient semi-external-memory big graph processing system on a single machine |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2017 |
url |
https://ink.library.smu.edu.sg/sis_research/4764 https://ink.library.smu.edu.sg/context/sis_research/article/5767/viewcontent/1707.02557.pdf |
_version_ |
1770575024809312256 |