Improving large graph processing on partitioned graphs in the cloud

As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various data-intensive applications in cloud computing, developing large graph processing systems has become a hot and fruitful research area. Many of those existing systems support a vertex-oriented execution m...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Rishan, Yang, Mao, Weng, Xuetian, Choi, Byron, He, Bingsheng, Li, Xiaoming
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/99469
http://hdl.handle.net/10220/12588
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-99469
record_format dspace
spelling sg-ntu-dr.10356-994692020-05-28T07:17:57Z Improving large graph processing on partitioned graphs in the cloud Chen, Rishan Yang, Mao Weng, Xuetian Choi, Byron He, Bingsheng Li, Xiaoming School of Computer Engineering Symposium on Cloud Computing (3rd : 2012) DRNTU::Engineering::Computer science and engineering As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various data-intensive applications in cloud computing, developing large graph processing systems has become a hot and fruitful research area. Many of those existing systems support a vertex-oriented execution model and allow users to develop custom logics on vertices. However, the inherently random access pattern on the vertex-oriented computation generates a significant amount of network traffic. While graph partitioning is known to be effective to reduce network traffic in graph processing, there is little attention given to how graph partitioning can be effectively integrated into large graph processing in the cloud environment. In this paper, we develop a novel graph partitioning framework to improve the network performance of graph partitioning itself, partitioned graph storage and vertex-oriented graph processing. All optimizations are specifically designed for the cloud network environment. In experiments, we develop a system prototype following Pregel (the latest vertex-oriented graph engine by Google), and extend it with our graph partitioning framework. We conduct the experiments with a real-world social network and synthetic graphs over 100GB each in a local cluster and on Amazon EC2. Our experimental results demonstrate the efficiency of our graph partitioning framework, and the effectiveness of network performance aware optimizations on the large graph processing engine. 2013-07-31T04:06:25Z 2019-12-06T20:07:50Z 2013-07-31T04:06:25Z 2019-12-06T20:07:50Z 2012 2012 Conference Paper Chen, R., Yang, M., Weng, X., Choi, B., He, B., & Li, X. (2012). Improving large graph processing on partitioned graphs in the cloud. Proceedings of the Third ACM Symposium on Cloud Computing - SoCC '12. https://hdl.handle.net/10356/99469 http://hdl.handle.net/10220/12588 10.1145/2391229.2391232 en
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Chen, Rishan
Yang, Mao
Weng, Xuetian
Choi, Byron
He, Bingsheng
Li, Xiaoming
Improving large graph processing on partitioned graphs in the cloud
description As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various data-intensive applications in cloud computing, developing large graph processing systems has become a hot and fruitful research area. Many of those existing systems support a vertex-oriented execution model and allow users to develop custom logics on vertices. However, the inherently random access pattern on the vertex-oriented computation generates a significant amount of network traffic. While graph partitioning is known to be effective to reduce network traffic in graph processing, there is little attention given to how graph partitioning can be effectively integrated into large graph processing in the cloud environment. In this paper, we develop a novel graph partitioning framework to improve the network performance of graph partitioning itself, partitioned graph storage and vertex-oriented graph processing. All optimizations are specifically designed for the cloud network environment. In experiments, we develop a system prototype following Pregel (the latest vertex-oriented graph engine by Google), and extend it with our graph partitioning framework. We conduct the experiments with a real-world social network and synthetic graphs over 100GB each in a local cluster and on Amazon EC2. Our experimental results demonstrate the efficiency of our graph partitioning framework, and the effectiveness of network performance aware optimizations on the large graph processing engine.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Chen, Rishan
Yang, Mao
Weng, Xuetian
Choi, Byron
He, Bingsheng
Li, Xiaoming
format Conference or Workshop Item
author Chen, Rishan
Yang, Mao
Weng, Xuetian
Choi, Byron
He, Bingsheng
Li, Xiaoming
author_sort Chen, Rishan
title Improving large graph processing on partitioned graphs in the cloud
title_short Improving large graph processing on partitioned graphs in the cloud
title_full Improving large graph processing on partitioned graphs in the cloud
title_fullStr Improving large graph processing on partitioned graphs in the cloud
title_full_unstemmed Improving large graph processing on partitioned graphs in the cloud
title_sort improving large graph processing on partitioned graphs in the cloud
publishDate 2013
url https://hdl.handle.net/10356/99469
http://hdl.handle.net/10220/12588
_version_ 1681056269134200832