Graph OLAP: Towards Online Analytical Processing on Graphs

OLAP (On-Line Analytical Processing) is an important notion in data analysis. Recently, more and more graph or networked data sources come into being. There exists a similar need to deploy graph analysis from different perspectives and with multiple granularities. However, traditional OLAP technolog...

Full description

Saved in:
Bibliographic Details
Main Authors: CHEN, CHEN, YAN, Xifeng, ZHU, Feida, Han, Jiawei, YU, Philip S.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2008
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/952
http://dx.doi.org/10.1109/ICDM.2007.75
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1951
record_format dspace
spelling sg-smu-ink.sis_research-19512011-01-05T09:42:50Z Graph OLAP: Towards Online Analytical Processing on Graphs CHEN, CHEN YAN, Xifeng ZHU, Feida Han, Jiawei YU, Philip S. OLAP (On-Line Analytical Processing) is an important notion in data analysis. Recently, more and more graph or networked data sources come into being. There exists a similar need to deploy graph analysis from different perspectives and with multiple granularities. However, traditional OLAP technology cannot handle such demands because it does not consider the links among individual data tuples. In this paper, we develop a novel graph OLAP framework, which presents a multi-dimensional and multi-level view over graphs. The contributions of this work are two-fold. First, starting from basic definitions, i.e., what are dimensions and measures in the graph OLAP scenario, we develop a conceptual framework for data cubes on graphs. We also look into different semantics of OLAP operations, and classify the framework into two major subcases: informational OLAP and topological OLAP. Then, with more emphasis on informational OLAP (topological OLAP will be covered in a future study due to the lack of space), we show how a graph cube can be materialized by calculating a special kind of measure called aggregated graph and how to implement it efficiently. This includes both full materialization and partial materialization where constraints are enforced to obtain an iceberg cube. We can see that the aggregated graphs, which depend on the graph properties of underlying networks, are much harder to compute than their traditional OLAP counterparts, due to the increased structural complexity of data. Empirical studies show insightful results on real datasets and demonstrate the efficiency of our proposed optimizations. 2008-12-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/952 info:doi/10.1109/ICDM.2008.30 http://dx.doi.org/10.1109/ICDM.2007.75 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
CHEN, CHEN
YAN, Xifeng
ZHU, Feida
Han, Jiawei
YU, Philip S.
Graph OLAP: Towards Online Analytical Processing on Graphs
description OLAP (On-Line Analytical Processing) is an important notion in data analysis. Recently, more and more graph or networked data sources come into being. There exists a similar need to deploy graph analysis from different perspectives and with multiple granularities. However, traditional OLAP technology cannot handle such demands because it does not consider the links among individual data tuples. In this paper, we develop a novel graph OLAP framework, which presents a multi-dimensional and multi-level view over graphs. The contributions of this work are two-fold. First, starting from basic definitions, i.e., what are dimensions and measures in the graph OLAP scenario, we develop a conceptual framework for data cubes on graphs. We also look into different semantics of OLAP operations, and classify the framework into two major subcases: informational OLAP and topological OLAP. Then, with more emphasis on informational OLAP (topological OLAP will be covered in a future study due to the lack of space), we show how a graph cube can be materialized by calculating a special kind of measure called aggregated graph and how to implement it efficiently. This includes both full materialization and partial materialization where constraints are enforced to obtain an iceberg cube. We can see that the aggregated graphs, which depend on the graph properties of underlying networks, are much harder to compute than their traditional OLAP counterparts, due to the increased structural complexity of data. Empirical studies show insightful results on real datasets and demonstrate the efficiency of our proposed optimizations.
format text
author CHEN, CHEN
YAN, Xifeng
ZHU, Feida
Han, Jiawei
YU, Philip S.
author_facet CHEN, CHEN
YAN, Xifeng
ZHU, Feida
Han, Jiawei
YU, Philip S.
author_sort CHEN, CHEN
title Graph OLAP: Towards Online Analytical Processing on Graphs
title_short Graph OLAP: Towards Online Analytical Processing on Graphs
title_full Graph OLAP: Towards Online Analytical Processing on Graphs
title_fullStr Graph OLAP: Towards Online Analytical Processing on Graphs
title_full_unstemmed Graph OLAP: Towards Online Analytical Processing on Graphs
title_sort graph olap: towards online analytical processing on graphs
publisher Institutional Knowledge at Singapore Management University
publishDate 2008
url https://ink.library.smu.edu.sg/sis_research/952
http://dx.doi.org/10.1109/ICDM.2007.75
_version_ 1770570791564345344