Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch

This report presents a final year project that is about an experimental analysis of applying the gSketch partitioning method onto the gMatrix graph-stream sketch. The report first introduces how the gSketch partitioning method can be applied onto the gMatrix sketch and proposes optimizations for the...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Eric Leonardo
Other Authors: Arjit Khan
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/74053
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-74053
record_format dspace
spelling sg-ntu-dr.10356-740532023-03-03T20:58:02Z Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch Lim, Eric Leonardo Arjit Khan School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering::Data::Data structures This report presents a final year project that is about an experimental analysis of applying the gSketch partitioning method onto the gMatrix graph-stream sketch. The report first introduces how the gSketch partitioning method can be applied onto the gMatrix sketch and proposes optimizations for the method, and then analyzes how the gSketch partitioning method changes how gMatrix answers various query types, such as edge frequency, heavy-hitter edges, and node aggregate-frequency queries, and how the performance and probabilistic accuracy guarantees change, and after that, shows experimental results with metrics that each evaluates differently how partitioning affects gMatrix's accuracy for answering the different query types on up to three different graph-stream datasets. Finally, the report concludes that the gSketch partitioning method successfully improves the accuracy of gMatrix in query types such as edge frequency estimation and source-node aggregate-frequency estimation, although fails to bring the same improvements onto the destination-node aggregate-frequency estimation and heavy-hitter edge queries. Bachelor of Engineering (Computer Science) 2018-04-24T04:10:03Z 2018-04-24T04:10:03Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74053 en Nanyang Technological University 45 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Data::Data structures
spellingShingle DRNTU::Engineering::Computer science and engineering::Data::Data structures
Lim, Eric Leonardo
Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
description This report presents a final year project that is about an experimental analysis of applying the gSketch partitioning method onto the gMatrix graph-stream sketch. The report first introduces how the gSketch partitioning method can be applied onto the gMatrix sketch and proposes optimizations for the method, and then analyzes how the gSketch partitioning method changes how gMatrix answers various query types, such as edge frequency, heavy-hitter edges, and node aggregate-frequency queries, and how the performance and probabilistic accuracy guarantees change, and after that, shows experimental results with metrics that each evaluates differently how partitioning affects gMatrix's accuracy for answering the different query types on up to three different graph-stream datasets. Finally, the report concludes that the gSketch partitioning method successfully improves the accuracy of gMatrix in query types such as edge frequency estimation and source-node aggregate-frequency estimation, although fails to bring the same improvements onto the destination-node aggregate-frequency estimation and heavy-hitter edge queries.
author2 Arjit Khan
author_facet Arjit Khan
Lim, Eric Leonardo
format Final Year Project
author Lim, Eric Leonardo
author_sort Lim, Eric Leonardo
title Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
title_short Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
title_full Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
title_fullStr Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
title_full_unstemmed Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
title_sort experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch
publishDate 2018
url http://hdl.handle.net/10356/74053
_version_ 1759855917309886464