Experimental analysis of the application of the gsketch partitioning method onto the gmatrix graph-stream sketch

This report presents a final year project that is about an experimental analysis of applying the gSketch partitioning method onto the gMatrix graph-stream sketch. The report first introduces how the gSketch partitioning method can be applied onto the gMatrix sketch and proposes optimizations for the...

全面介紹

Saved in:
書目詳細資料
主要作者: Lim, Eric Leonardo
其他作者: Arjit Khan
格式: Final Year Project
語言:English
出版: 2018
主題:
在線閱讀:http://hdl.handle.net/10356/74053
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:This report presents a final year project that is about an experimental analysis of applying the gSketch partitioning method onto the gMatrix graph-stream sketch. The report first introduces how the gSketch partitioning method can be applied onto the gMatrix sketch and proposes optimizations for the method, and then analyzes how the gSketch partitioning method changes how gMatrix answers various query types, such as edge frequency, heavy-hitter edges, and node aggregate-frequency queries, and how the performance and probabilistic accuracy guarantees change, and after that, shows experimental results with metrics that each evaluates differently how partitioning affects gMatrix's accuracy for answering the different query types on up to three different graph-stream datasets. Finally, the report concludes that the gSketch partitioning method successfully improves the accuracy of gMatrix in query types such as edge frequency estimation and source-node aggregate-frequency estimation, although fails to bring the same improvements onto the destination-node aggregate-frequency estimation and heavy-hitter edge queries.