A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery

Despite the wealth of research on frequent graph pattern mining, how to efficiently mine the complete set of those with constraints still poses a huge challenge to the existing algorithms mainly due to the inherent bottleneck in the mining paradigm. In essence, mining requests with explicitly-specif...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHU, Feida, ZHANG, Zequn, QU, Qiang
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2013
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1819
https://ink.library.smu.edu.sg/context/sis_research/article/2818/viewcontent/C53___A_Direct_Mining_Approach_To_Efficient_Constrained_Graph_Pattern_Discovery__SIGMOD2013_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2818
record_format dspace
spelling sg-smu-ink.sis_research-28182018-07-13T03:18:55Z A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery ZHU, Feida ZHANG, Zequn QU, Qiang Despite the wealth of research on frequent graph pattern mining, how to efficiently mine the complete set of those with constraints still poses a huge challenge to the existing algorithms mainly due to the inherent bottleneck in the mining paradigm. In essence, mining requests with explicitly-specified constraints cannot be handled in a way that is direct and precise. In this paper, we propose a direct mining framework to solve the problem and illustrate our ideas in the context of a particular type of constrained frequent patterns — the “skinny” patterns, which are graph patterns with a long backbone from which short twigs branch out. These patterns, which we formally define as l-long d-skinny patterns, are able to reveal insightful spatial and temporal trajectory patterns in mobile data mining, information diffusion, adoption propagation, and many others. Based on the key concept of a canonical diameter, we develop SkinnyMine, an efficient algorithm to mine all the l-long d-skinny patterns guaranteeing both the completeness of our mining result as well as the unique generation of each target pattern. We also present a general direct mining framework together with two properties of reducibility and continuity for qualified constraints. Our experiments on both synthetic and real data demonstrate the effectiveness and scalability of our approach. 2013-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1819 info:doi/10.1145/2463676.2463723 https://ink.library.smu.edu.sg/context/sis_research/article/2818/viewcontent/C53___A_Direct_Mining_Approach_To_Efficient_Constrained_Graph_Pattern_Discovery__SIGMOD2013_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Direct mining frequent graph pattern mining constrained patternmining skinny pattern Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Direct mining
frequent graph pattern mining
constrained patternmining
skinny pattern
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Direct mining
frequent graph pattern mining
constrained patternmining
skinny pattern
Databases and Information Systems
Numerical Analysis and Scientific Computing
ZHU, Feida
ZHANG, Zequn
QU, Qiang
A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery
description Despite the wealth of research on frequent graph pattern mining, how to efficiently mine the complete set of those with constraints still poses a huge challenge to the existing algorithms mainly due to the inherent bottleneck in the mining paradigm. In essence, mining requests with explicitly-specified constraints cannot be handled in a way that is direct and precise. In this paper, we propose a direct mining framework to solve the problem and illustrate our ideas in the context of a particular type of constrained frequent patterns — the “skinny” patterns, which are graph patterns with a long backbone from which short twigs branch out. These patterns, which we formally define as l-long d-skinny patterns, are able to reveal insightful spatial and temporal trajectory patterns in mobile data mining, information diffusion, adoption propagation, and many others. Based on the key concept of a canonical diameter, we develop SkinnyMine, an efficient algorithm to mine all the l-long d-skinny patterns guaranteeing both the completeness of our mining result as well as the unique generation of each target pattern. We also present a general direct mining framework together with two properties of reducibility and continuity for qualified constraints. Our experiments on both synthetic and real data demonstrate the effectiveness and scalability of our approach.
format text
author ZHU, Feida
ZHANG, Zequn
QU, Qiang
author_facet ZHU, Feida
ZHANG, Zequn
QU, Qiang
author_sort ZHU, Feida
title A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery
title_short A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery
title_full A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery
title_fullStr A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery
title_full_unstemmed A Direct Mining Approach To Efficient Constrained Graph Pattern Discovery
title_sort direct mining approach to efficient constrained graph pattern discovery
publisher Institutional Knowledge at Singapore Management University
publishDate 2013
url https://ink.library.smu.edu.sg/sis_research/1819
https://ink.library.smu.edu.sg/context/sis_research/article/2818/viewcontent/C53___A_Direct_Mining_Approach_To_Efficient_Constrained_Graph_Pattern_Discovery__SIGMOD2013_.pdf
_version_ 1770571596817235968