Mining globally distributed frequent subgraphs in a single labeled graph

Recent years have observed increasing efforts on graph mining and many algorithms have been developed for this purpose. However, most of the existing algorithms are designed for discovering frequent subgraphs in a set of labeled graphs only. Also, the few algorithms that find frequent subgraphs in a...

Full description

Saved in:
Bibliographic Details
Main Authors: JIANG, Xing, XIONG, Hui, WANG, Chen, TAN, Ah-hwee
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2009
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5196
https://ink.library.smu.edu.sg/context/sis_research/article/6199/viewcontent/1_s2.0_S0169023X09000585_main.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6199
record_format dspace
spelling sg-smu-ink.sis_research-61992020-07-23T18:47:02Z Mining globally distributed frequent subgraphs in a single labeled graph JIANG, Xing XIONG, Hui WANG, Chen TAN, Ah-hwee Recent years have observed increasing efforts on graph mining and many algorithms have been developed for this purpose. However, most of the existing algorithms are designed for discovering frequent subgraphs in a set of labeled graphs only. Also, the few algorithms that find frequent subgraphs in a single labeled graph typically identify subgraphs appearing regionally in the input graph. In contrast, for real-world applications, it is commonly required that the identified frequent subgraphs in a single labeled graph should also be globally distributed. This paper thus fills this crucial void by proposing a new measure, termed G-Measure, to find globally distributed frequent subgraphs, called G-Patterns, in a single labeled graph. Specifically, we first show that the G-Patterns, selected by G-Measure, tend to be globally distributed in the input graph. Then, we present that G-Measure has the downward closure property, which guarantees the G-Measure value of a G-Pattern is not less than those of its supersets. Consequently, a G-Miner algorithm is developed for finding G-Patterns. Experimental results on four synthetic and seven real-world data sets and comparison with the existing algorithms demonstrate the efficacy of the G-Measure and the G-Miner for finding G-Patterns. Finally, an application of the G-Patterns is given 2009-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5196 info:doi/10.1016/j.datak.2009.04.008 https://ink.library.smu.edu.sg/context/sis_research/article/6199/viewcontent/1_s2.0_S0169023X09000585_main.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Frequent subgraph mining G-Measure G-Pattern Computer Engineering Databases and Information Systems Data Storage Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Frequent subgraph mining
G-Measure
G-Pattern
Computer Engineering
Databases and Information Systems
Data Storage Systems
spellingShingle Frequent subgraph mining
G-Measure
G-Pattern
Computer Engineering
Databases and Information Systems
Data Storage Systems
JIANG, Xing
XIONG, Hui
WANG, Chen
TAN, Ah-hwee
Mining globally distributed frequent subgraphs in a single labeled graph
description Recent years have observed increasing efforts on graph mining and many algorithms have been developed for this purpose. However, most of the existing algorithms are designed for discovering frequent subgraphs in a set of labeled graphs only. Also, the few algorithms that find frequent subgraphs in a single labeled graph typically identify subgraphs appearing regionally in the input graph. In contrast, for real-world applications, it is commonly required that the identified frequent subgraphs in a single labeled graph should also be globally distributed. This paper thus fills this crucial void by proposing a new measure, termed G-Measure, to find globally distributed frequent subgraphs, called G-Patterns, in a single labeled graph. Specifically, we first show that the G-Patterns, selected by G-Measure, tend to be globally distributed in the input graph. Then, we present that G-Measure has the downward closure property, which guarantees the G-Measure value of a G-Pattern is not less than those of its supersets. Consequently, a G-Miner algorithm is developed for finding G-Patterns. Experimental results on four synthetic and seven real-world data sets and comparison with the existing algorithms demonstrate the efficacy of the G-Measure and the G-Miner for finding G-Patterns. Finally, an application of the G-Patterns is given
format text
author JIANG, Xing
XIONG, Hui
WANG, Chen
TAN, Ah-hwee
author_facet JIANG, Xing
XIONG, Hui
WANG, Chen
TAN, Ah-hwee
author_sort JIANG, Xing
title Mining globally distributed frequent subgraphs in a single labeled graph
title_short Mining globally distributed frequent subgraphs in a single labeled graph
title_full Mining globally distributed frequent subgraphs in a single labeled graph
title_fullStr Mining globally distributed frequent subgraphs in a single labeled graph
title_full_unstemmed Mining globally distributed frequent subgraphs in a single labeled graph
title_sort mining globally distributed frequent subgraphs in a single labeled graph
publisher Institutional Knowledge at Singapore Management University
publishDate 2009
url https://ink.library.smu.edu.sg/sis_research/5196
https://ink.library.smu.edu.sg/context/sis_research/article/6199/viewcontent/1_s2.0_S0169023X09000585_main.pdf
_version_ 1770575327838339072