Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data

Recently, frequent itemsets mining over data streams attracted much attention. However, mining closed itemsets from data stream has not been well addressed. The main difficulty lies in its high complexity of maintenance aroused by the model definition of closed itemsets and the dynamic changing of...

Full description

Saved in:
Bibliographic Details
Main Authors: SONG, Guojie, YANG, Dongqing, Cui, Bin, ZHENG, Baihua, WANG, Tengjiao
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/386
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1385
record_format dspace
spelling sg-smu-ink.sis_research-13852010-09-24T06:36:22Z Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data SONG, Guojie YANG, Dongqing Cui, Bin ZHENG, Baihua WANG, Tengjiao Recently, frequent itemsets mining over data streams attracted much attention. However, mining closed itemsets from data stream has not been well addressed. The main difficulty lies in its high complexity of maintenance aroused by the model definition of closed itemsets and the dynamic changing of data streams. In data stream scenario, it is sufficient to mining only approximated frequent closed itemsets instead of in full precision. Such a compact but close-enough frequent itemset is called a relaxed frequent closed itemsets. In this paper, we first introduce the concept of (Relaxed frequent Closed Itemsets), which is the generalized form of approximation. We also propose a novel mechanism CLAIM, which stands for CLosed Approximated Itemset Mining, to support efficiently mining of . The CLAIM adopts bipartite graph model to store frequent closed itemsets, use Bloom filter based hash function to speed up the update of drifted itemsets, and build a compact HR-tree structure to efficiently maintain the s and support mining process. An experimental study is conducted, and the results demonstrate the effectiveness and efficiency of our approach at handling frequent closed itemsets mining for data stream. This work is supported by the National Natural Science Foundation of China under Grant No. 60473051 and No.60642004 and HP and IBM Joint Research Project. 2007-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/386 info:doi/10.1007/978-3-540-71703-4_56 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computer Sciences
spellingShingle Computer Sciences
SONG, Guojie
YANG, Dongqing
Cui, Bin
ZHENG, Baihua
WANG, Tengjiao
Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
description Recently, frequent itemsets mining over data streams attracted much attention. However, mining closed itemsets from data stream has not been well addressed. The main difficulty lies in its high complexity of maintenance aroused by the model definition of closed itemsets and the dynamic changing of data streams. In data stream scenario, it is sufficient to mining only approximated frequent closed itemsets instead of in full precision. Such a compact but close-enough frequent itemset is called a relaxed frequent closed itemsets. In this paper, we first introduce the concept of (Relaxed frequent Closed Itemsets), which is the generalized form of approximation. We also propose a novel mechanism CLAIM, which stands for CLosed Approximated Itemset Mining, to support efficiently mining of . The CLAIM adopts bipartite graph model to store frequent closed itemsets, use Bloom filter based hash function to speed up the update of drifted itemsets, and build a compact HR-tree structure to efficiently maintain the s and support mining process. An experimental study is conducted, and the results demonstrate the effectiveness and efficiency of our approach at handling frequent closed itemsets mining for data stream. This work is supported by the National Natural Science Foundation of China under Grant No. 60473051 and No.60642004 and HP and IBM Joint Research Project.
format text
author SONG, Guojie
YANG, Dongqing
Cui, Bin
ZHENG, Baihua
WANG, Tengjiao
author_facet SONG, Guojie
YANG, Dongqing
Cui, Bin
ZHENG, Baihua
WANG, Tengjiao
author_sort SONG, Guojie
title Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
title_short Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
title_full Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
title_fullStr Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
title_full_unstemmed Claim: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
title_sort claim: an efficient method for relaxed frequent closed itemsets mining over stream data
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/386
_version_ 1770570405825740800