Mining Colossal Frequent Patterns by Core Pattern Fusion

Extensive research for frequent-pattern mining in the past decade has brought forth a number of pattern mining algorithms that are both effective and efficient. However, the existing frequent-pattern mining algorithms encounter challenges at mining rather large patterns, called colossal frequent pat...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHU, Feida, YAN, Xifeng, HAN, Jiawei, YU, Philip S., CHENG, Hong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1007
https://ink.library.smu.edu.sg/context/sis_research/article/2006/viewcontent/MiningColossalFrequent_Patterns_2007.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2006
record_format dspace
spelling sg-smu-ink.sis_research-20062017-11-22T06:52:20Z Mining Colossal Frequent Patterns by Core Pattern Fusion ZHU, Feida YAN, Xifeng HAN, Jiawei YU, Philip S. CHENG, Hong Extensive research for frequent-pattern mining in the past decade has brought forth a number of pattern mining algorithms that are both effective and efficient. However, the existing frequent-pattern mining algorithms encounter challenges at mining rather large patterns, called colossal frequent patterns, in the presence of an explosive number of frequent patterns. Colossal patterns are critical to many applications, especially in domains like bioinformatics. In this study, we investigate a novel mining approach called Pattern-Fusion to efficiently find a good approximation to the colossal patterns. With Pattern-Fusion, a colossal pattern is discovered by fusing its small core patterns in one step, whereas the incremental pattern-growth mining strategies, such as those adopted in Apriori and FP-growth, have to examine a large number of mid-sized ones. This property distinguishes Pattern-Fusion from all the existing frequent pattern mining approaches and draws a new mining methodology. Our empirical studies show that, in cases where current mining algorithms cannot proceed, Pattern-Fusion is able to mine a result set which is a close enough approximation to the complete set of the colossal patterns, under a quality evaluation model proposed in this paper. 2007-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1007 info:doi/10.1109/ICDE.2007.367916 https://ink.library.smu.edu.sg/context/sis_research/article/2006/viewcontent/MiningColossalFrequent_Patterns_2007.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
ZHU, Feida
YAN, Xifeng
HAN, Jiawei
YU, Philip S.
CHENG, Hong
Mining Colossal Frequent Patterns by Core Pattern Fusion
description Extensive research for frequent-pattern mining in the past decade has brought forth a number of pattern mining algorithms that are both effective and efficient. However, the existing frequent-pattern mining algorithms encounter challenges at mining rather large patterns, called colossal frequent patterns, in the presence of an explosive number of frequent patterns. Colossal patterns are critical to many applications, especially in domains like bioinformatics. In this study, we investigate a novel mining approach called Pattern-Fusion to efficiently find a good approximation to the colossal patterns. With Pattern-Fusion, a colossal pattern is discovered by fusing its small core patterns in one step, whereas the incremental pattern-growth mining strategies, such as those adopted in Apriori and FP-growth, have to examine a large number of mid-sized ones. This property distinguishes Pattern-Fusion from all the existing frequent pattern mining approaches and draws a new mining methodology. Our empirical studies show that, in cases where current mining algorithms cannot proceed, Pattern-Fusion is able to mine a result set which is a close enough approximation to the complete set of the colossal patterns, under a quality evaluation model proposed in this paper.
format text
author ZHU, Feida
YAN, Xifeng
HAN, Jiawei
YU, Philip S.
CHENG, Hong
author_facet ZHU, Feida
YAN, Xifeng
HAN, Jiawei
YU, Philip S.
CHENG, Hong
author_sort ZHU, Feida
title Mining Colossal Frequent Patterns by Core Pattern Fusion
title_short Mining Colossal Frequent Patterns by Core Pattern Fusion
title_full Mining Colossal Frequent Patterns by Core Pattern Fusion
title_fullStr Mining Colossal Frequent Patterns by Core Pattern Fusion
title_full_unstemmed Mining Colossal Frequent Patterns by Core Pattern Fusion
title_sort mining colossal frequent patterns by core pattern fusion
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/1007
https://ink.library.smu.edu.sg/context/sis_research/article/2006/viewcontent/MiningColossalFrequent_Patterns_2007.pdf
_version_ 1770570821005213696