Mining Colossal Frequent Patterns by Core Pattern Fusion
Extensive research for frequent-pattern mining in the past decade has brought forth a number of pattern mining algorithms that are both effective and efficient. However, the existing frequent-pattern mining algorithms encounter challenges at mining rather large patterns, called colossal frequent pat...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2007
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/1007 https://ink.library.smu.edu.sg/context/sis_research/article/2006/viewcontent/MiningColossalFrequent_Patterns_2007.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-2006 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-20062017-11-22T06:52:20Z Mining Colossal Frequent Patterns by Core Pattern Fusion ZHU, Feida YAN, Xifeng HAN, Jiawei YU, Philip S. CHENG, Hong Extensive research for frequent-pattern mining in the past decade has brought forth a number of pattern mining algorithms that are both effective and efficient. However, the existing frequent-pattern mining algorithms encounter challenges at mining rather large patterns, called colossal frequent patterns, in the presence of an explosive number of frequent patterns. Colossal patterns are critical to many applications, especially in domains like bioinformatics. In this study, we investigate a novel mining approach called Pattern-Fusion to efficiently find a good approximation to the colossal patterns. With Pattern-Fusion, a colossal pattern is discovered by fusing its small core patterns in one step, whereas the incremental pattern-growth mining strategies, such as those adopted in Apriori and FP-growth, have to examine a large number of mid-sized ones. This property distinguishes Pattern-Fusion from all the existing frequent pattern mining approaches and draws a new mining methodology. Our empirical studies show that, in cases where current mining algorithms cannot proceed, Pattern-Fusion is able to mine a result set which is a close enough approximation to the complete set of the colossal patterns, under a quality evaluation model proposed in this paper. 2007-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1007 info:doi/10.1109/ICDE.2007.367916 https://ink.library.smu.edu.sg/context/sis_research/article/2006/viewcontent/MiningColossalFrequent_Patterns_2007.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Databases and Information Systems Numerical Analysis and Scientific Computing ZHU, Feida YAN, Xifeng HAN, Jiawei YU, Philip S. CHENG, Hong Mining Colossal Frequent Patterns by Core Pattern Fusion |
description |
Extensive research for frequent-pattern mining in the past decade has brought forth a number of pattern mining algorithms that are both effective and efficient. However, the existing frequent-pattern mining algorithms encounter challenges at mining rather large patterns, called colossal frequent patterns, in the presence of an explosive number of frequent patterns. Colossal patterns are critical to many applications, especially in domains like bioinformatics. In this study, we investigate a novel mining approach called Pattern-Fusion to efficiently find a good approximation to the colossal patterns. With Pattern-Fusion, a colossal pattern is discovered by fusing its small core patterns in one step, whereas the incremental pattern-growth mining strategies, such as those adopted in Apriori and FP-growth, have to examine a large number of mid-sized ones. This property distinguishes Pattern-Fusion from all the existing frequent pattern mining approaches and draws a new mining methodology. Our empirical studies show that, in cases where current mining algorithms cannot proceed, Pattern-Fusion is able to mine a result set which is a close enough approximation to the complete set of the colossal patterns, under a quality evaluation model proposed in this paper. |
format |
text |
author |
ZHU, Feida YAN, Xifeng HAN, Jiawei YU, Philip S. CHENG, Hong |
author_facet |
ZHU, Feida YAN, Xifeng HAN, Jiawei YU, Philip S. CHENG, Hong |
author_sort |
ZHU, Feida |
title |
Mining Colossal Frequent Patterns by Core Pattern Fusion |
title_short |
Mining Colossal Frequent Patterns by Core Pattern Fusion |
title_full |
Mining Colossal Frequent Patterns by Core Pattern Fusion |
title_fullStr |
Mining Colossal Frequent Patterns by Core Pattern Fusion |
title_full_unstemmed |
Mining Colossal Frequent Patterns by Core Pattern Fusion |
title_sort |
mining colossal frequent patterns by core pattern fusion |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2007 |
url |
https://ink.library.smu.edu.sg/sis_research/1007 https://ink.library.smu.edu.sg/context/sis_research/article/2006/viewcontent/MiningColossalFrequent_Patterns_2007.pdf |
_version_ |
1770570821005213696 |