Towards scalable algorithm for closed itemset mining in high-dimensional data
Mining frequent itemsets from large dataset has a major drawback in which the explosive number of itemsets requires additional mining process which might filter the interesting ones. Therefore, as the solution, the concept of closed frequent itemset was introduced that is lossless and condensed repr...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English English |
Published: |
Institute of Advanced Engineering and Science
2017
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/63096/1/63096_Towards%20scalable%20algorithm%20for%20closed%20itemset%20_article.pdf http://irep.iium.edu.my/63096/2/63096_Towards%20scalable%20algorithm%20for%20closed%20itemset%20_scopus.pdf http://irep.iium.edu.my/63096/ http://www.iaescore.com/journals/index.php/IJEECS/article/view/10019/7653 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
id |
my.iium.irep.63096 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.630962018-04-18T08:05:27Z http://irep.iium.edu.my/63096/ Towards scalable algorithm for closed itemset mining in high-dimensional data Md. Zaki, Fatimah Audah Zulkurnain, Nurul Fariza TK Electrical engineering. Electronics Nuclear engineering Mining frequent itemsets from large dataset has a major drawback in which the explosive number of itemsets requires additional mining process which might filter the interesting ones. Therefore, as the solution, the concept of closed frequent itemset was introduced that is lossless and condensed representation of all the frequent itemsets and their corresponding supports. Unfortunately, many algorithms are not memory-efficient since it requires the storage of closed itemsets in main memory for duplication checks. This paper presents BFF, a scalable algorithm for discovering closed frequent itemsets from high-dimensional data. Unlike many well-known algorithms, BFF traverses the search tree in breadth-first manner resulted to a minimum use of memory and less running time. The tests conducted on a number of microarray datasets show that the performance of this algorithm improved significantly as the support threshold decreases which is crucial in generating more interesting rules. Institute of Advanced Engineering and Science 2017-11 Article REM application/pdf en http://irep.iium.edu.my/63096/1/63096_Towards%20scalable%20algorithm%20for%20closed%20itemset%20_article.pdf application/pdf en http://irep.iium.edu.my/63096/2/63096_Towards%20scalable%20algorithm%20for%20closed%20itemset%20_scopus.pdf Md. Zaki, Fatimah Audah and Zulkurnain, Nurul Fariza (2017) Towards scalable algorithm for closed itemset mining in high-dimensional data. Indonesian Journal of Electrical Engineering and Computer Science, 8 (2). pp. 487-494. http://www.iaescore.com/journals/index.php/IJEECS/article/view/10019/7653 10.11591/ijeecs.v8.i2.pp487-494 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English English |
topic |
TK Electrical engineering. Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering. Electronics Nuclear engineering Md. Zaki, Fatimah Audah Zulkurnain, Nurul Fariza Towards scalable algorithm for closed itemset mining in high-dimensional data |
description |
Mining frequent itemsets from large dataset has a major drawback in which the explosive number of itemsets requires additional mining process which might filter the interesting ones. Therefore, as the solution, the concept of closed frequent itemset was introduced that is lossless and condensed representation of all the frequent itemsets and their corresponding supports. Unfortunately, many algorithms are not memory-efficient since it requires the storage of closed itemsets in main memory for duplication checks. This paper presents BFF, a scalable algorithm for discovering closed frequent itemsets from high-dimensional data. Unlike many well-known algorithms, BFF traverses the search tree in breadth-first manner resulted to a minimum use of memory and less running time. The tests conducted on a number of microarray datasets show that the performance of this algorithm improved significantly as the support threshold decreases which is crucial in generating more interesting rules. |
format |
Article |
author |
Md. Zaki, Fatimah Audah Zulkurnain, Nurul Fariza |
author_facet |
Md. Zaki, Fatimah Audah Zulkurnain, Nurul Fariza |
author_sort |
Md. Zaki, Fatimah Audah |
title |
Towards scalable algorithm for closed itemset mining in high-dimensional data |
title_short |
Towards scalable algorithm for closed itemset mining in high-dimensional data |
title_full |
Towards scalable algorithm for closed itemset mining in high-dimensional data |
title_fullStr |
Towards scalable algorithm for closed itemset mining in high-dimensional data |
title_full_unstemmed |
Towards scalable algorithm for closed itemset mining in high-dimensional data |
title_sort |
towards scalable algorithm for closed itemset mining in high-dimensional data |
publisher |
Institute of Advanced Engineering and Science |
publishDate |
2017 |
url |
http://irep.iium.edu.my/63096/1/63096_Towards%20scalable%20algorithm%20for%20closed%20itemset%20_article.pdf http://irep.iium.edu.my/63096/2/63096_Towards%20scalable%20algorithm%20for%20closed%20itemset%20_scopus.pdf http://irep.iium.edu.my/63096/ http://www.iaescore.com/journals/index.php/IJEECS/article/view/10019/7653 |
_version_ |
1643616306808750080 |