A novel association rule mining approach using TID intermediate itemset

Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output...

Full description

Saved in:
Bibliographic Details
Main Authors: Aqra, Iyad, Herawan, Tutut, Norjihan, Abdul Ghani, Akhunzada, Adnan, Ali, Akhtar, Ramdan, Razali, Ilahi, Manzoor, Choo, Kim-Kwang Raymond
Format: Article
Language:English
Published: Public Library of Science 2018
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/25213/1/A%20novel%20association%20rule%20mining%20approach.pdf
http://umpir.ump.edu.my/id/eprint/25213/
https://doi.org/10.1371/journal.pone.0179703
https://doi.org/10.1371/journal.pone.0179703
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang
Language: English
Description
Summary:Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets.