A modified multi-class association rule for text mining

Classification and association rule mining are significant tasks in data mining. Integrating association rule discovery and classification in data mining brings us an approach known as the associative classification. One common shortcoming of existing Association Classifiers is the huge number of ru...

Full description

Saved in:
Bibliographic Details
Main Author: Al-Refai, Mohammad Hayel Abdel Karim
Format: Thesis
Language:English
English
Published: 2015
Subjects:
Online Access:http://etd.uum.edu.my/5767/1/depositpermission_s91487.pdf
http://etd.uum.edu.my/5767/2/s91487_01.pdf
http://etd.uum.edu.my/5767/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Utara Malaysia
Language: English
English
id my.uum.etd.5767
record_format eprints
spelling my.uum.etd.57672016-07-20T10:01:45Z http://etd.uum.edu.my/5767/ A modified multi-class association rule for text mining Al-Refai, Mohammad Hayel Abdel Karim QA71-90 Instruments and machines Classification and association rule mining are significant tasks in data mining. Integrating association rule discovery and classification in data mining brings us an approach known as the associative classification. One common shortcoming of existing Association Classifiers is the huge number of rules produced in order to obtain high classification accuracy. This study proposes s a Modified Multi-class Association Rule Mining (mMCAR) that consists of three procedures; rule discovery, rule pruning and group-based class assignment. The rule discovery and rule pruning procedures are designed to reduce the number of classification rules. On the other hand, the group-based class assignment procedure contributes in improving the classification accuracy. Experiments on the structured and unstructured text datasets obtained from the UCI and Reuters repositories are performed in order to evaluate the proposed Association Classifier. The proposed mMCAR classifier is benchmarked against the traditional classifiers and existing Association Classifiers. Experimental results indicate that the proposed Association Classifier, mMCAR, produced high accuracy with a smaller number of classification rules. For the structured dataset, the mMCAR produces an average of 84.24% accuracy as compared to MCAR that obtains 84.23%. Even though the classification accuracy difference is small, the proposed mMCAR uses only 50 rules for the classification while its benchmark method involves 60 rules. On the other hand, mMCAR is at par with MCAR when unstructured dataset is utilized. Both classifiers produce 89% accuracy but mMCAR uses less number of rules for the classification. This study contributes to the text mining domain as automatic classification of huge and widely distributed textual data could facilitate the text representation and retrieval processes. 2015 Thesis NonPeerReviewed text en http://etd.uum.edu.my/5767/1/depositpermission_s91487.pdf text en http://etd.uum.edu.my/5767/2/s91487_01.pdf Al-Refai, Mohammad Hayel Abdel Karim (2015) A modified multi-class association rule for text mining. PhD. thesis, Universiti Utara Malaysia.
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Electronic Theses
url_provider http://etd.uum.edu.my/
language English
English
topic QA71-90 Instruments and machines
spellingShingle QA71-90 Instruments and machines
Al-Refai, Mohammad Hayel Abdel Karim
A modified multi-class association rule for text mining
description Classification and association rule mining are significant tasks in data mining. Integrating association rule discovery and classification in data mining brings us an approach known as the associative classification. One common shortcoming of existing Association Classifiers is the huge number of rules produced in order to obtain high classification accuracy. This study proposes s a Modified Multi-class Association Rule Mining (mMCAR) that consists of three procedures; rule discovery, rule pruning and group-based class assignment. The rule discovery and rule pruning procedures are designed to reduce the number of classification rules. On the other hand, the group-based class assignment procedure contributes in improving the classification accuracy. Experiments on the structured and unstructured text datasets obtained from the UCI and Reuters repositories are performed in order to evaluate the proposed Association Classifier. The proposed mMCAR classifier is benchmarked against the traditional classifiers and existing Association Classifiers. Experimental results indicate that the proposed Association Classifier, mMCAR, produced high accuracy with a smaller number of classification rules. For the structured dataset, the mMCAR produces an average of 84.24% accuracy as compared to MCAR that obtains 84.23%. Even though the classification accuracy difference is small, the proposed mMCAR uses only 50 rules for the classification while its benchmark method involves 60 rules. On the other hand, mMCAR is at par with MCAR when unstructured dataset is utilized. Both classifiers produce 89% accuracy but mMCAR uses less number of rules for the classification. This study contributes to the text mining domain as automatic classification of huge and widely distributed textual data could facilitate the text representation and retrieval processes.
format Thesis
author Al-Refai, Mohammad Hayel Abdel Karim
author_facet Al-Refai, Mohammad Hayel Abdel Karim
author_sort Al-Refai, Mohammad Hayel Abdel Karim
title A modified multi-class association rule for text mining
title_short A modified multi-class association rule for text mining
title_full A modified multi-class association rule for text mining
title_fullStr A modified multi-class association rule for text mining
title_full_unstemmed A modified multi-class association rule for text mining
title_sort modified multi-class association rule for text mining
publishDate 2015
url http://etd.uum.edu.my/5767/1/depositpermission_s91487.pdf
http://etd.uum.edu.my/5767/2/s91487_01.pdf
http://etd.uum.edu.my/5767/
_version_ 1644277427901300736