An improved associative classification model using fuzzy parameterized soft set-based decision for text classification
Text classification is applicable in various problem domains, including marketing, security, and biomedical. One of the potential text classifiers is the well-known associative classification approach. However, the existing associative classification approach is still prone to some limitations es...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English English English |
Published: |
2023
|
Subjects: | |
Online Access: | http://eprints.uthm.edu.my/11033/1/24p%20DEDE%20ROHIDIN.pdf http://eprints.uthm.edu.my/11033/2/DEDE%20ROHIDIN%20COPYRIGHT%20DECLARATION.pdf http://eprints.uthm.edu.my/11033/3/DEDE%20ROHIDIN%20WATERMARK.pdf http://eprints.uthm.edu.my/11033/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Tun Hussein Onn Malaysia |
Language: | English English English |
id |
my.uthm.eprints.11033 |
---|---|
record_format |
eprints |
spelling |
my.uthm.eprints.110332024-05-29T02:25:28Z http://eprints.uthm.edu.my/11033/ An improved associative classification model using fuzzy parameterized soft set-based decision for text classification Rohidin, Dede T Technology (General) Text classification is applicable in various problem domains, including marketing, security, and biomedical. One of the potential text classifiers is the well-known associative classification approach. However, the existing associative classification approach is still prone to some limitations especially when dealing with the problem with too many rules in text classification problem. Some of the rules generated from the textual data may be irrelevant and redundant, result in low performance in imbalanced and class overlapping data. Therefore, this research has proposed an improved associative classification approach to enhance the performance and efficiency of the text classification by removing the irrelevant rules, reducing redundant rules, and handling the imbalanced and class overlapping issues in the textual data. The proposed associative classification approach consists of three stages: pre-processing, fuzzification and classification. In the classification stage primarily, this study proposed to integrating principles of fuzzy soft set theory into associative rules, therefore referred to as Class-Based Fuzzy Soft Associative (CBFSA) method. The experiments used 20 Newsgroup (balanced data) datasets and Reuter-25178 (imbalanced) to evaluate the proposed model. It shows that CBFSA is successful in removing irrelevant and reducing redundant rules. The CBFSA classifier applies smaller number of rules than Class Based Associative (CBA) and Class Based of Predictive Association Rule (CPAR). The CBFSA is also successful in dealing with imbalanced and class overlap data. The CBFSA performance is higher and faster than CBA and CPAR. Meanwhile, comparative analysis with some other non-associative based classifiers may achieve improved f1-measure between 6% to 32%. The processing time of CBFSA is faster than RNN and CNN but slightly slower than Decision Tree, k-NN, Naïve Bayes, Roccio, Bagging and Boosting 2023-03 Thesis NonPeerReviewed text en http://eprints.uthm.edu.my/11033/1/24p%20DEDE%20ROHIDIN.pdf text en http://eprints.uthm.edu.my/11033/2/DEDE%20ROHIDIN%20COPYRIGHT%20DECLARATION.pdf text en http://eprints.uthm.edu.my/11033/3/DEDE%20ROHIDIN%20WATERMARK.pdf Rohidin, Dede (2023) An improved associative classification model using fuzzy parameterized soft set-based decision for text classification. Doctoral thesis, Universiti Tun Hussein Onn Malaysia. |
institution |
Universiti Tun Hussein Onn Malaysia |
building |
UTHM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tun Hussein Onn Malaysia |
content_source |
UTHM Institutional Repository |
url_provider |
http://eprints.uthm.edu.my/ |
language |
English English English |
topic |
T Technology (General) |
spellingShingle |
T Technology (General) Rohidin, Dede An improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
description |
Text classification is applicable in various problem domains, including marketing,
security, and biomedical. One of the potential text classifiers is the well-known
associative classification approach. However, the existing associative classification
approach is still prone to some limitations especially when dealing with the problem
with too many rules in text classification problem. Some of the rules generated from
the textual data may be irrelevant and redundant, result in low performance in
imbalanced and class overlapping data. Therefore, this research has proposed an
improved associative classification approach to enhance the performance and
efficiency of the text classification by removing the irrelevant rules, reducing
redundant rules, and handling the imbalanced and class overlapping issues in the
textual data. The proposed associative classification approach consists of three stages:
pre-processing, fuzzification and classification. In the classification stage primarily,
this study proposed to integrating principles of fuzzy soft set theory into associative
rules, therefore referred to as Class-Based Fuzzy Soft Associative (CBFSA) method.
The experiments used 20 Newsgroup (balanced data) datasets and Reuter-25178
(imbalanced) to evaluate the proposed model. It shows that CBFSA is successful in
removing irrelevant and reducing redundant rules. The CBFSA classifier applies
smaller number of rules than Class Based Associative (CBA) and Class Based of
Predictive Association Rule (CPAR). The CBFSA is also successful in dealing with
imbalanced and class overlap data. The CBFSA performance is higher and faster than
CBA and CPAR. Meanwhile, comparative analysis with some other non-associative
based classifiers may achieve improved f1-measure between 6% to 32%. The
processing time of CBFSA is faster than RNN and CNN but slightly slower than
Decision Tree, k-NN, Naïve Bayes, Roccio, Bagging and Boosting |
format |
Thesis |
author |
Rohidin, Dede |
author_facet |
Rohidin, Dede |
author_sort |
Rohidin, Dede |
title |
An improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
title_short |
An improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
title_full |
An improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
title_fullStr |
An improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
title_full_unstemmed |
An improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
title_sort |
improved associative classification model using fuzzy parameterized soft set-based decision for text classification |
publishDate |
2023 |
url |
http://eprints.uthm.edu.my/11033/1/24p%20DEDE%20ROHIDIN.pdf http://eprints.uthm.edu.my/11033/2/DEDE%20ROHIDIN%20COPYRIGHT%20DECLARATION.pdf http://eprints.uthm.edu.my/11033/3/DEDE%20ROHIDIN%20WATERMARK.pdf http://eprints.uthm.edu.my/11033/ |
_version_ |
1800721942481731584 |