Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning
Clustering is an unsupervised machine learning method that is used both individually and as a part of the preprocessing stage for the supervised machine learning methods. Due to its unsupervised nature, clustering results have less accuracy compared to the supervised learning. This article aims to i...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Institute of Electrical and Electronics Engineers (IEEE)
2021
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/26486/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaya |
id |
my.um.eprints.26486 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.264862022-03-08T02:51:51Z http://eprints.um.edu.my/26486/ Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning Shirkhorshidi, Ali Seyed Wah, Teh Ying Shirkhorshidi, Seyed Mohammad Reza Aghabozorgi, Saeed QA75 Electronic computers. Computer science Clustering is an unsupervised machine learning method that is used both individually and as a part of the preprocessing stage for the supervised machine learning methods. Due to its unsupervised nature, clustering results have less accuracy compared to the supervised learning. This article aims to introduce a new perspective in clustering by defining an approach for data pruning. The method also enables clustering using multiple sets of prototypes instead of only one set to improve clustering accuracy. Consequently, this approach has the potential to be used independently or as a part of a preprocessing to prepare purified data for the training step of a supervised learning approach. An evolving fuzzy clustering approach (EFCA) utilizes the fuzzy membership concept to breakdown clustering in epochs instead of running the clustering on all data at once. In some cases, for supervised learning, we rather have a smaller subset of highly accurate labeled data instead of a dataset with less accurate labels. The EFCA's ``epoch cut'' enables postpruning ability to eliminate obscure data points, which results in more clustering accuracy. The EFCA has been applied to a set of eight multivariate and ten time-series datasets, and for example, after deploying epoch cut and eliminating obscure data (20% of data) by automatic postpruning, it achieved 100% accuracy for the rest 80% Iris data. Institute of Electrical and Electronics Engineers (IEEE) 2021-03 Article PeerReviewed Shirkhorshidi, Ali Seyed and Wah, Teh Ying and Shirkhorshidi, Seyed Mohammad Reza and Aghabozorgi, Saeed (2021) Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning. IEEE Transactions on Fuzzy Systems, 29 (3). pp. 560-568. ISSN 1063-6706, DOI https://doi.org/10.1109/TFUZZ.2019.2956900 <https://doi.org/10.1109/TFUZZ.2019.2956900>. 10.1109/TFUZZ.2019.2956900 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Shirkhorshidi, Ali Seyed Wah, Teh Ying Shirkhorshidi, Seyed Mohammad Reza Aghabozorgi, Saeed Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning |
description |
Clustering is an unsupervised machine learning method that is used both individually and as a part of the preprocessing stage for the supervised machine learning methods. Due to its unsupervised nature, clustering results have less accuracy compared to the supervised learning. This article aims to introduce a new perspective in clustering by defining an approach for data pruning. The method also enables clustering using multiple sets of prototypes instead of only one set to improve clustering accuracy. Consequently, this approach has the potential to be used independently or as a part of a preprocessing to prepare purified data for the training step of a supervised learning approach. An evolving fuzzy clustering approach (EFCA) utilizes the fuzzy membership concept to breakdown clustering in epochs instead of running the clustering on all data at once. In some cases, for supervised learning, we rather have a smaller subset of highly accurate labeled data instead of a dataset with less accurate labels. The EFCA's ``epoch cut'' enables postpruning ability to eliminate obscure data points, which results in more clustering accuracy. The EFCA has been applied to a set of eight multivariate and ten time-series datasets, and for example, after deploying epoch cut and eliminating obscure data (20% of data) by automatic postpruning, it achieved 100% accuracy for the rest 80% Iris data. |
format |
Article |
author |
Shirkhorshidi, Ali Seyed Wah, Teh Ying Shirkhorshidi, Seyed Mohammad Reza Aghabozorgi, Saeed |
author_facet |
Shirkhorshidi, Ali Seyed Wah, Teh Ying Shirkhorshidi, Seyed Mohammad Reza Aghabozorgi, Saeed |
author_sort |
Shirkhorshidi, Ali Seyed |
title |
Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning |
title_short |
Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning |
title_full |
Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning |
title_fullStr |
Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning |
title_full_unstemmed |
Evolving fuzzy clustering approach: An epoch clustering that enables heuristic postpruning |
title_sort |
evolving fuzzy clustering approach: an epoch clustering that enables heuristic postpruning |
publisher |
Institute of Electrical and Electronics Engineers (IEEE) |
publishDate |
2021 |
url |
http://eprints.um.edu.my/26486/ |
_version_ |
1735409419033772032 |