A string grammar fuzzy-possibilistic C-medians

© 2017 Elsevier B.V. In some pattern recognition problems, the syntactic or structural information that describes each pattern is important. A syntactic pattern can be described using string grammar. There is only a handful of research works involving with the string grammar clustering. The string g...

Full description

Saved in:
Bibliographic Details
Main Authors: Klomsae A., Auephanwiriyakul S., Theera-Umpon N.
Format: Journal
Published: 2017
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019580833&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/40201
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-40201
record_format dspace
spelling th-cmuir.6653943832-402012017-09-28T04:08:17Z A string grammar fuzzy-possibilistic C-medians Klomsae A. Auephanwiriyakul S. Theera-Umpon N. © 2017 Elsevier B.V. In some pattern recognition problems, the syntactic or structural information that describes each pattern is important. A syntactic pattern can be described using string grammar. There is only a handful of research works involving with the string grammar clustering. The string grammar hard C-means (sgHCM) is one of the most well-known clustering algorithms in syntactic pattern recognition. Since, it has been proved that fuzzy clustering is better than a hard clustering, a string grammar fuzzy C-medians (sgFCMed) algorithm to improve the sgHCM was previously proposed. However, the sgFCMed may not provide a good clustering result for any application with an overlapping data. Thus, in this paper, a string grammar fuzzy-possibilistic C-medians (sgFPCMed) algorithm is introduced to cope with the overlapping data problem. The proposed algorithm is implemented on four real overlapping data sets, i.e., MPEG-7 data set, Copenhagen chromosomes data set, MNIST database of handwritten digits, and USPS database of handwritten digits. The proposed sgFPCMed results are compared with the results from the sgHCM and the sgFCMed. The results show that the proposed sgFPCMed is better than both. The proposed sgFPCMed algorithm results are directly and indirectly compared with the results from other syntactic or numeric methods The proposed sgFPCMed is better than some approaches and comparable to some of the methods However, since the proposed sgFPCMed is a string grammar clustering, it is easier to transform each prototype string back into the original form of data set. 2017-09-28T04:08:17Z 2017-09-28T04:08:17Z Journal 15684946 2-s2.0-85019580833 10.1016/j.asoc.2017.04.037 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019580833&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/40201
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
description © 2017 Elsevier B.V. In some pattern recognition problems, the syntactic or structural information that describes each pattern is important. A syntactic pattern can be described using string grammar. There is only a handful of research works involving with the string grammar clustering. The string grammar hard C-means (sgHCM) is one of the most well-known clustering algorithms in syntactic pattern recognition. Since, it has been proved that fuzzy clustering is better than a hard clustering, a string grammar fuzzy C-medians (sgFCMed) algorithm to improve the sgHCM was previously proposed. However, the sgFCMed may not provide a good clustering result for any application with an overlapping data. Thus, in this paper, a string grammar fuzzy-possibilistic C-medians (sgFPCMed) algorithm is introduced to cope with the overlapping data problem. The proposed algorithm is implemented on four real overlapping data sets, i.e., MPEG-7 data set, Copenhagen chromosomes data set, MNIST database of handwritten digits, and USPS database of handwritten digits. The proposed sgFPCMed results are compared with the results from the sgHCM and the sgFCMed. The results show that the proposed sgFPCMed is better than both. The proposed sgFPCMed algorithm results are directly and indirectly compared with the results from other syntactic or numeric methods The proposed sgFPCMed is better than some approaches and comparable to some of the methods However, since the proposed sgFPCMed is a string grammar clustering, it is easier to transform each prototype string back into the original form of data set.
format Journal
author Klomsae A.
Auephanwiriyakul S.
Theera-Umpon N.
spellingShingle Klomsae A.
Auephanwiriyakul S.
Theera-Umpon N.
A string grammar fuzzy-possibilistic C-medians
author_facet Klomsae A.
Auephanwiriyakul S.
Theera-Umpon N.
author_sort Klomsae A.
title A string grammar fuzzy-possibilistic C-medians
title_short A string grammar fuzzy-possibilistic C-medians
title_full A string grammar fuzzy-possibilistic C-medians
title_fullStr A string grammar fuzzy-possibilistic C-medians
title_full_unstemmed A string grammar fuzzy-possibilistic C-medians
title_sort string grammar fuzzy-possibilistic c-medians
publishDate 2017
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019580833&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/40201
_version_ 1681421766178635776