A string grammar fuzzy-possibilistic C-medians

© 2017 Elsevier B.V. In some pattern recognition problems, the syntactic or structural information that describes each pattern is important. A syntactic pattern can be described using string grammar. There is only a handful of research works involving with the string grammar clustering. The string g...

Full description

Saved in:
Bibliographic Details
Main Authors: Atcharin Klomsae, Sansanee Auephanwiriyakul, Nipon Theera-Umpon
Format: Journal
Published: 2018
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019580833&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/46649
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-46649
record_format dspace
spelling th-cmuir.6653943832-466492018-04-25T07:21:51Z A string grammar fuzzy-possibilistic C-medians Atcharin Klomsae Sansanee Auephanwiriyakul Nipon Theera-Umpon Agricultural and Biological Sciences © 2017 Elsevier B.V. In some pattern recognition problems, the syntactic or structural information that describes each pattern is important. A syntactic pattern can be described using string grammar. There is only a handful of research works involving with the string grammar clustering. The string grammar hard C-means (sgHCM) is one of the most well-known clustering algorithms in syntactic pattern recognition. Since, it has been proved that fuzzy clustering is better than a hard clustering, a string grammar fuzzy C-medians (sgFCMed) algorithm to improve the sgHCM was previously proposed. However, the sgFCMed may not provide a good clustering result for any application with an overlapping data. Thus, in this paper, a string grammar fuzzy-possibilistic C-medians (sgFPCMed) algorithm is introduced to cope with the overlapping data problem. The proposed algorithm is implemented on four real overlapping data sets, i.e., MPEG-7 data set, Copenhagen chromosomes data set, MNIST database of ha ndwritten digits, and USPS database of handwritten digits. The proposed sgFPCMed results are compared with the results from the sgHCM and the sgFCMed. The results show that the proposed sgFPCMed is better than both. The proposed sgFPCMed algorithm results are directly and indirectly compared with the results from other syntactic or numeric methods The proposed sgFPCMed is better than some approaches and comparable to some of the methods However, since the proposed sgFPCMed is a string grammar clustering, it is easier to transform each prototype string back into the original form of data set. 2018-04-25T06:58:57Z 2018-04-25T06:58:57Z 2017-08-01 Journal 15684946 2-s2.0-85019580833 10.1016/j.asoc.2017.04.037 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019580833&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/46649
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Agricultural and Biological Sciences
spellingShingle Agricultural and Biological Sciences
Atcharin Klomsae
Sansanee Auephanwiriyakul
Nipon Theera-Umpon
A string grammar fuzzy-possibilistic C-medians
description © 2017 Elsevier B.V. In some pattern recognition problems, the syntactic or structural information that describes each pattern is important. A syntactic pattern can be described using string grammar. There is only a handful of research works involving with the string grammar clustering. The string grammar hard C-means (sgHCM) is one of the most well-known clustering algorithms in syntactic pattern recognition. Since, it has been proved that fuzzy clustering is better than a hard clustering, a string grammar fuzzy C-medians (sgFCMed) algorithm to improve the sgHCM was previously proposed. However, the sgFCMed may not provide a good clustering result for any application with an overlapping data. Thus, in this paper, a string grammar fuzzy-possibilistic C-medians (sgFPCMed) algorithm is introduced to cope with the overlapping data problem. The proposed algorithm is implemented on four real overlapping data sets, i.e., MPEG-7 data set, Copenhagen chromosomes data set, MNIST database of ha ndwritten digits, and USPS database of handwritten digits. The proposed sgFPCMed results are compared with the results from the sgHCM and the sgFCMed. The results show that the proposed sgFPCMed is better than both. The proposed sgFPCMed algorithm results are directly and indirectly compared with the results from other syntactic or numeric methods The proposed sgFPCMed is better than some approaches and comparable to some of the methods However, since the proposed sgFPCMed is a string grammar clustering, it is easier to transform each prototype string back into the original form of data set.
format Journal
author Atcharin Klomsae
Sansanee Auephanwiriyakul
Nipon Theera-Umpon
author_facet Atcharin Klomsae
Sansanee Auephanwiriyakul
Nipon Theera-Umpon
author_sort Atcharin Klomsae
title A string grammar fuzzy-possibilistic C-medians
title_short A string grammar fuzzy-possibilistic C-medians
title_full A string grammar fuzzy-possibilistic C-medians
title_fullStr A string grammar fuzzy-possibilistic C-medians
title_full_unstemmed A string grammar fuzzy-possibilistic C-medians
title_sort string grammar fuzzy-possibilistic c-medians
publishDate 2018
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019580833&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/46649
_version_ 1681422913625915392