Analysis of a similarity measure for non-overlapped data

© 2017 by the authors. A similarity measure is a measure evaluating the degree of similarity between two fuzzy data sets and has become an essential tool in many applications including data mining, pattern recognition, and clustering. In this paper, we propose a similarity measure capable of handlin...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sanghyuk Lee, Jaehoon Cha, Nipon Theera-Umpon, Kyeong Soo Kim
Format:	Journal
Published:	2018
Subjects:	Chemistry Computer Science Mathematics Physics and Astronomy
Online Access:	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019235664&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/56986
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Chiang Mai University

id	th-cmuir.6653943832-56986
record_format	dspace
spelling	th-cmuir.6653943832-569862018-09-05T03:53:21Z Analysis of a similarity measure for non-overlapped data Sanghyuk Lee Jaehoon Cha Nipon Theera-Umpon Kyeong Soo Kim Chemistry Computer Science Mathematics Physics and Astronomy © 2017 by the authors. A similarity measure is a measure evaluating the degree of similarity between two fuzzy data sets and has become an essential tool in many applications including data mining, pattern recognition, and clustering. In this paper, we propose a similarity measure capable of handling non-overlapped data as well as overlapped data and analyze its characteristics on data distributions. We first design the similarity measure based on a distance measure and apply it to overlapped data distributions. From the calculations for example data distributions, we find that, though the similarity calculation is effective, the designed similarity measure cannot distinguish two non-overlapped data distributions, thus resulting in the same value for both data sets. To obtain discriminative similarity values for non-overlapped data, we consider two approaches. The first one is to use a conventional similarity measure after preprocessing non-overlapped data. The second one is to take into account neighbor data information in designing the similarity measure, where we consider the relation to specific data and residual data information. Two artificial patterns of non-overlapped data are analyzed in an illustrative example. The calculation results demonstrate that the proposed similarity measures can discriminate non-overlapped data. 2018-09-05T03:33:10Z 2018-09-05T03:33:10Z 2017-05-01 Journal 20738994 2-s2.0-85019235664 10.3390/sym9050068 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019235664&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/56986
institution	Chiang Mai University
building	Chiang Mai University Library
country	Thailand
collection	CMU Intellectual Repository
topic	Chemistry Computer Science Mathematics Physics and Astronomy
spellingShingle	Chemistry Computer Science Mathematics Physics and Astronomy Sanghyuk Lee Jaehoon Cha Nipon Theera-Umpon Kyeong Soo Kim Analysis of a similarity measure for non-overlapped data
description	© 2017 by the authors. A similarity measure is a measure evaluating the degree of similarity between two fuzzy data sets and has become an essential tool in many applications including data mining, pattern recognition, and clustering. In this paper, we propose a similarity measure capable of handling non-overlapped data as well as overlapped data and analyze its characteristics on data distributions. We first design the similarity measure based on a distance measure and apply it to overlapped data distributions. From the calculations for example data distributions, we find that, though the similarity calculation is effective, the designed similarity measure cannot distinguish two non-overlapped data distributions, thus resulting in the same value for both data sets. To obtain discriminative similarity values for non-overlapped data, we consider two approaches. The first one is to use a conventional similarity measure after preprocessing non-overlapped data. The second one is to take into account neighbor data information in designing the similarity measure, where we consider the relation to specific data and residual data information. Two artificial patterns of non-overlapped data are analyzed in an illustrative example. The calculation results demonstrate that the proposed similarity measures can discriminate non-overlapped data.
format	Journal
author	Sanghyuk Lee Jaehoon Cha Nipon Theera-Umpon Kyeong Soo Kim
author_facet	Sanghyuk Lee Jaehoon Cha Nipon Theera-Umpon Kyeong Soo Kim
author_sort	Sanghyuk Lee
title	Analysis of a similarity measure for non-overlapped data
title_short	Analysis of a similarity measure for non-overlapped data
title_full	Analysis of a similarity measure for non-overlapped data
title_fullStr	Analysis of a similarity measure for non-overlapped data
title_full_unstemmed	Analysis of a similarity measure for non-overlapped data
title_sort	analysis of a similarity measure for non-overlapped data
publishDate	2018
url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019235664&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/56986
_version_	1681424794536378368

Analysis of a similarity measure for non-overlapped data

Similar Items