A systematic density-based clustering method using anchor points

Clustering is an important unsupervised learning method in machine learning and data mining. Many existing clustering methods may still face the challenge in self-identifying clusters with varying shapes, sizes and densities. To devise a more generic clustering method that considers all the aforemen...

Full description

Saved in:
Bibliographic Details
Main Authors: WANG, Yizhang, WANG, Di, PANG, Wei, TAN, Ah-hwee, ZHOU, You
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5183
https://ink.library.smu.edu.sg/context/sis_research/article/6186/viewcontent/systematic_density_based_clustering_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6186
record_format dspace
spelling sg-smu-ink.sis_research-61862020-07-22T05:05:04Z A systematic density-based clustering method using anchor points WANG, Yizhang WANG, Di PANG, Wei TAN, Ah-hwee ZHOU, You Clustering is an important unsupervised learning method in machine learning and data mining. Many existing clustering methods may still face the challenge in self-identifying clusters with varying shapes, sizes and densities. To devise a more generic clustering method that considers all the aforementioned properties of the natural clusters, we propose a novel clustering algorithm named Anchor Points based Clustering (APC). The anchor points in APC are characterized by having a relatively large distance from data points with higher densities. We take anchor points as centers to obtain intermediate clusters, which can divide the whole dataset more appropriately so as to better facilitate further grouping. In essence, based on the analysis of the identified anchor points, the relationship among the corresponding intermediate clusters can be better revealed. In short, the difference in local densities (densities within neighboring data points) of the anchor points characterizes their different properties, that is to say, all the intermediate clusters may fall into one or multiple identified levels with different densities. Finally, based on the properties of anchor points, APC spontaneously chooses the appropriate clustering strategies and reports the final clustering results. To evaluate the performances of APC, we conduct experiments on twelve two-dimensional synthetic datasets and twelve multi-dimensional real-world datasets. Moreover, we also apply APC to the Olivetti Face dataset to further assess its effectiveness in terms of face recognition. All experimental results indicate that APC outperforms four classical methods and two state-of-the-art methods in most cases. 2020-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5183 info:doi/10.1016/j.neucom.2020.02.119 https://ink.library.smu.edu.sg/context/sis_research/article/6186/viewcontent/systematic_density_based_clustering_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Density based clustering Anchor data points Local density analysis Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Density based clustering
Anchor data points
Local density analysis
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Density based clustering
Anchor data points
Local density analysis
Databases and Information Systems
Numerical Analysis and Scientific Computing
WANG, Yizhang
WANG, Di
PANG, Wei
TAN, Ah-hwee
ZHOU, You
A systematic density-based clustering method using anchor points
description Clustering is an important unsupervised learning method in machine learning and data mining. Many existing clustering methods may still face the challenge in self-identifying clusters with varying shapes, sizes and densities. To devise a more generic clustering method that considers all the aforementioned properties of the natural clusters, we propose a novel clustering algorithm named Anchor Points based Clustering (APC). The anchor points in APC are characterized by having a relatively large distance from data points with higher densities. We take anchor points as centers to obtain intermediate clusters, which can divide the whole dataset more appropriately so as to better facilitate further grouping. In essence, based on the analysis of the identified anchor points, the relationship among the corresponding intermediate clusters can be better revealed. In short, the difference in local densities (densities within neighboring data points) of the anchor points characterizes their different properties, that is to say, all the intermediate clusters may fall into one or multiple identified levels with different densities. Finally, based on the properties of anchor points, APC spontaneously chooses the appropriate clustering strategies and reports the final clustering results. To evaluate the performances of APC, we conduct experiments on twelve two-dimensional synthetic datasets and twelve multi-dimensional real-world datasets. Moreover, we also apply APC to the Olivetti Face dataset to further assess its effectiveness in terms of face recognition. All experimental results indicate that APC outperforms four classical methods and two state-of-the-art methods in most cases.
format text
author WANG, Yizhang
WANG, Di
PANG, Wei
TAN, Ah-hwee
ZHOU, You
author_facet WANG, Yizhang
WANG, Di
PANG, Wei
TAN, Ah-hwee
ZHOU, You
author_sort WANG, Yizhang
title A systematic density-based clustering method using anchor points
title_short A systematic density-based clustering method using anchor points
title_full A systematic density-based clustering method using anchor points
title_fullStr A systematic density-based clustering method using anchor points
title_full_unstemmed A systematic density-based clustering method using anchor points
title_sort systematic density-based clustering method using anchor points
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/5183
https://ink.library.smu.edu.sg/context/sis_research/article/6186/viewcontent/systematic_density_based_clustering_av.pdf
_version_ 1770575323591606272