REDPC : a residual error-based density peak clustering algorithm

The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Parmar, Milan, Wang, Di, Zhang, Xiaofeng, Tan, Ah-Hwee, Miao, Chunyan, Jiang, Jianhua, Zhou, You
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2020
Subjects:	Engineering::Computer science and engineering Clustering Density Peak Clustering
Online Access:	https://hdl.handle.net/10356/143202
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-143202
record_format	dspace
spelling	sg-ntu-dr.10356-1432022020-08-12T05:45:28Z REDPC : a residual error-based density peak clustering algorithm Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You School of Computer Science and Engineering Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly Engineering::Computer science and engineering Clustering Density Peak Clustering The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering algorithm named REDPC to better handle datasets comprising various data distribution patterns. Specifically, REDPC adopts the residual error computation to measure the local density within a neighbourhood region. As such, comparing to DPC, our REDPC algorithm provides a better decision graph for the identification of cluster centroids and better handles the low-density data points. Experimental results on both synthetic and real-world datasets show that REDPC performs better than DPC and other algorithms. National Research Foundation (NRF) Accepted version This research is supported in part by the National Science Fund Project of China No. 61772227 and Science & Technology Development Foundation of Jilin Province under the grant No. 20160101259JC, 20180201045GX. This research is also supported in part by the National Research Foundation, Prime Minister’s Office, Singapore under its IDM Futures Funding Initiative. 2020-08-12T05:45:28Z 2020-08-12T05:45:28Z 2018 Journal Article Parmar, M., Wang, D., Zhang, X., Tan, A.-H., Miao, C., Jiang, J., & Zhou, Y. (2019). REDPC: a residual error-based density peak clustering algorithm. Neurocomputing, 348, 82-96. doi: 10.1016/j.neucom.2018.06.087 0925-2312 https://hdl.handle.net/10356/143202 10.1016/j.neucom.2018.06.087 2-s2.0-85056710633 348 82 96 en Neurocomputing © 2018 Elsevier B.V. All rights reserved. This paper was published in Neurocomputing and is made available with permission of Elsevier B.V. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering Clustering Density Peak Clustering
spellingShingle	Engineering::Computer science and engineering Clustering Density Peak Clustering Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You REDPC : a residual error-based density peak clustering algorithm
description	The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering algorithm named REDPC to better handle datasets comprising various data distribution patterns. Specifically, REDPC adopts the residual error computation to measure the local density within a neighbourhood region. As such, comparing to DPC, our REDPC algorithm provides a better decision graph for the identification of cluster centroids and better handles the low-density data points. Experimental results on both synthetic and real-world datasets show that REDPC performs better than DPC and other algorithms.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You
format	Article
author	Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You
author_sort	Parmar, Milan
title	REDPC : a residual error-based density peak clustering algorithm
title_short	REDPC : a residual error-based density peak clustering algorithm
title_full	REDPC : a residual error-based density peak clustering algorithm
title_fullStr	REDPC : a residual error-based density peak clustering algorithm
title_full_unstemmed	REDPC : a residual error-based density peak clustering algorithm
title_sort	redpc : a residual error-based density peak clustering algorithm
publishDate	2020
url	https://hdl.handle.net/10356/143202
_version_	1681056631802036224

REDPC : a residual error-based density peak clustering algorithm

Similar Items