REDPC : a residual error-based density peak clustering algorithm
The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However,...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/143202 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-143202 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1432022020-08-12T05:45:28Z REDPC : a residual error-based density peak clustering algorithm Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You School of Computer Science and Engineering Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly Engineering::Computer science and engineering Clustering Density Peak Clustering The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering algorithm named REDPC to better handle datasets comprising various data distribution patterns. Specifically, REDPC adopts the residual error computation to measure the local density within a neighbourhood region. As such, comparing to DPC, our REDPC algorithm provides a better decision graph for the identification of cluster centroids and better handles the low-density data points. Experimental results on both synthetic and real-world datasets show that REDPC performs better than DPC and other algorithms. National Research Foundation (NRF) Accepted version This research is supported in part by the National Science Fund Project of China No. 61772227 and Science & Technology Development Foundation of Jilin Province under the grant No. 20160101259JC, 20180201045GX. This research is also supported in part by the National Research Foundation, Prime Minister’s Office, Singapore under its IDM Futures Funding Initiative. 2020-08-12T05:45:28Z 2020-08-12T05:45:28Z 2018 Journal Article Parmar, M., Wang, D., Zhang, X., Tan, A.-H., Miao, C., Jiang, J., & Zhou, Y. (2019). REDPC: a residual error-based density peak clustering algorithm. Neurocomputing, 348, 82-96. doi: 10.1016/j.neucom.2018.06.087 0925-2312 https://hdl.handle.net/10356/143202 10.1016/j.neucom.2018.06.087 2-s2.0-85056710633 348 82 96 en Neurocomputing © 2018 Elsevier B.V. All rights reserved. This paper was published in Neurocomputing and is made available with permission of Elsevier B.V. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Clustering Density Peak Clustering |
spellingShingle |
Engineering::Computer science and engineering Clustering Density Peak Clustering Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You REDPC : a residual error-based density peak clustering algorithm |
description |
The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering algorithm named REDPC to better handle datasets comprising various data distribution patterns. Specifically, REDPC adopts the residual error computation to measure the local density within a neighbourhood region. As such, comparing to DPC, our REDPC algorithm provides a better decision graph for the identification of cluster centroids and better handles the low-density data points. Experimental results on both synthetic and real-world datasets show that REDPC performs better than DPC and other algorithms. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You |
format |
Article |
author |
Parmar, Milan Wang, Di Zhang, Xiaofeng Tan, Ah-Hwee Miao, Chunyan Jiang, Jianhua Zhou, You |
author_sort |
Parmar, Milan |
title |
REDPC : a residual error-based density peak clustering algorithm |
title_short |
REDPC : a residual error-based density peak clustering algorithm |
title_full |
REDPC : a residual error-based density peak clustering algorithm |
title_fullStr |
REDPC : a residual error-based density peak clustering algorithm |
title_full_unstemmed |
REDPC : a residual error-based density peak clustering algorithm |
title_sort |
redpc : a residual error-based density peak clustering algorithm |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/143202 |
_version_ |
1681056631802036224 |