REDPC: A residual error-based density peak clustering algorithm

The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However,...

Full description

Saved in:
Bibliographic Details
Main Authors: PARMAR, Milan, WANG, Di, ZHANG, Xiaofeng, TAN, Ah-hwee, MIAO, Chunyan, ZHOU, You
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5185
https://ink.library.smu.edu.sg/context/sis_research/article/6188/viewcontent/NeuCom2018REDPC.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6188
record_format dspace
spelling sg-smu-ink.sis_research-61882020-07-23T18:51:14Z REDPC: A residual error-based density peak clustering algorithm PARMAR, Milan WANG, Di ZHANG, Xiaofeng TAN, Ah-hwee MIAO, Chunyan ZHOU, You The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering algorithm named REDPC to better handle datasets comprising various data distribution patterns. Specifically, REDPC adopts the residual error computation to measure the local density within a neighbourhood region. As such, comparing to DPC, our REDPC algorithm provides a better decision graph for the identification of cluster centroids and better handles the low-density data points. Experimental results on both synthetic and real-world datasets show that REDPC performs better than DPC and other algorithms. 2019-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5185 info:doi/10.1016/j.neucom.2018.06.087 https://ink.library.smu.edu.sg/context/sis_research/article/6188/viewcontent/NeuCom2018REDPC.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Clustering Density peak clustering Anomaly detection Residual error Low-density data points Databases and Information Systems Software Engineering Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Clustering
Density peak clustering
Anomaly detection
Residual error
Low-density data points
Databases and Information Systems
Software Engineering
Theory and Algorithms
spellingShingle Clustering
Density peak clustering
Anomaly detection
Residual error
Low-density data points
Databases and Information Systems
Software Engineering
Theory and Algorithms
PARMAR, Milan
WANG, Di
ZHANG, Xiaofeng
TAN, Ah-hwee
MIAO, Chunyan
ZHOU, You
REDPC: A residual error-based density peak clustering algorithm
description The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering algorithm named REDPC to better handle datasets comprising various data distribution patterns. Specifically, REDPC adopts the residual error computation to measure the local density within a neighbourhood region. As such, comparing to DPC, our REDPC algorithm provides a better decision graph for the identification of cluster centroids and better handles the low-density data points. Experimental results on both synthetic and real-world datasets show that REDPC performs better than DPC and other algorithms.
format text
author PARMAR, Milan
WANG, Di
ZHANG, Xiaofeng
TAN, Ah-hwee
MIAO, Chunyan
ZHOU, You
author_facet PARMAR, Milan
WANG, Di
ZHANG, Xiaofeng
TAN, Ah-hwee
MIAO, Chunyan
ZHOU, You
author_sort PARMAR, Milan
title REDPC: A residual error-based density peak clustering algorithm
title_short REDPC: A residual error-based density peak clustering algorithm
title_full REDPC: A residual error-based density peak clustering algorithm
title_fullStr REDPC: A residual error-based density peak clustering algorithm
title_full_unstemmed REDPC: A residual error-based density peak clustering algorithm
title_sort redpc: a residual error-based density peak clustering algorithm
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/5185
https://ink.library.smu.edu.sg/context/sis_research/article/6188/viewcontent/NeuCom2018REDPC.pdf
_version_ 1770575324600336384