Robust local triangular kernel density-based clustering for high-dimensional data

A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of the existing algorithms are limited, since the respective data has high dimension, contain large amount of noise and consist of clusters with arbitrary shapes and den...

Full description

Saved in:
Bibliographic Details
Main Authors: Musdholifah, Aina, Mohd Hashim, Siti Zaiton
Format: Conference or Workshop Item
Published: 2013
Subjects:
Online Access:http://eprints.utm.my/id/eprint/51289/
http://apps.webofknowledge.com.ezproxy.utm.my/full_record.do?product=WOS&search_mode=GeneralSearch&qid=11&SID=R2Cjh3fA6kIeWhVr585&page=1&doc=1
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.51289
record_format eprints
spelling my.utm.512892017-07-18T07:46:37Z http://eprints.utm.my/id/eprint/51289/ Robust local triangular kernel density-based clustering for high-dimensional data Musdholifah, Aina Mohd Hashim, Siti Zaiton QA75 Electronic computers. Computer science A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of the existing algorithms are limited, since the respective data has high dimension, contain large amount of noise and consist of clusters with arbitrary shapes and densities. In this paper, a new kernel density-based clustering algorithm, called Local Triangular Kernel-based Clustering (LTKC), is proposed to deal with these conditions. LTKC is based on combination of k-nearest-neighbor density estimation and triangular kernel density-based clustering. The advantages of our LTKC approach are: (1) it has a firm mathematical basis; (2) it requires only one parameter, number of neighbors; (3) it defines the number of cluster automatically; (4) it allows discovering clusters with arbitrary shapes and densities; and (5) it is significantly faster than existing algorithms. LTKC is tested using artificial data and applied to some UCI data. A comparison with k-means, KFCM and well known density-based clustering algorithms including ILGC, DBSCAN, and DENCLUE shows the superiority of our proposed LTKC algorithm. 2013 Conference or Workshop Item PeerReviewed Musdholifah, Aina and Mohd Hashim, Siti Zaiton (2013) Robust local triangular kernel density-based clustering for high-dimensional data. In: 2013 5th International Conference on Computer Science and Information Technology, CSIT 2013 - Proceedings, MAR 27-28, 2013, Amman, Jordon. http://apps.webofknowledge.com.ezproxy.utm.my/full_record.do?product=WOS&search_mode=GeneralSearch&qid=11&SID=R2Cjh3fA6kIeWhVr585&page=1&doc=1
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Musdholifah, Aina
Mohd Hashim, Siti Zaiton
Robust local triangular kernel density-based clustering for high-dimensional data
description A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of the existing algorithms are limited, since the respective data has high dimension, contain large amount of noise and consist of clusters with arbitrary shapes and densities. In this paper, a new kernel density-based clustering algorithm, called Local Triangular Kernel-based Clustering (LTKC), is proposed to deal with these conditions. LTKC is based on combination of k-nearest-neighbor density estimation and triangular kernel density-based clustering. The advantages of our LTKC approach are: (1) it has a firm mathematical basis; (2) it requires only one parameter, number of neighbors; (3) it defines the number of cluster automatically; (4) it allows discovering clusters with arbitrary shapes and densities; and (5) it is significantly faster than existing algorithms. LTKC is tested using artificial data and applied to some UCI data. A comparison with k-means, KFCM and well known density-based clustering algorithms including ILGC, DBSCAN, and DENCLUE shows the superiority of our proposed LTKC algorithm.
format Conference or Workshop Item
author Musdholifah, Aina
Mohd Hashim, Siti Zaiton
author_facet Musdholifah, Aina
Mohd Hashim, Siti Zaiton
author_sort Musdholifah, Aina
title Robust local triangular kernel density-based clustering for high-dimensional data
title_short Robust local triangular kernel density-based clustering for high-dimensional data
title_full Robust local triangular kernel density-based clustering for high-dimensional data
title_fullStr Robust local triangular kernel density-based clustering for high-dimensional data
title_full_unstemmed Robust local triangular kernel density-based clustering for high-dimensional data
title_sort robust local triangular kernel density-based clustering for high-dimensional data
publishDate 2013
url http://eprints.utm.my/id/eprint/51289/
http://apps.webofknowledge.com.ezproxy.utm.my/full_record.do?product=WOS&search_mode=GeneralSearch&qid=11&SID=R2Cjh3fA6kIeWhVr585&page=1&doc=1
_version_ 1643652995789881344