SCLOPE: An algorithm for clustering data streams of categorical attributes

Clustering is a difficult problem especially when we consider the task in the context of a data stream of categorical attributes. In this paper, we propose SCLOPE, a novel algorithm based on CLOPE's intuitive observation about cluster histograms. Unlike CLOPE however, our algorithm is very fast...

Full description

Saved in:
Bibliographic Details
Main Authors: ONG, Kok-Leong, LI, Wenyuan, NG, Wee-Keong, LIM, Ee Peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2004
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1021
https://ink.library.smu.edu.sg/context/sis_research/article/2020/viewcontent/SCLOPE__An_algorithm_for_clustering_data_streams_of_categorical_attributes.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2020
record_format dspace
spelling sg-smu-ink.sis_research-20202018-06-13T07:22:05Z SCLOPE: An algorithm for clustering data streams of categorical attributes ONG, Kok-Leong LI, Wenyuan NG, Wee-Keong LIM, Ee Peng Clustering is a difficult problem especially when we consider the task in the context of a data stream of categorical attributes. In this paper, we propose SCLOPE, a novel algorithm based on CLOPE's intuitive observation about cluster histograms. Unlike CLOPE however, our algorithm is very fast and operates within the constraints of a data stream environment. In particular, we designed SCLOPE according to the recent CluStream framework. Our evaluation of SCLOPE shows very promising results. It consistently outperforms CLOPE in speed and scalability tests on our data sets while maintaining high cluster purity; it also supports cluster analysis that other algorithms in its class do not. 2004-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1021 info:doi/10.1007/978-3-540-30076-2_21 https://ink.library.smu.edu.sg/context/sis_research/article/2020/viewcontent/SCLOPE__An_algorithm_for_clustering_data_streams_of_categorical_attributes.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
ONG, Kok-Leong
LI, Wenyuan
NG, Wee-Keong
LIM, Ee Peng
SCLOPE: An algorithm for clustering data streams of categorical attributes
description Clustering is a difficult problem especially when we consider the task in the context of a data stream of categorical attributes. In this paper, we propose SCLOPE, a novel algorithm based on CLOPE's intuitive observation about cluster histograms. Unlike CLOPE however, our algorithm is very fast and operates within the constraints of a data stream environment. In particular, we designed SCLOPE according to the recent CluStream framework. Our evaluation of SCLOPE shows very promising results. It consistently outperforms CLOPE in speed and scalability tests on our data sets while maintaining high cluster purity; it also supports cluster analysis that other algorithms in its class do not.
format text
author ONG, Kok-Leong
LI, Wenyuan
NG, Wee-Keong
LIM, Ee Peng
author_facet ONG, Kok-Leong
LI, Wenyuan
NG, Wee-Keong
LIM, Ee Peng
author_sort ONG, Kok-Leong
title SCLOPE: An algorithm for clustering data streams of categorical attributes
title_short SCLOPE: An algorithm for clustering data streams of categorical attributes
title_full SCLOPE: An algorithm for clustering data streams of categorical attributes
title_fullStr SCLOPE: An algorithm for clustering data streams of categorical attributes
title_full_unstemmed SCLOPE: An algorithm for clustering data streams of categorical attributes
title_sort sclope: an algorithm for clustering data streams of categorical attributes
publisher Institutional Knowledge at Singapore Management University
publishDate 2004
url https://ink.library.smu.edu.sg/sis_research/1021
https://ink.library.smu.edu.sg/context/sis_research/article/2020/viewcontent/SCLOPE__An_algorithm_for_clustering_data_streams_of_categorical_attributes.pdf
_version_ 1770570824871313408