An Evolutionary Stream Clustering Technique for Outlier Detection

Bibliographic Details
Main Authors: Supardi, N.A., Abdulkadir, S.J., Aziz, N.
Format: Conference or Workshop Item
Published: Institute of Electrical and Electronics Engineers Inc. 2020
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85097540799&doi=10.1109%2fICCI51257.2020.9247832&partnerID=40&md5=0e4b185d3e95c097ae42c8390035dd4e
http://eprints.utp.edu.my/29857/
Institution: Universiti Teknologi Petronas
Description
Summary: Clustering data streams has become one of the most popular research topics as the field continues to develop. Data streams pose numerous challenges for clustering, such as limited time, limited memory, and the need for single-scan clustering. In addition, identifying clusters of arbitrary shape is very significant in data stream applications. A data stream is an infinite sequence of elements that evolves over time, with no prior knowledge of the number of clusters. Various factors, such as occasional noise, can negatively affect the data stream environment. The density-based technique has proven to be an effective method for clustering data streams. It is computationally efficient, yields clusters of arbitrary shape, and detects noise immediately. Generally, it does not require the number of clusters in advance. Most traditional density-based clustering algorithms are not directly applicable to data streams because of the characteristics of such streams. Nearly all of them, however, can be extended to variants designed for data streams. This work focuses on the density-based technique in the clustering process to overcome the constraints imposed by the nature of data streams. This paper presents preliminary results for a density-based algorithm (evoStream) for clustering, investigating outlier detection on three real data sets: KDDCup99, sensor, and power supply. Later, this algorithm will be extended to optimize the model for detecting outliers in data streams. © 2020 IEEE.
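
Illustration: the summary describes single-scan, density-based clustering that flags noise without knowing the number of clusters in advance. The sketch below shows that general idea only. It is not the evoStream algorithm and not the authors' implementation. The names MicroCluster and detect_outliers, and the parameters radius, min_weight and lam, are hypothetical and chosen for this example. Each point is read once and either merged into the nearest micro-cluster within radius or used to start a new one. A point is flagged as a potential outlier while its micro-cluster is still lighter than min_weight. Weights fade exponentially so that old regions lose density over time.

```python
import math


class MicroCluster:
    """Hypothetical summary of a dense region: center, faded weight, last update time."""

    def __init__(self, point, t):
        self.center = list(point)
        self.weight = 1.0
        self.last_update = t

    def absorb(self, point, t, lam):
        # Fade the old weight by 2^(-lam * elapsed), then count the new point.
        self.weight *= 2.0 ** (-lam * (t - self.last_update))
        self.last_update = t
        self.weight += 1.0
        # Move the center toward the new point in proportion to its share of the weight.
        self.center = [c + (x - c) / self.weight for c, x in zip(self.center, point)]


def detect_outliers(stream, radius=1.0, min_weight=3.0, lam=0.01):
    """Single pass over `stream`; returns one boolean per point (True = potential outlier).

    All thresholds are assumptions made for this sketch, not values from the paper.
    """
    clusters = []
    flags = []
    for t, point in enumerate(stream):
        # Find the nearest existing micro-cluster (linear scan, for simplicity).
        nearest, best = None, math.inf
        for mc in clusters:
            d = math.dist(mc.center, point)
            if d < best:
                nearest, best = mc, d
        if nearest is not None and best <= radius:
            nearest.absorb(point, t, lam)
            # A point in a region that is still sparse is treated as a potential outlier.
            flags.append(nearest.weight < min_weight)
        else:
            # No dense region nearby: start a new micro-cluster and flag the point.
            clusters.append(MicroCluster(point, t))
            flags.append(True)
    return flags


if __name__ == "__main__":
    demo = [(0.0, 0.0)] * 5 + [(10.0, 10.0)]
    print(detect_outliers(demo, lam=0.0))
```

In this sketch the first points of any new region are flagged until the region accumulates enough weight, which is the usual price of deciding in a single scan. A fuller method would also remove micro-clusters whose faded weight has become negligible and would group the remaining ones into final clusters.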