ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets

This paper introduces a new unsupervised anomaly detector called ZERO++ which employs the number of zero appearances in subspaces to detect anomalies in categorical data. It is unique in that it works in regions of subspaces that are not occupied by data; whereas existing methods work in regions occ...

Full description

Saved in:
Bibliographic Details
Main Authors: PANG, Guansong, TING, Kai Ming, ALBRECHT, David, JIN, Huidong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7026
https://ink.library.smu.edu.sg/context/sis_research/article/8029/viewcontent/11035_Article_Text_20550_1_10_20180216.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8029
record_format dspace
spelling sg-smu-ink.sis_research-80292022-03-17T15:02:05Z ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets PANG, Guansong TING, Kai Ming ALBRECHT, David JIN, Huidong This paper introduces a new unsupervised anomaly detector called ZERO++ which employs the number of zero appearances in subspaces to detect anomalies in categorical data. It is unique in that it works in regions of subspaces that are not occupied by data; whereas existing methods work in regions occupied by data. ZERO++ examines only a small number of low dimensional subspaces to successfully identify anomalies. Unlike existing frequencybased algorithms, ZERO++ does not involve subspace pattern searching. We show that ZERO++ is better than or comparable with the state-of-the-art anomaly detection methods over a wide range of real-world categorical and numeric data sets; and it is efficient with linear time complexity and constant space complexity which make it a suitable candidate for large-scale data sets. 2016-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7026 info:doi/10.1613/jair.5228 https://ink.library.smu.edu.sg/context/sis_research/article/8029/viewcontent/11035_Article_Text_20550_1_10_20180216.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
Databases and Information Systems
spellingShingle Artificial Intelligence and Robotics
Databases and Information Systems
PANG, Guansong
TING, Kai Ming
ALBRECHT, David
JIN, Huidong
ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
description This paper introduces a new unsupervised anomaly detector called ZERO++ which employs the number of zero appearances in subspaces to detect anomalies in categorical data. It is unique in that it works in regions of subspaces that are not occupied by data; whereas existing methods work in regions occupied by data. ZERO++ examines only a small number of low dimensional subspaces to successfully identify anomalies. Unlike existing frequencybased algorithms, ZERO++ does not involve subspace pattern searching. We show that ZERO++ is better than or comparable with the state-of-the-art anomaly detection methods over a wide range of real-world categorical and numeric data sets; and it is efficient with linear time complexity and constant space complexity which make it a suitable candidate for large-scale data sets.
format text
author PANG, Guansong
TING, Kai Ming
ALBRECHT, David
JIN, Huidong
author_facet PANG, Guansong
TING, Kai Ming
ALBRECHT, David
JIN, Huidong
author_sort PANG, Guansong
title ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
title_short ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
title_full ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
title_fullStr ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
title_full_unstemmed ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
title_sort zero++: harnessing the power of zero appearances to detect anomalies in large-scale data sets
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/7026
https://ink.library.smu.edu.sg/context/sis_research/article/8029/viewcontent/11035_Article_Text_20550_1_10_20180216.pdf
_version_ 1770576190322507776