ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets
This paper introduces a new unsupervised anomaly detector called ZERO++ which employs the number of zero appearances in subspaces to detect anomalies in categorical data. It is unique in that it works in regions of subspaces that are not occupied by data; whereas existing methods work in regions occ...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2016
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7026 https://ink.library.smu.edu.sg/context/sis_research/article/8029/viewcontent/11035_Article_Text_20550_1_10_20180216.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8029 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-80292022-03-17T15:02:05Z ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets PANG, Guansong TING, Kai Ming ALBRECHT, David JIN, Huidong This paper introduces a new unsupervised anomaly detector called ZERO++ which employs the number of zero appearances in subspaces to detect anomalies in categorical data. It is unique in that it works in regions of subspaces that are not occupied by data; whereas existing methods work in regions occupied by data. ZERO++ examines only a small number of low dimensional subspaces to successfully identify anomalies. Unlike existing frequencybased algorithms, ZERO++ does not involve subspace pattern searching. We show that ZERO++ is better than or comparable with the state-of-the-art anomaly detection methods over a wide range of real-world categorical and numeric data sets; and it is efficient with linear time complexity and constant space complexity which make it a suitable candidate for large-scale data sets. 2016-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7026 info:doi/10.1613/jair.5228 https://ink.library.smu.edu.sg/context/sis_research/article/8029/viewcontent/11035_Article_Text_20550_1_10_20180216.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Artificial Intelligence and Robotics Databases and Information Systems |
spellingShingle |
Artificial Intelligence and Robotics Databases and Information Systems PANG, Guansong TING, Kai Ming ALBRECHT, David JIN, Huidong ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets |
description |
This paper introduces a new unsupervised anomaly detector called ZERO++ which employs the number of zero appearances in subspaces to detect anomalies in categorical data. It is unique in that it works in regions of subspaces that are not occupied by data; whereas existing methods work in regions occupied by data. ZERO++ examines only a small number of low dimensional subspaces to successfully identify anomalies. Unlike existing frequencybased algorithms, ZERO++ does not involve subspace pattern searching. We show that ZERO++ is better than or comparable with the state-of-the-art anomaly detection methods over a wide range of real-world categorical and numeric data sets; and it is efficient with linear time complexity and constant space complexity which make it a suitable candidate for large-scale data sets. |
format |
text |
author |
PANG, Guansong TING, Kai Ming ALBRECHT, David JIN, Huidong |
author_facet |
PANG, Guansong TING, Kai Ming ALBRECHT, David JIN, Huidong |
author_sort |
PANG, Guansong |
title |
ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets |
title_short |
ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets |
title_full |
ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets |
title_fullStr |
ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets |
title_full_unstemmed |
ZERO++: Harnessing the power of zero appearances to detect anomalies in large-scale data sets |
title_sort |
zero++: harnessing the power of zero appearances to detect anomalies in large-scale data sets |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2016 |
url |
https://ink.library.smu.edu.sg/sis_research/7026 https://ink.library.smu.edu.sg/context/sis_research/article/8029/viewcontent/11035_Article_Text_20550_1_10_20180216.pdf |
_version_ |
1770576190322507776 |