RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision

Semi-supervised anomaly detection methods leverage a few anomaly examples to yield drastically improved performance compared to unsupervised models. However, they still suffer from two limitations: 1) unlabeled anomalies (i.e., anomaly contamination) may mislead the learning process when all the unl...

Full description

Saved in:
Bibliographic Details
Main Authors: XU, Hongzuo, WANG, Yijie, PANG, Guansong, JIAN, Songlei, LIU, Ning, WANG, Yongjun
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8267
https://ink.library.smu.edu.sg/context/sis_research/article/9270/viewcontent/Rosas_av_cc_by.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9270
record_format dspace
spelling sg-smu-ink.sis_research-92702023-11-10T08:48:42Z RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision XU, Hongzuo WANG, Yijie PANG, Guansong JIAN, Songlei LIU, Ning WANG, Yongjun Semi-supervised anomaly detection methods leverage a few anomaly examples to yield drastically improved performance compared to unsupervised models. However, they still suffer from two limitations: 1) unlabeled anomalies (i.e., anomaly contamination) may mislead the learning process when all the unlabeled data are employed as inliers for model training; 2) only discrete supervision information (such as binary or ordinal data labels) is exploited, which leads to suboptimal learning of anomaly scores that essentially take on a continuous distribution. Therefore, this paper proposes a novel semi-supervised anomaly detection method, which devises contamination-resilient continuous supervisory signals. Specifically, we propose a mass interpolation method to diffuse the abnormality of labeled anomalies, thereby creating new data samples labeled with continuous abnormal degrees. Meanwhile, the contaminated area can be covered by new data samples generated via combinations of data with correct labels. A feature learning based objective is added to serve as an optimization constraint to regularize the network and further enhance the robustness w.r.t. anomaly contamination. Extensive experiments on 11 real world datasets show that our approach significantly outperforms state-of-the-art competitors by 20%-30% in AUC-PR and obtains more robust and superior performance in settings with different anomaly contamination levels and varying numbers of labeled anomalies. 2023-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8267 info:doi/10.1016/j.ipm.2023.103459 https://ink.library.smu.edu.sg/context/sis_research/article/9270/viewcontent/Rosas_av_cc_by.pdf http://creativecommons.org/licenses/by/3.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Anomaly detection Anomaly contamination Continuous supervision Semi-supervised learning Deep learning Artificial Intelligence and Robotics Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Anomaly detection
Anomaly contamination
Continuous supervision
Semi-supervised learning
Deep learning
Artificial Intelligence and Robotics
Databases and Information Systems
spellingShingle Anomaly detection
Anomaly contamination
Continuous supervision
Semi-supervised learning
Deep learning
Artificial Intelligence and Robotics
Databases and Information Systems
XU, Hongzuo
WANG, Yijie
PANG, Guansong
JIAN, Songlei
LIU, Ning
WANG, Yongjun
RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision
description Semi-supervised anomaly detection methods leverage a few anomaly examples to yield drastically improved performance compared to unsupervised models. However, they still suffer from two limitations: 1) unlabeled anomalies (i.e., anomaly contamination) may mislead the learning process when all the unlabeled data are employed as inliers for model training; 2) only discrete supervision information (such as binary or ordinal data labels) is exploited, which leads to suboptimal learning of anomaly scores that essentially take on a continuous distribution. Therefore, this paper proposes a novel semi-supervised anomaly detection method, which devises contamination-resilient continuous supervisory signals. Specifically, we propose a mass interpolation method to diffuse the abnormality of labeled anomalies, thereby creating new data samples labeled with continuous abnormal degrees. Meanwhile, the contaminated area can be covered by new data samples generated via combinations of data with correct labels. A feature learning based objective is added to serve as an optimization constraint to regularize the network and further enhance the robustness w.r.t. anomaly contamination. Extensive experiments on 11 real world datasets show that our approach significantly outperforms state-of-the-art competitors by 20%-30% in AUC-PR and obtains more robust and superior performance in settings with different anomaly contamination levels and varying numbers of labeled anomalies.
format text
author XU, Hongzuo
WANG, Yijie
PANG, Guansong
JIAN, Songlei
LIU, Ning
WANG, Yongjun
author_facet XU, Hongzuo
WANG, Yijie
PANG, Guansong
JIAN, Songlei
LIU, Ning
WANG, Yongjun
author_sort XU, Hongzuo
title RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision
title_short RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision
title_full RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision
title_fullStr RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision
title_full_unstemmed RoSAS: Deep semi-supervised anomaly detection with contamination-resilient continuous supervision
title_sort rosas: deep semi-supervised anomaly detection with contamination-resilient continuous supervision
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8267
https://ink.library.smu.edu.sg/context/sis_research/article/9270/viewcontent/Rosas_av_cc_by.pdf
_version_ 1783955661368852480