Anomaly detection under distribution shift

Anomaly detection (AD) is a crucial machine learning task that aims to learn patterns from a set of normal training samples to identify abnormal samples in test data. Most existing AD studies assume that the training and test data are drawn from the same data distribution, but the test data can have...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Tri, ZHU, Jiawen, PANG, Guansong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8415
https://ink.library.smu.edu.sg/context/sis_research/article/9418/viewcontent/Cao_Anomaly_Detection_Under_Distribution_Shift_ICCV_2023_paper.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9418
record_format dspace
spelling sg-smu-ink.sis_research-94182024-01-09T03:45:07Z Anomaly detection under distribution shift CAO, Tri ZHU, Jiawen PANG, Guansong Anomaly detection (AD) is a crucial machine learning task that aims to learn patterns from a set of normal training samples to identify abnormal samples in test data. Most existing AD studies assume that the training and test data are drawn from the same data distribution, but the test data can have large distribution shifts arising in many real-world applications due to different natural variations such as new lighting conditions, object poses, or background appearances, rendering existing AD methods ineffective in such cases. In this paper, we consider the problem of anomaly detection under distribution shift and establish performance benchmarks on four widely-used AD and out-of-distribution (OOD) generalization datasets. We demonstrate that simple adaptation of state-of-the-art OOD generalization methods to AD settings fails to work effectively due to the lack of labeled anomaly data. We further introduce a novel robust AD approach to diverse distribution shifts by minimizing the distribution gap between indistribution and OOD normal samples in both the training and inference stages in an unsupervised way. Our extensive empirical results on the four datasets show that our approach substantially outperforms state-of-the-art AD methods and OOD generalization methods on data with various distribution shifts, while maintaining the detection accuracy on in-distribution data. Code and data are available at https://github.com/mala-lab/ADShift. 2023-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8415 https://ink.library.smu.edu.sg/context/sis_research/article/9418/viewcontent/Cao_Anomaly_Detection_Under_Distribution_Shift_ICCV_2023_paper.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
CAO, Tri
ZHU, Jiawen
PANG, Guansong
Anomaly detection under distribution shift
description Anomaly detection (AD) is a crucial machine learning task that aims to learn patterns from a set of normal training samples to identify abnormal samples in test data. Most existing AD studies assume that the training and test data are drawn from the same data distribution, but the test data can have large distribution shifts arising in many real-world applications due to different natural variations such as new lighting conditions, object poses, or background appearances, rendering existing AD methods ineffective in such cases. In this paper, we consider the problem of anomaly detection under distribution shift and establish performance benchmarks on four widely-used AD and out-of-distribution (OOD) generalization datasets. We demonstrate that simple adaptation of state-of-the-art OOD generalization methods to AD settings fails to work effectively due to the lack of labeled anomaly data. We further introduce a novel robust AD approach to diverse distribution shifts by minimizing the distribution gap between indistribution and OOD normal samples in both the training and inference stages in an unsupervised way. Our extensive empirical results on the four datasets show that our approach substantially outperforms state-of-the-art AD methods and OOD generalization methods on data with various distribution shifts, while maintaining the detection accuracy on in-distribution data. Code and data are available at https://github.com/mala-lab/ADShift.
format text
author CAO, Tri
ZHU, Jiawen
PANG, Guansong
author_facet CAO, Tri
ZHU, Jiawen
PANG, Guansong
author_sort CAO, Tri
title Anomaly detection under distribution shift
title_short Anomaly detection under distribution shift
title_full Anomaly detection under distribution shift
title_fullStr Anomaly detection under distribution shift
title_full_unstemmed Anomaly detection under distribution shift
title_sort anomaly detection under distribution shift
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8415
https://ink.library.smu.edu.sg/context/sis_research/article/9418/viewcontent/Cao_Anomaly_Detection_Under_Distribution_Shift_ICCV_2023_paper.pdf
_version_ 1787590771858210816