Anomaly detection under distribution shift
Anomaly detection (AD) is a crucial machine learning task that aims to learn patterns from a set of normal training samples to identify abnormal samples in test data. Most existing AD studies assume that the training and test data are drawn from the same data distribution, but the test data can have...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8415 https://ink.library.smu.edu.sg/context/sis_research/article/9418/viewcontent/Cao_Anomaly_Detection_Under_Distribution_Shift_ICCV_2023_paper.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9418 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-94182024-01-09T03:45:07Z Anomaly detection under distribution shift CAO, Tri ZHU, Jiawen PANG, Guansong Anomaly detection (AD) is a crucial machine learning task that aims to learn patterns from a set of normal training samples to identify abnormal samples in test data. Most existing AD studies assume that the training and test data are drawn from the same data distribution, but the test data can have large distribution shifts arising in many real-world applications due to different natural variations such as new lighting conditions, object poses, or background appearances, rendering existing AD methods ineffective in such cases. In this paper, we consider the problem of anomaly detection under distribution shift and establish performance benchmarks on four widely-used AD and out-of-distribution (OOD) generalization datasets. We demonstrate that simple adaptation of state-of-the-art OOD generalization methods to AD settings fails to work effectively due to the lack of labeled anomaly data. We further introduce a novel robust AD approach to diverse distribution shifts by minimizing the distribution gap between indistribution and OOD normal samples in both the training and inference stages in an unsupervised way. Our extensive empirical results on the four datasets show that our approach substantially outperforms state-of-the-art AD methods and OOD generalization methods on data with various distribution shifts, while maintaining the detection accuracy on in-distribution data. Code and data are available at https://github.com/mala-lab/ADShift. 2023-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8415 https://ink.library.smu.edu.sg/context/sis_research/article/9418/viewcontent/Cao_Anomaly_Detection_Under_Distribution_Shift_ICCV_2023_paper.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems CAO, Tri ZHU, Jiawen PANG, Guansong Anomaly detection under distribution shift |
description |
Anomaly detection (AD) is a crucial machine learning task that aims to learn patterns from a set of normal training samples to identify abnormal samples in test data. Most existing AD studies assume that the training and test data are drawn from the same data distribution, but the test data can have large distribution shifts arising in many real-world applications due to different natural variations such as new lighting conditions, object poses, or background appearances, rendering existing AD methods ineffective in such cases. In this paper, we consider the problem of anomaly detection under distribution shift and establish performance benchmarks on four widely-used AD and out-of-distribution (OOD) generalization datasets. We demonstrate that simple adaptation of state-of-the-art OOD generalization methods to AD settings fails to work effectively due to the lack of labeled anomaly data. We further introduce a novel robust AD approach to diverse distribution shifts by minimizing the distribution gap between indistribution and OOD normal samples in both the training and inference stages in an unsupervised way. Our extensive empirical results on the four datasets show that our approach substantially outperforms state-of-the-art AD methods and OOD generalization methods on data with various distribution shifts, while maintaining the detection accuracy on in-distribution data. Code and data are available at https://github.com/mala-lab/ADShift. |
format |
text |
author |
CAO, Tri ZHU, Jiawen PANG, Guansong |
author_facet |
CAO, Tri ZHU, Jiawen PANG, Guansong |
author_sort |
CAO, Tri |
title |
Anomaly detection under distribution shift |
title_short |
Anomaly detection under distribution shift |
title_full |
Anomaly detection under distribution shift |
title_fullStr |
Anomaly detection under distribution shift |
title_full_unstemmed |
Anomaly detection under distribution shift |
title_sort |
anomaly detection under distribution shift |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8415 https://ink.library.smu.edu.sg/context/sis_research/article/9418/viewcontent/Cao_Anomaly_Detection_Under_Distribution_Shift_ICCV_2023_paper.pdf |
_version_ |
1787590771858210816 |