Open-vocabulary object detection via debiased curriculum self-training
Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base cla...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/180718 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-180718 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1807182024-10-22T00:52:37Z Open-vocabulary object detection via debiased curriculum self-training Zhang, Hanlue Guan, Dayan Ke, Xiangrui El Saddik, Abdulmotaleb Lu, Shijian College of Computing and Data Science Computer and Information Science Object detection Open-vocabulary recognition Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base classes, leading to sub-optimal open-vocabulary detectors. We propose DCS, a novel Debiased Curriculum Self-Training technique that generates pseudo object boxes progressively and adaptively for training accurate open-vocabulary detectors. DCS consists of two complementary designs, namely, progressive pseudo-label filtering (PPF) and adaptive pseudo-label selection (APS). Specifically, PPF discards confident but mismatched detection progressively at the early training stage when the trained detector is biased towards the base classes, APS instead fuses class-aware and class-agnostic pseudo labels by prioritizing class-aware pseudo labels at the late training stage when the detector can better recognize novel classes. Without bells and whistles, DCS achieves superior detection performance over two open-vocabulary detection benchmarks. 2024-10-22T00:52:37Z 2024-10-22T00:52:37Z 2024 Journal Article Zhang, H., Guan, D., Ke, X., El Saddik, A. & Lu, S. (2024). Open-vocabulary object detection via debiased curriculum self-training. Expert Systems With Applications, 255, 124762-. https://dx.doi.org/10.1016/j.eswa.2024.124762 0957-4174 https://hdl.handle.net/10356/180718 10.1016/j.eswa.2024.124762 2-s2.0-85198754234 255 124762 en Expert Systems with Applications © 2024 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Object detection Open-vocabulary recognition |
spellingShingle |
Computer and Information Science Object detection Open-vocabulary recognition Zhang, Hanlue Guan, Dayan Ke, Xiangrui El Saddik, Abdulmotaleb Lu, Shijian Open-vocabulary object detection via debiased curriculum self-training |
description |
Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base classes, leading to sub-optimal open-vocabulary detectors. We propose DCS, a novel Debiased Curriculum Self-Training technique that generates pseudo object boxes progressively and adaptively for training accurate open-vocabulary detectors. DCS consists of two complementary designs, namely, progressive pseudo-label filtering (PPF) and adaptive pseudo-label selection (APS). Specifically, PPF discards confident but mismatched detection progressively at the early training stage when the trained detector is biased towards the base classes, APS instead fuses class-aware and class-agnostic pseudo labels by prioritizing class-aware pseudo labels at the late training stage when the detector can better recognize novel classes. Without bells and whistles, DCS achieves superior detection performance over two open-vocabulary detection benchmarks. |
author2 |
College of Computing and Data Science |
author_facet |
College of Computing and Data Science Zhang, Hanlue Guan, Dayan Ke, Xiangrui El Saddik, Abdulmotaleb Lu, Shijian |
format |
Article |
author |
Zhang, Hanlue Guan, Dayan Ke, Xiangrui El Saddik, Abdulmotaleb Lu, Shijian |
author_sort |
Zhang, Hanlue |
title |
Open-vocabulary object detection via debiased curriculum self-training |
title_short |
Open-vocabulary object detection via debiased curriculum self-training |
title_full |
Open-vocabulary object detection via debiased curriculum self-training |
title_fullStr |
Open-vocabulary object detection via debiased curriculum self-training |
title_full_unstemmed |
Open-vocabulary object detection via debiased curriculum self-training |
title_sort |
open-vocabulary object detection via debiased curriculum self-training |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/180718 |
_version_ |
1814777756958851072 |