Open-vocabulary object detection via debiased curriculum self-training

Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base cla...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Hanlue, Guan, Dayan, Ke, Xiangrui, El Saddik, Abdulmotaleb, Lu, Shijian
Other Authors: College of Computing and Data Science
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/180718
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-180718
record_format dspace
spelling sg-ntu-dr.10356-1807182024-10-22T00:52:37Z Open-vocabulary object detection via debiased curriculum self-training Zhang, Hanlue Guan, Dayan Ke, Xiangrui El Saddik, Abdulmotaleb Lu, Shijian College of Computing and Data Science Computer and Information Science Object detection Open-vocabulary recognition Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base classes, leading to sub-optimal open-vocabulary detectors. We propose DCS, a novel Debiased Curriculum Self-Training technique that generates pseudo object boxes progressively and adaptively for training accurate open-vocabulary detectors. DCS consists of two complementary designs, namely, progressive pseudo-label filtering (PPF) and adaptive pseudo-label selection (APS). Specifically, PPF discards confident but mismatched detection progressively at the early training stage when the trained detector is biased towards the base classes, APS instead fuses class-aware and class-agnostic pseudo labels by prioritizing class-aware pseudo labels at the late training stage when the detector can better recognize novel classes. Without bells and whistles, DCS achieves superior detection performance over two open-vocabulary detection benchmarks. 2024-10-22T00:52:37Z 2024-10-22T00:52:37Z 2024 Journal Article Zhang, H., Guan, D., Ke, X., El Saddik, A. & Lu, S. (2024). Open-vocabulary object detection via debiased curriculum self-training. Expert Systems With Applications, 255, 124762-. https://dx.doi.org/10.1016/j.eswa.2024.124762 0957-4174 https://hdl.handle.net/10356/180718 10.1016/j.eswa.2024.124762 2-s2.0-85198754234 255 124762 en Expert Systems with Applications © 2024 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Object detection
Open-vocabulary recognition
spellingShingle Computer and Information Science
Object detection
Open-vocabulary recognition
Zhang, Hanlue
Guan, Dayan
Ke, Xiangrui
El Saddik, Abdulmotaleb
Lu, Shijian
Open-vocabulary object detection via debiased curriculum self-training
description Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base classes, leading to sub-optimal open-vocabulary detectors. We propose DCS, a novel Debiased Curriculum Self-Training technique that generates pseudo object boxes progressively and adaptively for training accurate open-vocabulary detectors. DCS consists of two complementary designs, namely, progressive pseudo-label filtering (PPF) and adaptive pseudo-label selection (APS). Specifically, PPF discards confident but mismatched detection progressively at the early training stage when the trained detector is biased towards the base classes, APS instead fuses class-aware and class-agnostic pseudo labels by prioritizing class-aware pseudo labels at the late training stage when the detector can better recognize novel classes. Without bells and whistles, DCS achieves superior detection performance over two open-vocabulary detection benchmarks.
author2 College of Computing and Data Science
author_facet College of Computing and Data Science
Zhang, Hanlue
Guan, Dayan
Ke, Xiangrui
El Saddik, Abdulmotaleb
Lu, Shijian
format Article
author Zhang, Hanlue
Guan, Dayan
Ke, Xiangrui
El Saddik, Abdulmotaleb
Lu, Shijian
author_sort Zhang, Hanlue
title Open-vocabulary object detection via debiased curriculum self-training
title_short Open-vocabulary object detection via debiased curriculum self-training
title_full Open-vocabulary object detection via debiased curriculum self-training
title_fullStr Open-vocabulary object detection via debiased curriculum self-training
title_full_unstemmed Open-vocabulary object detection via debiased curriculum self-training
title_sort open-vocabulary object detection via debiased curriculum self-training
publishDate 2024
url https://hdl.handle.net/10356/180718
_version_ 1814777756958851072