Fast OWDETR: transformer for open world object detection

Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) an...

全面介紹

Saved in:
書目詳細資料
主要作者: Chen, Xuanying
其他作者: Tan Yap Peng
格式: Thesis-Master by Coursework
語言:English
出版: Nanyang Technological University 2022
主題:
在線閱讀:https://hdl.handle.net/10356/162462
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) and Open-world Detection Transformer (OWDETR) are two methodologies proposed to address the open world task, while they are both time-consuming in training and with shortcomings. Aiming to improve the training speed and detection performance, we propose Fast OWDETR based on OWDETR which is a transformer-based approach. Speci cally, we replace the attentiondriven pseudo labeling mechanism in OWDETR with a logits-based one, and change the standard Deformable DETR into Deformable DETR with box re nement. For shorter transferring time between tasks, we present an incremental learning approach which dynamically reduces the number of trainable parameters in the classi cation head while keeping the backbone frozen after initial training. Our extensive experiments show that Fast OWDETR can achieve detection performance comparable with OWDETR while using less training time within tasks and between tasks.