Fast OWDETR: transformer for open world object detection
Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) an...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/162462 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-162462 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1624622023-07-04T17:50:11Z Fast OWDETR: transformer for open world object detection Chen, Xuanying Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) and Open-world Detection Transformer (OWDETR) are two methodologies proposed to address the open world task, while they are both time-consuming in training and with shortcomings. Aiming to improve the training speed and detection performance, we propose Fast OWDETR based on OWDETR which is a transformer-based approach. Speci cally, we replace the attentiondriven pseudo labeling mechanism in OWDETR with a logits-based one, and change the standard Deformable DETR into Deformable DETR with box re nement. For shorter transferring time between tasks, we present an incremental learning approach which dynamically reduces the number of trainable parameters in the classi cation head while keeping the backbone frozen after initial training. Our extensive experiments show that Fast OWDETR can achieve detection performance comparable with OWDETR while using less training time within tasks and between tasks. Master of Science (Computer Control and Automation) 2022-10-21T04:45:27Z 2022-10-21T04:45:27Z 2022 Thesis-Master by Coursework Chen, X. (2022). Fast OWDETR: transformer for open world object detection. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/162462 https://hdl.handle.net/10356/162462 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Chen, Xuanying Fast OWDETR: transformer for open world object detection |
description |
Object detection is one of the basic computer vision tasks. Recently, a more
challenging task, called open world object detection, which aims to identify
novel unknown objects and incrementally learn to classify them when labels
are available has been proposed. Open World Object Detector (ORE) and
Open-world Detection Transformer (OWDETR) are two methodologies proposed
to address the open world task, while they are both time-consuming
in training and with shortcomings. Aiming to improve the training speed
and detection performance, we propose Fast OWDETR based on OWDETR
which is a transformer-based approach. Speci cally, we replace the attentiondriven
pseudo labeling mechanism in OWDETR with a logits-based one, and
change the standard Deformable DETR into Deformable DETR with box
re nement. For shorter transferring time between tasks, we present an incremental
learning approach which dynamically reduces the number of trainable
parameters in the classi cation head while keeping the backbone frozen after
initial training. Our extensive experiments show that Fast OWDETR can
achieve detection performance comparable with OWDETR while using less
training time within tasks and between tasks. |
author2 |
Tan Yap Peng |
author_facet |
Tan Yap Peng Chen, Xuanying |
format |
Thesis-Master by Coursework |
author |
Chen, Xuanying |
author_sort |
Chen, Xuanying |
title |
Fast OWDETR: transformer for open world object detection |
title_short |
Fast OWDETR: transformer for open world object detection |
title_full |
Fast OWDETR: transformer for open world object detection |
title_fullStr |
Fast OWDETR: transformer for open world object detection |
title_full_unstemmed |
Fast OWDETR: transformer for open world object detection |
title_sort |
fast owdetr: transformer for open world object detection |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/162462 |
_version_ |
1772826414590984192 |