Fast OWDETR: transformer for open world object detection

Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) an...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Chen, Xuanying
مؤلفون آخرون:	Tan Yap Peng
التنسيق:	Thesis-Master by Coursework
اللغة:	English
منشور في:	Nanyang Technological University 2022
الموضوعات:	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/162462
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Nanyang Technological University
اللغة:	English

id	sg-ntu-dr.10356-162462
record_format	dspace
spelling	sg-ntu-dr.10356-1624622023-07-04T17:50:11Z Fast OWDETR: transformer for open world object detection Chen, Xuanying Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) and Open-world Detection Transformer (OWDETR) are two methodologies proposed to address the open world task, while they are both time-consuming in training and with shortcomings. Aiming to improve the training speed and detection performance, we propose Fast OWDETR based on OWDETR which is a transformer-based approach. Speci cally, we replace the attentiondriven pseudo labeling mechanism in OWDETR with a logits-based one, and change the standard Deformable DETR into Deformable DETR with box re nement. For shorter transferring time between tasks, we present an incremental learning approach which dynamically reduces the number of trainable parameters in the classi cation head while keeping the backbone frozen after initial training. Our extensive experiments show that Fast OWDETR can achieve detection performance comparable with OWDETR while using less training time within tasks and between tasks. Master of Science (Computer Control and Automation) 2022-10-21T04:45:27Z 2022-10-21T04:45:27Z 2022 Thesis-Master by Coursework Chen, X. (2022). Fast OWDETR: transformer for open world object detection. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/162462 https://hdl.handle.net/10356/162462 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Chen, Xuanying Fast OWDETR: transformer for open world object detection
description	Object detection is one of the basic computer vision tasks. Recently, a more challenging task, called open world object detection, which aims to identify novel unknown objects and incrementally learn to classify them when labels are available has been proposed. Open World Object Detector (ORE) and Open-world Detection Transformer (OWDETR) are two methodologies proposed to address the open world task, while they are both time-consuming in training and with shortcomings. Aiming to improve the training speed and detection performance, we propose Fast OWDETR based on OWDETR which is a transformer-based approach. Speci cally, we replace the attentiondriven pseudo labeling mechanism in OWDETR with a logits-based one, and change the standard Deformable DETR into Deformable DETR with box re nement. For shorter transferring time between tasks, we present an incremental learning approach which dynamically reduces the number of trainable parameters in the classi cation head while keeping the backbone frozen after initial training. Our extensive experiments show that Fast OWDETR can achieve detection performance comparable with OWDETR while using less training time within tasks and between tasks.
author2	Tan Yap Peng
author_facet	Tan Yap Peng Chen, Xuanying
format	Thesis-Master by Coursework
author	Chen, Xuanying
author_sort	Chen, Xuanying
title	Fast OWDETR: transformer for open world object detection
title_short	Fast OWDETR: transformer for open world object detection
title_full	Fast OWDETR: transformer for open world object detection
title_fullStr	Fast OWDETR: transformer for open world object detection
title_full_unstemmed	Fast OWDETR: transformer for open world object detection
title_sort	fast owdetr: transformer for open world object detection
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/162462
_version_	1772826414590984192

Fast OWDETR: transformer for open world object detection

مواد مشابهة