Efficient few-shot object detection via knowledge inheritance

Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation...

Full description

Saved in:
Bibliographic Details
Main Authors: Yang, Ze, Zhang, Chi, Li, Ruibo, Xu, Yi, Lin, Guosheng
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/169113
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-169113
record_format dspace
spelling sg-ntu-dr.10356-1691132023-07-02T23:55:54Z Efficient few-shot object detection via knowledge inheritance Yang, Ze Zhang, Chi Li, Ruibo Xu, Yi Lin, Guosheng School of Computer Science and Engineering Engineering::Computer science and engineering Transfer Learning Meta Learning Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation speed. Notably, efficiency has become an increasingly important evaluation metric for few-shot techniques due to an emerging trend toward embedded AI. To this end, we present an efficient pretrain-transfer framework (PTF) baseline with no computational increment, which achieves comparable results with previous state-of-the-art (SOTA) methods. Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed. Within the KI initializer, we propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights. Finally, our approach not only achieves the SOTA results across three public benchmarks, i.e., PASCAL VOC, COCO and LVIS, but also exhibits high efficiency with 1.8-100× faster adaptation speed against the other methods on COCO/LVIS benchmark during few-shot transfer. To our best knowledge, this is the first work to consider the efficiency problem in FSOD. We hope to motivate a trend toward powerful yet efficient few-shot technique development. The codes are publicly available at https://github.com/Ze-Yang/Efficient-FSOD. Ministry of Education (MOE) National Research Foundation (NRF) This work was supported in part by the National Research Foundation, Singapore, under its AI Singapore Programme through AISG under Award AISG-RP-2018-003; in part by the Ministry of Education (MOE) Academic Research Fund (AcRF) Tier-1 Research Grant RG95/20; and in part by the OPPO Research Grant. 2023-06-30T08:23:22Z 2023-06-30T08:23:22Z 2023 Journal Article Yang, Z., Zhang, C., Li, R., Xu, Y. & Lin, G. (2023). Efficient few-shot object detection via knowledge inheritance. IEEE Transactions On Image Processing, 32, 321-334. https://dx.doi.org/10.1109/TIP.2022.3228162 1057-7149 https://hdl.handle.net/10356/169113 10.1109/TIP.2022.3228162 37015553 2-s2.0-85144787373 32 321 334 en AISG-RP-2018-003 IEEE Transactions on Image Processing © 2022 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Transfer Learning
Meta Learning
spellingShingle Engineering::Computer science and engineering
Transfer Learning
Meta Learning
Yang, Ze
Zhang, Chi
Li, Ruibo
Xu, Yi
Lin, Guosheng
Efficient few-shot object detection via knowledge inheritance
description Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation speed. Notably, efficiency has become an increasingly important evaluation metric for few-shot techniques due to an emerging trend toward embedded AI. To this end, we present an efficient pretrain-transfer framework (PTF) baseline with no computational increment, which achieves comparable results with previous state-of-the-art (SOTA) methods. Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed. Within the KI initializer, we propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights. Finally, our approach not only achieves the SOTA results across three public benchmarks, i.e., PASCAL VOC, COCO and LVIS, but also exhibits high efficiency with 1.8-100× faster adaptation speed against the other methods on COCO/LVIS benchmark during few-shot transfer. To our best knowledge, this is the first work to consider the efficiency problem in FSOD. We hope to motivate a trend toward powerful yet efficient few-shot technique development. The codes are publicly available at https://github.com/Ze-Yang/Efficient-FSOD.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Yang, Ze
Zhang, Chi
Li, Ruibo
Xu, Yi
Lin, Guosheng
format Article
author Yang, Ze
Zhang, Chi
Li, Ruibo
Xu, Yi
Lin, Guosheng
author_sort Yang, Ze
title Efficient few-shot object detection via knowledge inheritance
title_short Efficient few-shot object detection via knowledge inheritance
title_full Efficient few-shot object detection via knowledge inheritance
title_fullStr Efficient few-shot object detection via knowledge inheritance
title_full_unstemmed Efficient few-shot object detection via knowledge inheritance
title_sort efficient few-shot object detection via knowledge inheritance
publishDate 2023
url https://hdl.handle.net/10356/169113
_version_ 1772827981412040704