Efficient few-shot object detection via knowledge inheritance
Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/169113 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-169113 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1691132023-07-02T23:55:54Z Efficient few-shot object detection via knowledge inheritance Yang, Ze Zhang, Chi Li, Ruibo Xu, Yi Lin, Guosheng School of Computer Science and Engineering Engineering::Computer science and engineering Transfer Learning Meta Learning Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation speed. Notably, efficiency has become an increasingly important evaluation metric for few-shot techniques due to an emerging trend toward embedded AI. To this end, we present an efficient pretrain-transfer framework (PTF) baseline with no computational increment, which achieves comparable results with previous state-of-the-art (SOTA) methods. Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed. Within the KI initializer, we propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights. Finally, our approach not only achieves the SOTA results across three public benchmarks, i.e., PASCAL VOC, COCO and LVIS, but also exhibits high efficiency with 1.8-100× faster adaptation speed against the other methods on COCO/LVIS benchmark during few-shot transfer. To our best knowledge, this is the first work to consider the efficiency problem in FSOD. We hope to motivate a trend toward powerful yet efficient few-shot technique development. The codes are publicly available at https://github.com/Ze-Yang/Efficient-FSOD. Ministry of Education (MOE) National Research Foundation (NRF) This work was supported in part by the National Research Foundation, Singapore, under its AI Singapore Programme through AISG under Award AISG-RP-2018-003; in part by the Ministry of Education (MOE) Academic Research Fund (AcRF) Tier-1 Research Grant RG95/20; and in part by the OPPO Research Grant. 2023-06-30T08:23:22Z 2023-06-30T08:23:22Z 2023 Journal Article Yang, Z., Zhang, C., Li, R., Xu, Y. & Lin, G. (2023). Efficient few-shot object detection via knowledge inheritance. IEEE Transactions On Image Processing, 32, 321-334. https://dx.doi.org/10.1109/TIP.2022.3228162 1057-7149 https://hdl.handle.net/10356/169113 10.1109/TIP.2022.3228162 37015553 2-s2.0-85144787373 32 321 334 en AISG-RP-2018-003 IEEE Transactions on Image Processing © 2022 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Transfer Learning Meta Learning |
spellingShingle |
Engineering::Computer science and engineering Transfer Learning Meta Learning Yang, Ze Zhang, Chi Li, Ruibo Xu, Yi Lin, Guosheng Efficient few-shot object detection via knowledge inheritance |
description |
Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation speed. Notably, efficiency has become an increasingly important evaluation metric for few-shot techniques due to an emerging trend toward embedded AI. To this end, we present an efficient pretrain-transfer framework (PTF) baseline with no computational increment, which achieves comparable results with previous state-of-the-art (SOTA) methods. Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed. Within the KI initializer, we propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights. Finally, our approach not only achieves the SOTA results across three public benchmarks, i.e., PASCAL VOC, COCO and LVIS, but also exhibits high efficiency with 1.8-100× faster adaptation speed against the other methods on COCO/LVIS benchmark during few-shot transfer. To our best knowledge, this is the first work to consider the efficiency problem in FSOD. We hope to motivate a trend toward powerful yet efficient few-shot technique development. The codes are publicly available at https://github.com/Ze-Yang/Efficient-FSOD. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Yang, Ze Zhang, Chi Li, Ruibo Xu, Yi Lin, Guosheng |
format |
Article |
author |
Yang, Ze Zhang, Chi Li, Ruibo Xu, Yi Lin, Guosheng |
author_sort |
Yang, Ze |
title |
Efficient few-shot object detection via knowledge inheritance |
title_short |
Efficient few-shot object detection via knowledge inheritance |
title_full |
Efficient few-shot object detection via knowledge inheritance |
title_fullStr |
Efficient few-shot object detection via knowledge inheritance |
title_full_unstemmed |
Efficient few-shot object detection via knowledge inheritance |
title_sort |
efficient few-shot object detection via knowledge inheritance |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/169113 |
_version_ |
1772827981412040704 |