Semi-supervised and long-tailed object detection with Cascadematch

This paper focuses on long-tailed object detection in the semi-supervised learning setting, which poses realistic challenges, but has rarely been studied in the literature. We propose a novel pseudo-labeling-based detector called CascadeMatch. Our detector features a cascade network architecture, wh...

Full description

Saved in:
Bibliographic Details
Main Authors: Zang, Yuhang, Zhou, Kaiyang, Huang, Chen, Loy, Chen Change
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/165791
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-165791
record_format dspace
spelling sg-ntu-dr.10356-1657912023-04-14T15:36:08Z Semi-supervised and long-tailed object detection with Cascadematch Zang, Yuhang Zhou, Kaiyang Huang, Chen Loy, Chen Change School of Computer Science and Engineering S-Lab, NTU Engineering::Computer science and engineering Object Detection Long-Tailed Learning This paper focuses on long-tailed object detection in the semi-supervised learning setting, which poses realistic challenges, but has rarely been studied in the literature. We propose a novel pseudo-labeling-based detector called CascadeMatch. Our detector features a cascade network architecture, which has multi-stage detection heads with progressive confidence thresholds. To avoid manually tuning the thresholds, we design a new adaptive pseudo-label mining mechanism to automatically identify suitable values from data. To mitigate confirmation bias, where a model is negatively reinforced by incorrect pseudo-labels produced by itself, each detection head is trained by the ensemble pseudo-labels of all detection heads. Experiments on two long-tailed datasets, i.e., LVIS and COCO-LT, demonstrate that CascadeMatch surpasses existing state-of-the-art semi-supervised approaches—across a wide range of detection architectures—in handling long-tailed object detection. For instance, CascadeMatch outperforms Unbiased Teacher by 1.9 AP Fix on LVIS when using a ResNet50-based Cascade R-CNN structure, and by 1.7 AP Fix when using Sparse R-CNN with a Transformer encoder. We also show that CascadeMatch can even handle the challenging sparsely annotated object detection problem. Code: https://github.com/yuhangzang/CascadeMatch. Ministry of Education (MOE) Nanyang Technological University Submitted/Accepted version This study is supported under the RIE2020 Industry Alignment Fund Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s). It is also partly supported by the NTU NAP grant and Singapore MOE AcRF Tier 2 (MOE-T2EP20120-0001). 2023-04-10T08:33:03Z 2023-04-10T08:33:03Z 2023 Journal Article Zang, Y., Zhou, K., Huang, C. & Loy, C. C. (2023). Semi-supervised and long-tailed object detection with Cascadematch. International Journal of Computer Vision, 131(4), 987-1001. https://dx.doi.org/10.1007/s11263-022-01738-x 0920-5691 https://hdl.handle.net/10356/165791 10.1007/s11263-022-01738-x 2-s2.0-85145831630 4 131 987 1001 en MOE-T2EP20120-0001 NTU NAP International Journal of Computer Vision © 2023 The Author(s), under exclusive licence to Springer Science + Business Media, LLC, part of Springer Nature. All rights reserved. This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s11263-022-01738-x application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Object Detection
Long-Tailed Learning
spellingShingle Engineering::Computer science and engineering
Object Detection
Long-Tailed Learning
Zang, Yuhang
Zhou, Kaiyang
Huang, Chen
Loy, Chen Change
Semi-supervised and long-tailed object detection with Cascadematch
description This paper focuses on long-tailed object detection in the semi-supervised learning setting, which poses realistic challenges, but has rarely been studied in the literature. We propose a novel pseudo-labeling-based detector called CascadeMatch. Our detector features a cascade network architecture, which has multi-stage detection heads with progressive confidence thresholds. To avoid manually tuning the thresholds, we design a new adaptive pseudo-label mining mechanism to automatically identify suitable values from data. To mitigate confirmation bias, where a model is negatively reinforced by incorrect pseudo-labels produced by itself, each detection head is trained by the ensemble pseudo-labels of all detection heads. Experiments on two long-tailed datasets, i.e., LVIS and COCO-LT, demonstrate that CascadeMatch surpasses existing state-of-the-art semi-supervised approaches—across a wide range of detection architectures—in handling long-tailed object detection. For instance, CascadeMatch outperforms Unbiased Teacher by 1.9 AP Fix on LVIS when using a ResNet50-based Cascade R-CNN structure, and by 1.7 AP Fix when using Sparse R-CNN with a Transformer encoder. We also show that CascadeMatch can even handle the challenging sparsely annotated object detection problem. Code: https://github.com/yuhangzang/CascadeMatch.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Zang, Yuhang
Zhou, Kaiyang
Huang, Chen
Loy, Chen Change
format Article
author Zang, Yuhang
Zhou, Kaiyang
Huang, Chen
Loy, Chen Change
author_sort Zang, Yuhang
title Semi-supervised and long-tailed object detection with Cascadematch
title_short Semi-supervised and long-tailed object detection with Cascadematch
title_full Semi-supervised and long-tailed object detection with Cascadematch
title_fullStr Semi-supervised and long-tailed object detection with Cascadematch
title_full_unstemmed Semi-supervised and long-tailed object detection with Cascadematch
title_sort semi-supervised and long-tailed object detection with cascadematch
publishDate 2023
url https://hdl.handle.net/10356/165791
_version_ 1764208090751696896