Weakly supervised segmentation with maximum bipartite graph matching
In the weakly supervised segmentation task with only image-level labels, a common step in many existing algorithms is first to locate the image regions corresponding to each existing class with the Class Activation Maps (CAMs), and then generate the pseudo ground truth masks based on the CAMs to tra...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article Conference or Workshop Item |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/151821 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-151821 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1518212021-09-29T07:23:41Z Weakly supervised segmentation with maximum bipartite graph matching Liu, Weide Zhang, Chi Lin, Guosheng Hung, Tzu-Yi Miao, Chunyan School of Computer Science and Engineering MM '20: Proceedings of the 28th ACM International Conference on Multimedia Engineering::Computer science and engineering Computer Vision Graph Matching In the weakly supervised segmentation task with only image-level labels, a common step in many existing algorithms is first to locate the image regions corresponding to each existing class with the Class Activation Maps (CAMs), and then generate the pseudo ground truth masks based on the CAMs to train a segmentation network in the fully supervised manner. The quality of the CAMs has a crucial impact on the performance of the segmentation model. We propose to improve the CAMs from a novel graph perspective. We model paired images containing common classes with a bipartite graph and use the maximum matching algorithm to locate corresponding areas in two images. The matching areas are then used to refine the predicted object regions in the CAMs. The experiments on Pascal VOC 2012 dataset show that our network can effectively boost the performance of the baseline model and achieves new state-of-the-art performance. AI Singapore Ministry of Education (MOE) National Research Foundation (NRF) This work is supported by the Delta-NTU Corporate Lab with funding support from Delta Electronics Inc. and the National Research Foundation (NRF) Singapore (SMA-RP10). This work is also partly supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2018-003) and the MOE Tier-1 research grants: RG126/17 (S), RG28/18 (S) and RG22/19 (S). 2021-09-29T07:11:13Z 2021-09-29T07:11:13Z 2020 Journal Article Conference Paper Liu, W., Zhang, C., Lin, G., Hung, T. & Miao, C. (2020). Weakly supervised segmentation with maximum bipartite graph matching. MM '20: Proceedings of the 28th ACM International Conference on Multimedia, 2085-2094. https://dx.doi.org/10.1145/3394171.3413652 9781450379885 https://hdl.handle.net/10356/151821 10.1145/3394171.3413652 2-s2.0-85101013164 2085 2094 en SMA-RP10 AISG-RP-2018-003 RG126/17 (S) RG28/18 (S) RG22/19 (S) © 2020 Association for Computing Machinery. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Computer Vision Graph Matching |
spellingShingle |
Engineering::Computer science and engineering Computer Vision Graph Matching Liu, Weide Zhang, Chi Lin, Guosheng Hung, Tzu-Yi Miao, Chunyan Weakly supervised segmentation with maximum bipartite graph matching |
description |
In the weakly supervised segmentation task with only image-level labels, a common step in many existing algorithms is first to locate the image regions corresponding to each existing class with the Class Activation Maps (CAMs), and then generate the pseudo ground truth masks based on the CAMs to train a segmentation network in the fully supervised manner. The quality of the CAMs has a crucial impact on the performance of the segmentation model. We propose to improve the CAMs from a novel graph perspective. We model paired images containing common classes with a bipartite graph and use the maximum matching algorithm to locate corresponding areas in two images. The matching areas are then used to refine the predicted object regions in the CAMs. The experiments on Pascal VOC 2012 dataset show that our network can effectively boost the performance of the baseline model and achieves new state-of-the-art performance. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Liu, Weide Zhang, Chi Lin, Guosheng Hung, Tzu-Yi Miao, Chunyan |
format |
Article Conference or Workshop Item |
author |
Liu, Weide Zhang, Chi Lin, Guosheng Hung, Tzu-Yi Miao, Chunyan |
author_sort |
Liu, Weide |
title |
Weakly supervised segmentation with maximum bipartite graph matching |
title_short |
Weakly supervised segmentation with maximum bipartite graph matching |
title_full |
Weakly supervised segmentation with maximum bipartite graph matching |
title_fullStr |
Weakly supervised segmentation with maximum bipartite graph matching |
title_full_unstemmed |
Weakly supervised segmentation with maximum bipartite graph matching |
title_sort |
weakly supervised segmentation with maximum bipartite graph matching |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/151821 |
_version_ |
1712300642977972224 |