Class re-activation maps for weakly-supervised semantic segmentation
Extracting class activation maps (CAM) is arguably the most standard step of generating pseudo masks for weakly supervised semantic segmentation (WSSS). Yet, we find that the crux of the unsatisfactory pseudo masks is the binary cross-entropy loss (BCE) widely used in CAM. Specifically, due to the sum...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7511 https://ink.library.smu.edu.sg/context/sis_research/article/8514/viewcontent/Chen_Class_Re_Activation_Maps_for_Weakly_Supervised_Semantic_Segmentation_CVPR_2022_paper.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8514 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-85142022-11-18T07:46:11Z Class re-activation maps for weakly-supervised semantic segmentation CHEN, Zhaozheng WANG, Tan WU, Xiongwei HUA, Xian-Sheng ZHANG, Hanwang SUN, Qianru Extracting class activation maps (CAM) is arguably the most standard step of generating pseudo masks for weakly supervised semantic segmentation (WSSS). Yet, we find that the crux of the unsatisfactory pseudo masks is the binary cross-entropy loss (BCE) widely used in CAM. Specifically, due to the sum-over-class pooling nature of BCE, each pixel in CAM may be responsive to multiple classes co-occurring in the same receptive field. To this end, we introduce an embarrassingly simple yet surprisingly effective method: Reactivating the converged CAM with BCE by using softmax crossentropy loss (SCE), dubbed ReCAM. Given an image, we use CAM to extract the feature pixels of each single class, and use them with the class label to learn another fully-connected layer (after the backbone) with SCE. Once converged, we extract ReCAM in the same way as in CAM. 2022-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7511 info:doi/10.1109/CVPR52688.2022.00104 https://ink.library.smu.edu.sg/context/sis_research/article/8514/viewcontent/Chen_Class_Re_Activation_Maps_for_Weakly_Supervised_Semantic_Segmentation_CVPR_2022_paper.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University class activation maps weakly supervised learning semantic segmentation Databases and Information Systems Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
class activation maps weakly supervised learning semantic segmentation Databases and Information Systems Graphics and Human Computer Interfaces |
spellingShingle |
class activation maps weakly supervised learning semantic segmentation Databases and Information Systems Graphics and Human Computer Interfaces CHEN, Zhaozheng WANG, Tan WU, Xiongwei HUA, Xian-Sheng ZHANG, Hanwang SUN, Qianru Class re-activation maps for weakly-supervised semantic segmentation |
description |
Extracting class activation maps (CAM) is arguably the most standard step of generating pseudo masks for weakly supervised semantic segmentation (WSSS). Yet, we find that the crux of the unsatisfactory pseudo masks is the binary cross-entropy loss (BCE) widely used in CAM. Specifically, due to the sum-over-class pooling nature of BCE, each pixel in CAM may be responsive to multiple classes co-occurring in the same receptive field. To this end, we introduce an embarrassingly simple yet surprisingly effective method: Reactivating the converged CAM with BCE by using softmax crossentropy loss (SCE), dubbed ReCAM. Given an image, we use CAM to extract the feature pixels of each single class, and use them with the class label to learn another fully-connected layer (after the backbone) with SCE. Once converged, we extract ReCAM in the same way as in CAM. |
format |
text |
author |
CHEN, Zhaozheng WANG, Tan WU, Xiongwei HUA, Xian-Sheng ZHANG, Hanwang SUN, Qianru |
author_facet |
CHEN, Zhaozheng WANG, Tan WU, Xiongwei HUA, Xian-Sheng ZHANG, Hanwang SUN, Qianru |
author_sort |
CHEN, Zhaozheng |
title |
Class re-activation maps for weakly-supervised semantic segmentation |
title_short |
Class re-activation maps for weakly-supervised semantic segmentation |
title_full |
Class re-activation maps for weakly-supervised semantic segmentation |
title_fullStr |
Class re-activation maps for weakly-supervised semantic segmentation |
title_full_unstemmed |
Class re-activation maps for weakly-supervised semantic segmentation |
title_sort |
class re-activation maps for weakly-supervised semantic segmentation |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7511 https://ink.library.smu.edu.sg/context/sis_research/article/8514/viewcontent/Chen_Class_Re_Activation_Maps_for_Weakly_Supervised_Semantic_Segmentation_CVPR_2022_paper.pdf |
_version_ |
1770576361003417600 |