Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
Crowd image is arguably one of the most laborious data to annotate. In this paper, we devote to reduce the massive demand of densely labeled crowd data, and propose a novel weakly-supervised setting, in which we leverage the binary ranking of two images with highcontrast crowd counts as training gui...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8533 https://ink.library.smu.edu.sg/context/sis_research/article/9536/viewcontent/GlancetoCount_av__2_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9536 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-95362024-01-22T14:57:57Z Glance to count: Learning to rank with anchors for weakly-supervised crowd counting XIONG, Zheng CHAI, Liangyu LIU, Wenxi LIU, Yongtuo REN, Sucheng HE, Shengfeng Crowd image is arguably one of the most laborious data to annotate. In this paper, we devote to reduce the massive demand of densely labeled crowd data, and propose a novel weakly-supervised setting, in which we leverage the binary ranking of two images with highcontrast crowd counts as training guidance. To enable training under this new setting, we convert the crowd count regression problem to a ranking potential prediction problem. In particular, we tailor a Siamese Ranking Network that predicts the potential scores of two images indicating the ordering of the counts. Hence, the ultimate goal is to assign appropriate potentials for all the crowd images to ensure their orderings obey the ranking labels. On the other hand, potentials reveal the relative crowd sizes but cannot yield an exact crowd count. We resolve this problem by introducing “anchors” during the inference stage. Concretely, anchors are a few images with count labels used for referencing the corresponding counts from potential scores by a simple linear mapping function. We conduct extensive experiments to study various combinations of supervision, and we show that the proposed method outperforms existing weakly-supervised methods without additional labeling effort by a large margin. 2024-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8533 https://ink.library.smu.edu.sg/context/sis_research/article/9536/viewcontent/GlancetoCount_av__2_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Crowd Counting Weakly-supervised Learning Ranking Graphics and Human Computer Interfaces Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Crowd Counting Weakly-supervised Learning Ranking Graphics and Human Computer Interfaces Numerical Analysis and Scientific Computing |
spellingShingle |
Crowd Counting Weakly-supervised Learning Ranking Graphics and Human Computer Interfaces Numerical Analysis and Scientific Computing XIONG, Zheng CHAI, Liangyu LIU, Wenxi LIU, Yongtuo REN, Sucheng HE, Shengfeng Glance to count: Learning to rank with anchors for weakly-supervised crowd counting |
description |
Crowd image is arguably one of the most laborious data to annotate. In this paper, we devote to reduce the massive demand of densely labeled crowd data, and propose a novel weakly-supervised setting, in which we leverage the binary ranking of two images with highcontrast crowd counts as training guidance. To enable training under this new setting, we convert the crowd count regression problem to a ranking potential prediction problem. In particular, we tailor a Siamese Ranking Network that predicts the potential scores of two images indicating the ordering of the counts. Hence, the ultimate goal is to assign appropriate potentials for all the crowd images to ensure their orderings obey the ranking labels. On the other hand, potentials reveal the relative crowd sizes but cannot yield an exact crowd count. We resolve this problem by introducing “anchors” during the inference stage. Concretely, anchors are a few images with count labels used for referencing the corresponding counts from potential scores by a simple linear mapping function. We conduct extensive experiments to study various combinations of supervision, and we show that the proposed method outperforms existing weakly-supervised methods without additional labeling effort by a large margin. |
format |
text |
author |
XIONG, Zheng CHAI, Liangyu LIU, Wenxi LIU, Yongtuo REN, Sucheng HE, Shengfeng |
author_facet |
XIONG, Zheng CHAI, Liangyu LIU, Wenxi LIU, Yongtuo REN, Sucheng HE, Shengfeng |
author_sort |
XIONG, Zheng |
title |
Glance to count: Learning to rank with anchors for weakly-supervised crowd counting |
title_short |
Glance to count: Learning to rank with anchors for weakly-supervised crowd counting |
title_full |
Glance to count: Learning to rank with anchors for weakly-supervised crowd counting |
title_fullStr |
Glance to count: Learning to rank with anchors for weakly-supervised crowd counting |
title_full_unstemmed |
Glance to count: Learning to rank with anchors for weakly-supervised crowd counting |
title_sort |
glance to count: learning to rank with anchors for weakly-supervised crowd counting |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/8533 https://ink.library.smu.edu.sg/context/sis_research/article/9536/viewcontent/GlancetoCount_av__2_.pdf |
_version_ |
1789483260318842880 |