Glance to count: Learning to rank with anchors for weakly-supervised crowd counting

Crowd image is arguably one of the most laborious data to annotate. In this paper, we devote to reduce the massive demand of densely labeled crowd data, and propose a novel weakly-supervised setting, in which we leverage the binary ranking of two images with highcontrast crowd counts as training gui...

Full description

Saved in:
Bibliographic Details
Main Authors: XIONG, Zheng, CHAI, Liangyu, LIU, Wenxi, LIU, Yongtuo, REN, Sucheng, HE, Shengfeng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8533
https://ink.library.smu.edu.sg/context/sis_research/article/9536/viewcontent/GlancetoCount_av__2_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9536
record_format dspace
spelling sg-smu-ink.sis_research-95362024-01-22T14:57:57Z Glance to count: Learning to rank with anchors for weakly-supervised crowd counting XIONG, Zheng CHAI, Liangyu LIU, Wenxi LIU, Yongtuo REN, Sucheng HE, Shengfeng Crowd image is arguably one of the most laborious data to annotate. In this paper, we devote to reduce the massive demand of densely labeled crowd data, and propose a novel weakly-supervised setting, in which we leverage the binary ranking of two images with highcontrast crowd counts as training guidance. To enable training under this new setting, we convert the crowd count regression problem to a ranking potential prediction problem. In particular, we tailor a Siamese Ranking Network that predicts the potential scores of two images indicating the ordering of the counts. Hence, the ultimate goal is to assign appropriate potentials for all the crowd images to ensure their orderings obey the ranking labels. On the other hand, potentials reveal the relative crowd sizes but cannot yield an exact crowd count. We resolve this problem by introducing “anchors” during the inference stage. Concretely, anchors are a few images with count labels used for referencing the corresponding counts from potential scores by a simple linear mapping function. We conduct extensive experiments to study various combinations of supervision, and we show that the proposed method outperforms existing weakly-supervised methods without additional labeling effort by a large margin. 2024-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8533 https://ink.library.smu.edu.sg/context/sis_research/article/9536/viewcontent/GlancetoCount_av__2_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Crowd Counting Weakly-supervised Learning Ranking Graphics and Human Computer Interfaces Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Crowd Counting
Weakly-supervised Learning
Ranking
Graphics and Human Computer Interfaces
Numerical Analysis and Scientific Computing
spellingShingle Crowd Counting
Weakly-supervised Learning
Ranking
Graphics and Human Computer Interfaces
Numerical Analysis and Scientific Computing
XIONG, Zheng
CHAI, Liangyu
LIU, Wenxi
LIU, Yongtuo
REN, Sucheng
HE, Shengfeng
Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
description Crowd image is arguably one of the most laborious data to annotate. In this paper, we devote to reduce the massive demand of densely labeled crowd data, and propose a novel weakly-supervised setting, in which we leverage the binary ranking of two images with highcontrast crowd counts as training guidance. To enable training under this new setting, we convert the crowd count regression problem to a ranking potential prediction problem. In particular, we tailor a Siamese Ranking Network that predicts the potential scores of two images indicating the ordering of the counts. Hence, the ultimate goal is to assign appropriate potentials for all the crowd images to ensure their orderings obey the ranking labels. On the other hand, potentials reveal the relative crowd sizes but cannot yield an exact crowd count. We resolve this problem by introducing “anchors” during the inference stage. Concretely, anchors are a few images with count labels used for referencing the corresponding counts from potential scores by a simple linear mapping function. We conduct extensive experiments to study various combinations of supervision, and we show that the proposed method outperforms existing weakly-supervised methods without additional labeling effort by a large margin.
format text
author XIONG, Zheng
CHAI, Liangyu
LIU, Wenxi
LIU, Yongtuo
REN, Sucheng
HE, Shengfeng
author_facet XIONG, Zheng
CHAI, Liangyu
LIU, Wenxi
LIU, Yongtuo
REN, Sucheng
HE, Shengfeng
author_sort XIONG, Zheng
title Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
title_short Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
title_full Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
title_fullStr Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
title_full_unstemmed Glance to count: Learning to rank with anchors for weakly-supervised crowd counting
title_sort glance to count: learning to rank with anchors for weakly-supervised crowd counting
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/8533
https://ink.library.smu.edu.sg/context/sis_research/article/9536/viewcontent/GlancetoCount_av__2_.pdf
_version_ 1789483260318842880