Crowd counting via cross-stage refinement networks

Crowd counting is challenging due to unconstrained imaging factors, e.g., background clutters, non-uniform distribution of people, large scale and perspective variations. Dealing with these problems using deep neural networks requires rich prior knowledge and multi-scale contextual representations....

Full description

Saved in:
Bibliographic Details
Main Authors: LIU, Yongtuo, WEN, Qiang, CHEN, Haoxin, LIU, Wenxi, QIN, Jing, HAN, Guoqiang, HE, Shengfeng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7848
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8851
record_format dspace
spelling sg-smu-ink.sis_research-88512023-06-15T09:00:05Z Crowd counting via cross-stage refinement networks LIU, Yongtuo WEN, Qiang CHEN, Haoxin LIU, Wenxi QIN, Jing HAN, Guoqiang HE, Shengfeng Crowd counting is challenging due to unconstrained imaging factors, e.g., background clutters, non-uniform distribution of people, large scale and perspective variations. Dealing with these problems using deep neural networks requires rich prior knowledge and multi-scale contextual representations. In this paper, we propose a Cross-stage Refinement Network (CRNet) that can refine predicted density maps progressively based on hierarchical multi-level density priors. In particular, CRNet is composed of several fully convolutional networks. They are stacked together recursively with the previous output as the next input, and each of them serves to utilize previous density output to gradually correct prediction errors of crowd areas and refine the predicted density maps at different stages. Cross-stage multi-level density priors are further exploited in our recurrent framework by the cross-stage skip layers based on ConvLSTM. To cope with different challenges of unconstrained crowd scenes, we explore different crowd-specific data augmentation methods to mimic real-world scenarios and enrich crowd feature representations from different aspects. Extensive experiments show the proposed method achieves superior performances against state-of-the-art methods on four widely-used challenging benchmarks in terms of counting accuracy and density map quality. Code and models are available at this https://github.com/lytgftyf/Crowd-Counting-via-Cross-stage-Refinement-Networks. 2020-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/7848 info:doi/10.1109/TIP.2020.2994410 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Feature extraction Convolution Decoding Clutter Benchmark testing Cameras Network architecture Crowd counting recurrent network image refinement Information Security
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Feature extraction
Convolution
Decoding
Clutter
Benchmark testing
Cameras
Network architecture
Crowd counting
recurrent network
image refinement
Information Security
spellingShingle Feature extraction
Convolution
Decoding
Clutter
Benchmark testing
Cameras
Network architecture
Crowd counting
recurrent network
image refinement
Information Security
LIU, Yongtuo
WEN, Qiang
CHEN, Haoxin
LIU, Wenxi
QIN, Jing
HAN, Guoqiang
HE, Shengfeng
Crowd counting via cross-stage refinement networks
description Crowd counting is challenging due to unconstrained imaging factors, e.g., background clutters, non-uniform distribution of people, large scale and perspective variations. Dealing with these problems using deep neural networks requires rich prior knowledge and multi-scale contextual representations. In this paper, we propose a Cross-stage Refinement Network (CRNet) that can refine predicted density maps progressively based on hierarchical multi-level density priors. In particular, CRNet is composed of several fully convolutional networks. They are stacked together recursively with the previous output as the next input, and each of them serves to utilize previous density output to gradually correct prediction errors of crowd areas and refine the predicted density maps at different stages. Cross-stage multi-level density priors are further exploited in our recurrent framework by the cross-stage skip layers based on ConvLSTM. To cope with different challenges of unconstrained crowd scenes, we explore different crowd-specific data augmentation methods to mimic real-world scenarios and enrich crowd feature representations from different aspects. Extensive experiments show the proposed method achieves superior performances against state-of-the-art methods on four widely-used challenging benchmarks in terms of counting accuracy and density map quality. Code and models are available at this https://github.com/lytgftyf/Crowd-Counting-via-Cross-stage-Refinement-Networks.
format text
author LIU, Yongtuo
WEN, Qiang
CHEN, Haoxin
LIU, Wenxi
QIN, Jing
HAN, Guoqiang
HE, Shengfeng
author_facet LIU, Yongtuo
WEN, Qiang
CHEN, Haoxin
LIU, Wenxi
QIN, Jing
HAN, Guoqiang
HE, Shengfeng
author_sort LIU, Yongtuo
title Crowd counting via cross-stage refinement networks
title_short Crowd counting via cross-stage refinement networks
title_full Crowd counting via cross-stage refinement networks
title_fullStr Crowd counting via cross-stage refinement networks
title_full_unstemmed Crowd counting via cross-stage refinement networks
title_sort crowd counting via cross-stage refinement networks
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/7848
_version_ 1770576555725029376