Crowd counting via cross-stage refinement networks
Crowd counting is challenging due to unconstrained imaging factors, e.g., background clutters, non-uniform distribution of people, large scale and perspective variations. Dealing with these problems using deep neural networks requires rich prior knowledge and multi-scale contextual representations....
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7848 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8851 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-88512023-06-15T09:00:05Z Crowd counting via cross-stage refinement networks LIU, Yongtuo WEN, Qiang CHEN, Haoxin LIU, Wenxi QIN, Jing HAN, Guoqiang HE, Shengfeng Crowd counting is challenging due to unconstrained imaging factors, e.g., background clutters, non-uniform distribution of people, large scale and perspective variations. Dealing with these problems using deep neural networks requires rich prior knowledge and multi-scale contextual representations. In this paper, we propose a Cross-stage Refinement Network (CRNet) that can refine predicted density maps progressively based on hierarchical multi-level density priors. In particular, CRNet is composed of several fully convolutional networks. They are stacked together recursively with the previous output as the next input, and each of them serves to utilize previous density output to gradually correct prediction errors of crowd areas and refine the predicted density maps at different stages. Cross-stage multi-level density priors are further exploited in our recurrent framework by the cross-stage skip layers based on ConvLSTM. To cope with different challenges of unconstrained crowd scenes, we explore different crowd-specific data augmentation methods to mimic real-world scenarios and enrich crowd feature representations from different aspects. Extensive experiments show the proposed method achieves superior performances against state-of-the-art methods on four widely-used challenging benchmarks in terms of counting accuracy and density map quality. Code and models are available at this https://github.com/lytgftyf/Crowd-Counting-via-Cross-stage-Refinement-Networks. 2020-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/7848 info:doi/10.1109/TIP.2020.2994410 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Feature extraction Convolution Decoding Clutter Benchmark testing Cameras Network architecture Crowd counting recurrent network image refinement Information Security |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Feature extraction Convolution Decoding Clutter Benchmark testing Cameras Network architecture Crowd counting recurrent network image refinement Information Security |
spellingShingle |
Feature extraction Convolution Decoding Clutter Benchmark testing Cameras Network architecture Crowd counting recurrent network image refinement Information Security LIU, Yongtuo WEN, Qiang CHEN, Haoxin LIU, Wenxi QIN, Jing HAN, Guoqiang HE, Shengfeng Crowd counting via cross-stage refinement networks |
description |
Crowd counting is challenging due to unconstrained imaging factors, e.g., background clutters, non-uniform distribution of people, large scale and perspective variations. Dealing with these problems using deep neural networks requires rich prior knowledge and multi-scale contextual representations. In this paper, we propose a Cross-stage Refinement Network (CRNet) that can refine predicted density maps progressively based on hierarchical multi-level density priors. In particular, CRNet is composed of several fully convolutional networks. They are stacked together recursively with the previous output as the next input, and each of them serves to utilize previous density output to gradually correct prediction errors of crowd areas and refine the predicted density maps at different stages. Cross-stage multi-level density priors are further exploited in our recurrent framework by the cross-stage skip layers based on ConvLSTM. To cope with different challenges of unconstrained crowd scenes, we explore different crowd-specific data augmentation methods to mimic real-world scenarios and enrich crowd feature representations from different aspects. Extensive experiments show the proposed method achieves superior performances against state-of-the-art methods on four widely-used challenging benchmarks in terms of counting accuracy and density map quality. Code and models are available at this https://github.com/lytgftyf/Crowd-Counting-via-Cross-stage-Refinement-Networks. |
format |
text |
author |
LIU, Yongtuo WEN, Qiang CHEN, Haoxin LIU, Wenxi QIN, Jing HAN, Guoqiang HE, Shengfeng |
author_facet |
LIU, Yongtuo WEN, Qiang CHEN, Haoxin LIU, Wenxi QIN, Jing HAN, Guoqiang HE, Shengfeng |
author_sort |
LIU, Yongtuo |
title |
Crowd counting via cross-stage refinement networks |
title_short |
Crowd counting via cross-stage refinement networks |
title_full |
Crowd counting via cross-stage refinement networks |
title_fullStr |
Crowd counting via cross-stage refinement networks |
title_full_unstemmed |
Crowd counting via cross-stage refinement networks |
title_sort |
crowd counting via cross-stage refinement networks |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/7848 |
_version_ |
1770576555725029376 |