Multi-resolution attention convolutional neural network for crowd counting
Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task,...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/144965 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-144965 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1449652021-02-03T05:18:27Z Multi-resolution attention convolutional neural network for crowd counting Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task, we exploit an additional density-level classification task during training and combine features learned for the two tasks, thus forming multi-scale, multi-contextual features to cope with the scale variation and non-uniform distribution. Besides, we utilize a multi-resolution attention (MRA) model to generate score maps, where head locations are with higher scores to guide the network to focus on head regions and suppress non-head regions regardless of the complex backgrounds. During the generation of score maps, atrous convolution layers are used to expand the receptive field with fewer parameters, thus getting higher-level features and providing the MRA model more comprehensive information. Experiments on ShanghaiTech, WorldExpo’10 and UCF datasets demonstrate the effectiveness of our method. Info-communications Media Development Authority (IMDA) Accepted version This work was supported in part by the National Natural Science Foundation of China under Grant 61673244, Grant 61273277 and Grant 61703240) and was carried out at the Rapid-Rich Object Search (ROSE) Lab at the Nanyang Technological University, Singapore. The ROSE Lab is supported by the Infocomm Media Development Authority, Singapore. 2020-12-07T03:43:20Z 2020-12-07T03:43:20Z 2019 Journal Article Zhang, Y., Zhou, C., Chang, F., & Kot, A. C. (2019). Multi-resolution attention convolutional neural network for crowd counting. Neurocomputing, 329, 144–152. doi:10.1016/j.neucom.2018.10.058 0925-2312 https://hdl.handle.net/10356/144965 10.1016/j.neucom.2018.10.058 329 144 152 en Neurocomputing © 2018 Elsevier B.V. All rights reserved. This paper was published in Neurocomputing and is made available with permission of Elsevier B.V. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model |
spellingShingle |
Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung Multi-resolution attention convolutional neural network for crowd counting |
description |
Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task, we exploit an additional density-level classification task during training and combine features learned for the two tasks, thus forming multi-scale, multi-contextual features to cope with the scale variation and non-uniform distribution. Besides, we utilize a multi-resolution attention (MRA) model to generate score maps, where head locations are with higher scores to guide the network to focus on head regions and suppress non-head regions regardless of the complex backgrounds. During the generation of score maps, atrous convolution layers are used to expand the receptive field with fewer parameters, thus getting higher-level features and providing the MRA model more comprehensive information. Experiments on ShanghaiTech, WorldExpo’10 and UCF datasets demonstrate the effectiveness of our method. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung |
format |
Article |
author |
Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung |
author_sort |
Zhang, Youmei |
title |
Multi-resolution attention convolutional neural network for crowd counting |
title_short |
Multi-resolution attention convolutional neural network for crowd counting |
title_full |
Multi-resolution attention convolutional neural network for crowd counting |
title_fullStr |
Multi-resolution attention convolutional neural network for crowd counting |
title_full_unstemmed |
Multi-resolution attention convolutional neural network for crowd counting |
title_sort |
multi-resolution attention convolutional neural network for crowd counting |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/144965 |
_version_ |
1692012947327418368 |