Multi-resolution attention convolutional neural network for crowd counting

Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhang, Youmei, Zhou, Chunluan, Chang, Faliang, Kot, Alex Chichung
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2020
Subjects:	Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model
Online Access:	https://hdl.handle.net/10356/144965
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-144965
record_format	dspace
spelling	sg-ntu-dr.10356-1449652021-02-03T05:18:27Z Multi-resolution attention convolutional neural network for crowd counting Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task, we exploit an additional density-level classification task during training and combine features learned for the two tasks, thus forming multi-scale, multi-contextual features to cope with the scale variation and non-uniform distribution. Besides, we utilize a multi-resolution attention (MRA) model to generate score maps, where head locations are with higher scores to guide the network to focus on head regions and suppress non-head regions regardless of the complex backgrounds. During the generation of score maps, atrous convolution layers are used to expand the receptive field with fewer parameters, thus getting higher-level features and providing the MRA model more comprehensive information. Experiments on ShanghaiTech, WorldExpo’10 and UCF datasets demonstrate the effectiveness of our method. Info-communications Media Development Authority (IMDA) Accepted version This work was supported in part by the National Natural Science Foundation of China under Grant 61673244, Grant 61273277 and Grant 61703240) and was carried out at the Rapid-Rich Object Search (ROSE) Lab at the Nanyang Technological University, Singapore. The ROSE Lab is supported by the Infocomm Media Development Authority, Singapore. 2020-12-07T03:43:20Z 2020-12-07T03:43:20Z 2019 Journal Article Zhang, Y., Zhou, C., Chang, F., & Kot, A. C. (2019). Multi-resolution attention convolutional neural network for crowd counting. Neurocomputing, 329, 144–152. doi:10.1016/j.neucom.2018.10.058 0925-2312 https://hdl.handle.net/10356/144965 10.1016/j.neucom.2018.10.058 329 144 152 en Neurocomputing © 2018 Elsevier B.V. All rights reserved. This paper was published in Neurocomputing and is made available with permission of Elsevier B.V. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model
spellingShingle	Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung Multi-resolution attention convolutional neural network for crowd counting
description	Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task, we exploit an additional density-level classification task during training and combine features learned for the two tasks, thus forming multi-scale, multi-contextual features to cope with the scale variation and non-uniform distribution. Besides, we utilize a multi-resolution attention (MRA) model to generate score maps, where head locations are with higher scores to guide the network to focus on head regions and suppress non-head regions regardless of the complex backgrounds. During the generation of score maps, atrous convolution layers are used to expand the receptive field with fewer parameters, thus getting higher-level features and providing the MRA model more comprehensive information. Experiments on ShanghaiTech, WorldExpo’10 and UCF datasets demonstrate the effectiveness of our method.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung
format	Article
author	Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung
author_sort	Zhang, Youmei
title	Multi-resolution attention convolutional neural network for crowd counting
title_short	Multi-resolution attention convolutional neural network for crowd counting
title_full	Multi-resolution attention convolutional neural network for crowd counting
title_fullStr	Multi-resolution attention convolutional neural network for crowd counting
title_full_unstemmed	Multi-resolution attention convolutional neural network for crowd counting
title_sort	multi-resolution attention convolutional neural network for crowd counting
publishDate	2020
url	https://hdl.handle.net/10356/144965
_version_	1692012947327418368

Multi-resolution attention convolutional neural network for crowd counting

Similar Items