Multi-resolution attention convolutional neural network for crowd counting

Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task,...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Youmei, Zhou, Chunluan, Chang, Faliang, Kot, Alex Chichung
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/144965
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-144965
record_format dspace
spelling sg-ntu-dr.10356-1449652021-02-03T05:18:27Z Multi-resolution attention convolutional neural network for crowd counting Zhang, Youmei Zhou, Chunluan Chang, Faliang Kot, Alex Chichung School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Crowd Counting Multi-resolution Attention (MRA) Model Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task, we exploit an additional density-level classification task during training and combine features learned for the two tasks, thus forming multi-scale, multi-contextual features to cope with the scale variation and non-uniform distribution. Besides, we utilize a multi-resolution attention (MRA) model to generate score maps, where head locations are with higher scores to guide the network to focus on head regions and suppress non-head regions regardless of the complex backgrounds. During the generation of score maps, atrous convolution layers are used to expand the receptive field with fewer parameters, thus getting higher-level features and providing the MRA model more comprehensive information. Experiments on ShanghaiTech, WorldExpo’10 and UCF datasets demonstrate the effectiveness of our method. Info-communications Media Development Authority (IMDA) Accepted version This work was supported in part by the National Natural Science Foundation of China under Grant 61673244, Grant 61273277 and Grant 61703240) and was carried out at the Rapid-Rich Object Search (ROSE) Lab at the Nanyang Technological University, Singapore. The ROSE Lab is supported by the Infocomm Media Development Authority, Singapore. 2020-12-07T03:43:20Z 2020-12-07T03:43:20Z 2019 Journal Article Zhang, Y., Zhou, C., Chang, F., & Kot, A. C. (2019). Multi-resolution attention convolutional neural network for crowd counting. Neurocomputing, 329, 144–152. doi:10.1016/j.neucom.2018.10.058 0925-2312 https://hdl.handle.net/10356/144965 10.1016/j.neucom.2018.10.058 329 144 152 en Neurocomputing © 2018 Elsevier B.V. All rights reserved. This paper was published in Neurocomputing and is made available with permission of Elsevier B.V. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
Crowd Counting
Multi-resolution Attention (MRA) Model
spellingShingle Engineering::Electrical and electronic engineering
Crowd Counting
Multi-resolution Attention (MRA) Model
Zhang, Youmei
Zhou, Chunluan
Chang, Faliang
Kot, Alex Chichung
Multi-resolution attention convolutional neural network for crowd counting
description Estimating crowd counts remains a challenging task due to the problems of scale variations, non-uniform distribution and complex backgrounds. In this paper, we propose a multi-resolution attention convolutional neural network (MRA-CNN) to address this challenging task. Except for the counting task, we exploit an additional density-level classification task during training and combine features learned for the two tasks, thus forming multi-scale, multi-contextual features to cope with the scale variation and non-uniform distribution. Besides, we utilize a multi-resolution attention (MRA) model to generate score maps, where head locations are with higher scores to guide the network to focus on head regions and suppress non-head regions regardless of the complex backgrounds. During the generation of score maps, atrous convolution layers are used to expand the receptive field with fewer parameters, thus getting higher-level features and providing the MRA model more comprehensive information. Experiments on ShanghaiTech, WorldExpo’10 and UCF datasets demonstrate the effectiveness of our method.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Zhang, Youmei
Zhou, Chunluan
Chang, Faliang
Kot, Alex Chichung
format Article
author Zhang, Youmei
Zhou, Chunluan
Chang, Faliang
Kot, Alex Chichung
author_sort Zhang, Youmei
title Multi-resolution attention convolutional neural network for crowd counting
title_short Multi-resolution attention convolutional neural network for crowd counting
title_full Multi-resolution attention convolutional neural network for crowd counting
title_fullStr Multi-resolution attention convolutional neural network for crowd counting
title_full_unstemmed Multi-resolution attention convolutional neural network for crowd counting
title_sort multi-resolution attention convolutional neural network for crowd counting
publishDate 2020
url https://hdl.handle.net/10356/144965
_version_ 1692012947327418368