Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multi...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/172042 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-172042 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1720422023-11-20T05:06:33Z Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement Ma, Junjie Dai, Yaping Jia, Zhiyang Sun, Fuchun Tan, Yap Peng Liu, Jun School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Crowd Counting Density Estimation Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios. 2023-11-20T05:06:33Z 2023-11-20T05:06:33Z 2023 Journal Article Ma, J., Dai, Y., Jia, Z., Sun, F., Tan, Y. P. & Liu, J. (2023). Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement. Pattern Recognition, 141, 109585-. https://dx.doi.org/10.1016/j.patcog.2023.109585 0031-3203 https://hdl.handle.net/10356/172042 10.1016/j.patcog.2023.109585 2-s2.0-85152433578 141 109585 en Pattern Recognition © 2023 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Crowd Counting Density Estimation |
spellingShingle |
Engineering::Electrical and electronic engineering Crowd Counting Density Estimation Ma, Junjie Dai, Yaping Jia, Zhiyang Sun, Fuchun Tan, Yap Peng Liu, Jun Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
description |
Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Ma, Junjie Dai, Yaping Jia, Zhiyang Sun, Fuchun Tan, Yap Peng Liu, Jun |
format |
Article |
author |
Ma, Junjie Dai, Yaping Jia, Zhiyang Sun, Fuchun Tan, Yap Peng Liu, Jun |
author_sort |
Ma, Junjie |
title |
Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
title_short |
Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
title_full |
Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
title_fullStr |
Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
title_full_unstemmed |
Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
title_sort |
crowd counting from single images using recursive multi-pathway zooming and foreground enhancement |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/172042 |
_version_ |
1783955573280079872 |