Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multi...

Full description

Saved in:
Bibliographic Details
Main Authors: Ma, Junjie, Dai, Yaping, Jia, Zhiyang, Sun, Fuchun, Tan, Yap Peng, Liu, Jun
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172042
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172042
record_format dspace
spelling sg-ntu-dr.10356-1720422023-11-20T05:06:33Z Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement Ma, Junjie Dai, Yaping Jia, Zhiyang Sun, Fuchun Tan, Yap Peng Liu, Jun School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Crowd Counting Density Estimation Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios. 2023-11-20T05:06:33Z 2023-11-20T05:06:33Z 2023 Journal Article Ma, J., Dai, Y., Jia, Z., Sun, F., Tan, Y. P. & Liu, J. (2023). Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement. Pattern Recognition, 141, 109585-. https://dx.doi.org/10.1016/j.patcog.2023.109585 0031-3203 https://hdl.handle.net/10356/172042 10.1016/j.patcog.2023.109585 2-s2.0-85152433578 141 109585 en Pattern Recognition © 2023 Elsevier Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
Crowd Counting
Density Estimation
spellingShingle Engineering::Electrical and electronic engineering
Crowd Counting
Density Estimation
Ma, Junjie
Dai, Yaping
Jia, Zhiyang
Sun, Fuchun
Tan, Yap Peng
Liu, Jun
Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
description Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Ma, Junjie
Dai, Yaping
Jia, Zhiyang
Sun, Fuchun
Tan, Yap Peng
Liu, Jun
format Article
author Ma, Junjie
Dai, Yaping
Jia, Zhiyang
Sun, Fuchun
Tan, Yap Peng
Liu, Jun
author_sort Ma, Junjie
title Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
title_short Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
title_full Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
title_fullStr Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
title_full_unstemmed Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
title_sort crowd counting from single images using recursive multi-pathway zooming and foreground enhancement
publishDate 2023
url https://hdl.handle.net/10356/172042
_version_ 1783955573280079872