Cross-scale generative adversarial network for crowd density estimation from images

This research develops a cross-scale convolutional spatial generative adversarial network (CSGAN), in order to estimate the crowd density from images accurately. It consists of two similar generators, one for the whole feature extraction, and the other for patch scale feature extraction. An encoder–...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Gaowei, Pan, Yue, Zhang, Limao, Tiong, Robert Lee Kong
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/161128
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-161128
record_format dspace
spelling sg-ntu-dr.10356-1611282022-08-16T06:58:31Z Cross-scale generative adversarial network for crowd density estimation from images Zhang, Gaowei Pan, Yue Zhang, Limao Tiong, Robert Lee Kong School of Civil and Environmental Engineering Engineering::Civil engineering Generative Adversarial Network Crowd Density Estimation This research develops a cross-scale convolutional spatial generative adversarial network (CSGAN), in order to estimate the crowd density from images accurately. It consists of two similar generators, one for the whole feature extraction, and the other for patch scale feature extraction. An encoder–decoder structure is employed to generate density maps from input images or patches. Additionally, a new objective function for crowd counting called cross-scale consistency pursuit containing an adversarial loss, L2 loss, perceptual loss, and consistency loss, is developed to make the generated density maps more realistic and closer to the ground truth. The effectiveness of the proposed CSGAN is verified in two public datasets. Results indicate that the new objective function is able to reach the most satisfying value of evaluation metrics in both the low-density and high-density crowd scenes when it is compared with other state-of-the-art methods on the test datasets. Moreover, the proposed CSGAN is more practical and flexible due to the smaller computational complexity. Its estimation capability will be significantly improved even in a small size of training data. Overall, this research contributes to the development of a novel computer vision approach together with a new objective function to generate density maps from cross-scale crowd images, enabling the counting process more accurately and efficiently. Ministry of Education (MOE) Nanyang Technological University The Ministry of Education Tier 1 Grant, Singapore (No.M4011971.030) and the Start-Up Grant at Nanyang Technological University, Singapore (No. M4082160.030) are acknowledged for their financial support of this research. 2022-08-16T06:58:31Z 2022-08-16T06:58:31Z 2020 Journal Article Zhang, G., Pan, Y., Zhang, L. & Tiong, R. L. K. (2020). Cross-scale generative adversarial network for crowd density estimation from images. Engineering Applications of Artificial Intelligence, 94, 103777-. https://dx.doi.org/10.1016/j.engappai.2020.103777 0952-1976 https://hdl.handle.net/10356/161128 10.1016/j.engappai.2020.103777 2-s2.0-85087931129 94 103777 en M4011971.030 M4082160.030 Engineering Applications of Artificial Intelligence © 2020 Elsevier Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Civil engineering
Generative Adversarial Network
Crowd Density Estimation
spellingShingle Engineering::Civil engineering
Generative Adversarial Network
Crowd Density Estimation
Zhang, Gaowei
Pan, Yue
Zhang, Limao
Tiong, Robert Lee Kong
Cross-scale generative adversarial network for crowd density estimation from images
description This research develops a cross-scale convolutional spatial generative adversarial network (CSGAN), in order to estimate the crowd density from images accurately. It consists of two similar generators, one for the whole feature extraction, and the other for patch scale feature extraction. An encoder–decoder structure is employed to generate density maps from input images or patches. Additionally, a new objective function for crowd counting called cross-scale consistency pursuit containing an adversarial loss, L2 loss, perceptual loss, and consistency loss, is developed to make the generated density maps more realistic and closer to the ground truth. The effectiveness of the proposed CSGAN is verified in two public datasets. Results indicate that the new objective function is able to reach the most satisfying value of evaluation metrics in both the low-density and high-density crowd scenes when it is compared with other state-of-the-art methods on the test datasets. Moreover, the proposed CSGAN is more practical and flexible due to the smaller computational complexity. Its estimation capability will be significantly improved even in a small size of training data. Overall, this research contributes to the development of a novel computer vision approach together with a new objective function to generate density maps from cross-scale crowd images, enabling the counting process more accurately and efficiently.
author2 School of Civil and Environmental Engineering
author_facet School of Civil and Environmental Engineering
Zhang, Gaowei
Pan, Yue
Zhang, Limao
Tiong, Robert Lee Kong
format Article
author Zhang, Gaowei
Pan, Yue
Zhang, Limao
Tiong, Robert Lee Kong
author_sort Zhang, Gaowei
title Cross-scale generative adversarial network for crowd density estimation from images
title_short Cross-scale generative adversarial network for crowd density estimation from images
title_full Cross-scale generative adversarial network for crowd density estimation from images
title_fullStr Cross-scale generative adversarial network for crowd density estimation from images
title_full_unstemmed Cross-scale generative adversarial network for crowd density estimation from images
title_sort cross-scale generative adversarial network for crowd density estimation from images
publishDate 2022
url https://hdl.handle.net/10356/161128
_version_ 1743119523461988352