Cross-scale generative adversarial network for crowd density estimation from images
This research develops a cross-scale convolutional spatial generative adversarial network (CSGAN), in order to estimate the crowd density from images accurately. It consists of two similar generators, one for the whole feature extraction, and the other for patch scale feature extraction. An encoder–...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/161128 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-161128 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1611282022-08-16T06:58:31Z Cross-scale generative adversarial network for crowd density estimation from images Zhang, Gaowei Pan, Yue Zhang, Limao Tiong, Robert Lee Kong School of Civil and Environmental Engineering Engineering::Civil engineering Generative Adversarial Network Crowd Density Estimation This research develops a cross-scale convolutional spatial generative adversarial network (CSGAN), in order to estimate the crowd density from images accurately. It consists of two similar generators, one for the whole feature extraction, and the other for patch scale feature extraction. An encoder–decoder structure is employed to generate density maps from input images or patches. Additionally, a new objective function for crowd counting called cross-scale consistency pursuit containing an adversarial loss, L2 loss, perceptual loss, and consistency loss, is developed to make the generated density maps more realistic and closer to the ground truth. The effectiveness of the proposed CSGAN is verified in two public datasets. Results indicate that the new objective function is able to reach the most satisfying value of evaluation metrics in both the low-density and high-density crowd scenes when it is compared with other state-of-the-art methods on the test datasets. Moreover, the proposed CSGAN is more practical and flexible due to the smaller computational complexity. Its estimation capability will be significantly improved even in a small size of training data. Overall, this research contributes to the development of a novel computer vision approach together with a new objective function to generate density maps from cross-scale crowd images, enabling the counting process more accurately and efficiently. Ministry of Education (MOE) Nanyang Technological University The Ministry of Education Tier 1 Grant, Singapore (No.M4011971.030) and the Start-Up Grant at Nanyang Technological University, Singapore (No. M4082160.030) are acknowledged for their financial support of this research. 2022-08-16T06:58:31Z 2022-08-16T06:58:31Z 2020 Journal Article Zhang, G., Pan, Y., Zhang, L. & Tiong, R. L. K. (2020). Cross-scale generative adversarial network for crowd density estimation from images. Engineering Applications of Artificial Intelligence, 94, 103777-. https://dx.doi.org/10.1016/j.engappai.2020.103777 0952-1976 https://hdl.handle.net/10356/161128 10.1016/j.engappai.2020.103777 2-s2.0-85087931129 94 103777 en M4011971.030 M4082160.030 Engineering Applications of Artificial Intelligence © 2020 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Civil engineering Generative Adversarial Network Crowd Density Estimation |
spellingShingle |
Engineering::Civil engineering Generative Adversarial Network Crowd Density Estimation Zhang, Gaowei Pan, Yue Zhang, Limao Tiong, Robert Lee Kong Cross-scale generative adversarial network for crowd density estimation from images |
description |
This research develops a cross-scale convolutional spatial generative adversarial network (CSGAN), in order to estimate the crowd density from images accurately. It consists of two similar generators, one for the whole feature extraction, and the other for patch scale feature extraction. An encoder–decoder structure is employed to generate density maps from input images or patches. Additionally, a new objective function for crowd counting called cross-scale consistency pursuit containing an adversarial loss, L2 loss, perceptual loss, and consistency loss, is developed to make the generated density maps more realistic and closer to the ground truth. The effectiveness of the proposed CSGAN is verified in two public datasets. Results indicate that the new objective function is able to reach the most satisfying value of evaluation metrics in both the low-density and high-density crowd scenes when it is compared with other state-of-the-art methods on the test datasets. Moreover, the proposed CSGAN is more practical and flexible due to the smaller computational complexity. Its estimation capability will be significantly improved even in a small size of training data. Overall, this research contributes to the development of a novel computer vision approach together with a new objective function to generate density maps from cross-scale crowd images, enabling the counting process more accurately and efficiently. |
author2 |
School of Civil and Environmental Engineering |
author_facet |
School of Civil and Environmental Engineering Zhang, Gaowei Pan, Yue Zhang, Limao Tiong, Robert Lee Kong |
format |
Article |
author |
Zhang, Gaowei Pan, Yue Zhang, Limao Tiong, Robert Lee Kong |
author_sort |
Zhang, Gaowei |
title |
Cross-scale generative adversarial network for crowd density estimation from images |
title_short |
Cross-scale generative adversarial network for crowd density estimation from images |
title_full |
Cross-scale generative adversarial network for crowd density estimation from images |
title_fullStr |
Cross-scale generative adversarial network for crowd density estimation from images |
title_full_unstemmed |
Cross-scale generative adversarial network for crowd density estimation from images |
title_sort |
cross-scale generative adversarial network for crowd density estimation from images |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/161128 |
_version_ |
1743119523461988352 |