Multi-stage generative adversarial networks for generating pavement crack images

The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primaril...

Full description

Saved in:

Bibliographic Details
Main Authors:	Han, Chengjia, Ma, Tao, Huyan, Ju, Tong, Zheng, Yang, Handuo, Yang, Yaowen
Other Authors:	School of Civil and Environmental Engineering
Format:	Article
Language:	English
Published:	2024
Subjects:	Engineering Generative adversarial network Pavement crack
Online Access:	https://hdl.handle.net/10356/180178
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-180178
record_format	dspace
spelling	sg-ntu-dr.10356-1801782024-09-23T04:42:05Z Multi-stage generative adversarial networks for generating pavement crack images Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen School of Civil and Environmental Engineering Engineering Generative adversarial network Pavement crack The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primarily arising from the scarcity of cracked pavement images, which hampers their effectiveness in road maintenance engineering. To address this issue and enhance the fast and stable generation of high-quality crack images for engineering purposes, this study proposes two frameworks based on Generative Adversarial Networks (GAN): Multi-Stage GAN-v1 and Multi-Stage GAN-v2. These frameworks break down the complex task of directly generating high-quality images into a series of incremental steps, gradually increasing the image resolution from initially generated low-precision images. Both versions, v1 and v2, consist of multiple sequentially connected generation units, with each unit utilizing the Wasserstein Generative Adversarial Network-Gradient Penalty (WGAN-GP). Furthermore, v2 has the additional capability of generating pavement crack images of specified types and simultaneously providing crack segmentation labels. This feature significantly enhances the practical applicability of the generated data in engineering contexts. In a comprehensive case study, the evaluation results clearly illustrate the superior image generation quality from the two proposed frameworks. Moreover, the results from ablation experiments, involving the training of nine state-of-the-art crack semantic segmentation and object detection networks using both generated images and real images, demonstrate the effective utility of these generated images for training pavement distress detection networks. National Research Foundation (NRF) This research was funded by the National Key Research and Development Project (grant number 2020YFB1600102, 2020YFA0714302), National Natural Science Foundation of China (grant number 51878164, 51922030), Fundamental Research Funds for the Central Universities (No. 2242021R10042), Postgraduate Research & Practice Innovation Program of Jiangsu Province (grant number KYCX21_0136) and Southeast University “Zhongying Young Scholars” Project. This research is also supported by the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG2-TC-2021-001). The authors would also like to acknowledge the financial support provided by China Road and Bridge Engineering Co., Ltd (grant number CRBC/ KHM/2021/053). The authors would also like to acknowledge the financial support provided by Jiangsu Provincial Department of Science and Technology (grant number BZ2023019). 2024-09-23T04:42:05Z 2024-09-23T04:42:05Z 2024 Journal Article Han, C., Ma, T., Huyan, J., Tong, Z., Yang, H. & Yang, Y. (2024). Multi-stage generative adversarial networks for generating pavement crack images. Engineering Applications of Artificial Intelligence, 131, 107767-. https://dx.doi.org/10.1016/j.engappai.2023.107767 0952-1976 https://hdl.handle.net/10356/180178 10.1016/j.engappai.2023.107767 2-s2.0-85181766395 131 107767 en AISG2-TC-2021-001 Engineering Applications of Artificial Intelligence © 2023 Elsevier Ltd. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering Generative adversarial network Pavement crack
spellingShingle	Engineering Generative adversarial network Pavement crack Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen Multi-stage generative adversarial networks for generating pavement crack images
description	The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primarily arising from the scarcity of cracked pavement images, which hampers their effectiveness in road maintenance engineering. To address this issue and enhance the fast and stable generation of high-quality crack images for engineering purposes, this study proposes two frameworks based on Generative Adversarial Networks (GAN): Multi-Stage GAN-v1 and Multi-Stage GAN-v2. These frameworks break down the complex task of directly generating high-quality images into a series of incremental steps, gradually increasing the image resolution from initially generated low-precision images. Both versions, v1 and v2, consist of multiple sequentially connected generation units, with each unit utilizing the Wasserstein Generative Adversarial Network-Gradient Penalty (WGAN-GP). Furthermore, v2 has the additional capability of generating pavement crack images of specified types and simultaneously providing crack segmentation labels. This feature significantly enhances the practical applicability of the generated data in engineering contexts. In a comprehensive case study, the evaluation results clearly illustrate the superior image generation quality from the two proposed frameworks. Moreover, the results from ablation experiments, involving the training of nine state-of-the-art crack semantic segmentation and object detection networks using both generated images and real images, demonstrate the effective utility of these generated images for training pavement distress detection networks.
author2	School of Civil and Environmental Engineering
author_facet	School of Civil and Environmental Engineering Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen
format	Article
author	Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen
author_sort	Han, Chengjia
title	Multi-stage generative adversarial networks for generating pavement crack images
title_short	Multi-stage generative adversarial networks for generating pavement crack images
title_full	Multi-stage generative adversarial networks for generating pavement crack images
title_fullStr	Multi-stage generative adversarial networks for generating pavement crack images
title_full_unstemmed	Multi-stage generative adversarial networks for generating pavement crack images
title_sort	multi-stage generative adversarial networks for generating pavement crack images
publishDate	2024
url	https://hdl.handle.net/10356/180178
_version_	1814047236898684928

Multi-stage generative adversarial networks for generating pavement crack images

Similar Items