Multi-stage generative adversarial networks for generating pavement crack images

The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primaril...

Full description

Saved in:
Bibliographic Details
Main Authors: Han, Chengjia, Ma, Tao, Huyan, Ju, Tong, Zheng, Yang, Handuo, Yang, Yaowen
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/180178
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-180178
record_format dspace
spelling sg-ntu-dr.10356-1801782024-09-23T04:42:05Z Multi-stage generative adversarial networks for generating pavement crack images Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen School of Civil and Environmental Engineering Engineering Generative adversarial network Pavement crack The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primarily arising from the scarcity of cracked pavement images, which hampers their effectiveness in road maintenance engineering. To address this issue and enhance the fast and stable generation of high-quality crack images for engineering purposes, this study proposes two frameworks based on Generative Adversarial Networks (GAN): Multi-Stage GAN-v1 and Multi-Stage GAN-v2. These frameworks break down the complex task of directly generating high-quality images into a series of incremental steps, gradually increasing the image resolution from initially generated low-precision images. Both versions, v1 and v2, consist of multiple sequentially connected generation units, with each unit utilizing the Wasserstein Generative Adversarial Network-Gradient Penalty (WGAN-GP). Furthermore, v2 has the additional capability of generating pavement crack images of specified types and simultaneously providing crack segmentation labels. This feature significantly enhances the practical applicability of the generated data in engineering contexts. In a comprehensive case study, the evaluation results clearly illustrate the superior image generation quality from the two proposed frameworks. Moreover, the results from ablation experiments, involving the training of nine state-of-the-art crack semantic segmentation and object detection networks using both generated images and real images, demonstrate the effective utility of these generated images for training pavement distress detection networks. National Research Foundation (NRF) This research was funded by the National Key Research and Development Project (grant number 2020YFB1600102, 2020YFA0714302), National Natural Science Foundation of China (grant number 51878164, 51922030), Fundamental Research Funds for the Central Universities (No. 2242021R10042), Postgraduate Research & Practice Innovation Program of Jiangsu Province (grant number KYCX21_0136) and Southeast University “Zhongying Young Scholars” Project. This research is also supported by the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG2-TC-2021-001). The authors would also like to acknowledge the financial support provided by China Road and Bridge Engineering Co., Ltd (grant number CRBC/ KHM/2021/053). The authors would also like to acknowledge the financial support provided by Jiangsu Provincial Department of Science and Technology (grant number BZ2023019). 2024-09-23T04:42:05Z 2024-09-23T04:42:05Z 2024 Journal Article Han, C., Ma, T., Huyan, J., Tong, Z., Yang, H. & Yang, Y. (2024). Multi-stage generative adversarial networks for generating pavement crack images. Engineering Applications of Artificial Intelligence, 131, 107767-. https://dx.doi.org/10.1016/j.engappai.2023.107767 0952-1976 https://hdl.handle.net/10356/180178 10.1016/j.engappai.2023.107767 2-s2.0-85181766395 131 107767 en AISG2-TC-2021-001 Engineering Applications of Artificial Intelligence © 2023 Elsevier Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Generative adversarial network
Pavement crack
spellingShingle Engineering
Generative adversarial network
Pavement crack
Han, Chengjia
Ma, Tao
Huyan, Ju
Tong, Zheng
Yang, Handuo
Yang, Yaowen
Multi-stage generative adversarial networks for generating pavement crack images
description The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primarily arising from the scarcity of cracked pavement images, which hampers their effectiveness in road maintenance engineering. To address this issue and enhance the fast and stable generation of high-quality crack images for engineering purposes, this study proposes two frameworks based on Generative Adversarial Networks (GAN): Multi-Stage GAN-v1 and Multi-Stage GAN-v2. These frameworks break down the complex task of directly generating high-quality images into a series of incremental steps, gradually increasing the image resolution from initially generated low-precision images. Both versions, v1 and v2, consist of multiple sequentially connected generation units, with each unit utilizing the Wasserstein Generative Adversarial Network-Gradient Penalty (WGAN-GP). Furthermore, v2 has the additional capability of generating pavement crack images of specified types and simultaneously providing crack segmentation labels. This feature significantly enhances the practical applicability of the generated data in engineering contexts. In a comprehensive case study, the evaluation results clearly illustrate the superior image generation quality from the two proposed frameworks. Moreover, the results from ablation experiments, involving the training of nine state-of-the-art crack semantic segmentation and object detection networks using both generated images and real images, demonstrate the effective utility of these generated images for training pavement distress detection networks.
author2 School of Civil and Environmental Engineering
author_facet School of Civil and Environmental Engineering
Han, Chengjia
Ma, Tao
Huyan, Ju
Tong, Zheng
Yang, Handuo
Yang, Yaowen
format Article
author Han, Chengjia
Ma, Tao
Huyan, Ju
Tong, Zheng
Yang, Handuo
Yang, Yaowen
author_sort Han, Chengjia
title Multi-stage generative adversarial networks for generating pavement crack images
title_short Multi-stage generative adversarial networks for generating pavement crack images
title_full Multi-stage generative adversarial networks for generating pavement crack images
title_fullStr Multi-stage generative adversarial networks for generating pavement crack images
title_full_unstemmed Multi-stage generative adversarial networks for generating pavement crack images
title_sort multi-stage generative adversarial networks for generating pavement crack images
publishDate 2024
url https://hdl.handle.net/10356/180178
_version_ 1814047236898684928