Multi-stage generative adversarial networks for generating pavement crack images
The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primaril...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/180178 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-180178 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1801782024-09-23T04:42:05Z Multi-stage generative adversarial networks for generating pavement crack images Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen School of Civil and Environmental Engineering Engineering Generative adversarial network Pavement crack The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primarily arising from the scarcity of cracked pavement images, which hampers their effectiveness in road maintenance engineering. To address this issue and enhance the fast and stable generation of high-quality crack images for engineering purposes, this study proposes two frameworks based on Generative Adversarial Networks (GAN): Multi-Stage GAN-v1 and Multi-Stage GAN-v2. These frameworks break down the complex task of directly generating high-quality images into a series of incremental steps, gradually increasing the image resolution from initially generated low-precision images. Both versions, v1 and v2, consist of multiple sequentially connected generation units, with each unit utilizing the Wasserstein Generative Adversarial Network-Gradient Penalty (WGAN-GP). Furthermore, v2 has the additional capability of generating pavement crack images of specified types and simultaneously providing crack segmentation labels. This feature significantly enhances the practical applicability of the generated data in engineering contexts. In a comprehensive case study, the evaluation results clearly illustrate the superior image generation quality from the two proposed frameworks. Moreover, the results from ablation experiments, involving the training of nine state-of-the-art crack semantic segmentation and object detection networks using both generated images and real images, demonstrate the effective utility of these generated images for training pavement distress detection networks. National Research Foundation (NRF) This research was funded by the National Key Research and Development Project (grant number 2020YFB1600102, 2020YFA0714302), National Natural Science Foundation of China (grant number 51878164, 51922030), Fundamental Research Funds for the Central Universities (No. 2242021R10042), Postgraduate Research & Practice Innovation Program of Jiangsu Province (grant number KYCX21_0136) and Southeast University “Zhongying Young Scholars” Project. This research is also supported by the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG2-TC-2021-001). The authors would also like to acknowledge the financial support provided by China Road and Bridge Engineering Co., Ltd (grant number CRBC/ KHM/2021/053). The authors would also like to acknowledge the financial support provided by Jiangsu Provincial Department of Science and Technology (grant number BZ2023019). 2024-09-23T04:42:05Z 2024-09-23T04:42:05Z 2024 Journal Article Han, C., Ma, T., Huyan, J., Tong, Z., Yang, H. & Yang, Y. (2024). Multi-stage generative adversarial networks for generating pavement crack images. Engineering Applications of Artificial Intelligence, 131, 107767-. https://dx.doi.org/10.1016/j.engappai.2023.107767 0952-1976 https://hdl.handle.net/10356/180178 10.1016/j.engappai.2023.107767 2-s2.0-85181766395 131 107767 en AISG2-TC-2021-001 Engineering Applications of Artificial Intelligence © 2023 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering Generative adversarial network Pavement crack |
spellingShingle |
Engineering Generative adversarial network Pavement crack Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen Multi-stage generative adversarial networks for generating pavement crack images |
description |
The application of machine learning techniques in pavement health monitoring based on computer vision has greatly improved the accuracy and efficiency in the detection of pavement distress levels and categories. However, a persistent challenge in this field is the issue of sample imbalance, primarily arising from the scarcity of cracked pavement images, which hampers their effectiveness in road maintenance engineering. To address this issue and enhance the fast and stable generation of high-quality crack images for engineering purposes, this study proposes two frameworks based on Generative Adversarial Networks (GAN): Multi-Stage GAN-v1 and Multi-Stage GAN-v2. These frameworks break down the complex task of directly generating high-quality images into a series of incremental steps, gradually increasing the image resolution from initially generated low-precision images. Both versions, v1 and v2, consist of multiple sequentially connected generation units, with each unit utilizing the Wasserstein Generative Adversarial Network-Gradient Penalty (WGAN-GP). Furthermore, v2 has the additional capability of generating pavement crack images of specified types and simultaneously providing crack segmentation labels. This feature significantly enhances the practical applicability of the generated data in engineering contexts. In a comprehensive case study, the evaluation results clearly illustrate the superior image generation quality from the two proposed frameworks. Moreover, the results from ablation experiments, involving the training of nine state-of-the-art crack semantic segmentation and object detection networks using both generated images and real images, demonstrate the effective utility of these generated images for training pavement distress detection networks. |
author2 |
School of Civil and Environmental Engineering |
author_facet |
School of Civil and Environmental Engineering Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen |
format |
Article |
author |
Han, Chengjia Ma, Tao Huyan, Ju Tong, Zheng Yang, Handuo Yang, Yaowen |
author_sort |
Han, Chengjia |
title |
Multi-stage generative adversarial networks for generating pavement crack images |
title_short |
Multi-stage generative adversarial networks for generating pavement crack images |
title_full |
Multi-stage generative adversarial networks for generating pavement crack images |
title_fullStr |
Multi-stage generative adversarial networks for generating pavement crack images |
title_full_unstemmed |
Multi-stage generative adversarial networks for generating pavement crack images |
title_sort |
multi-stage generative adversarial networks for generating pavement crack images |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/180178 |
_version_ |
1814047236898684928 |