Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
Although deep neural network (DNN)-powered visual scene understanding is a driving factor in a transition toward construction digitalization and robotic automation, a shortage of construction training images has been a roadblock to achieving DNNs' maximum performance potential. This data shorta...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/172882 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-172882 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1728822023-12-27T07:43:53Z Hybrid DNN training using both synthetic and real construction images to overcome training data shortage Kim, Jinwoo Kim, Daeho Lee, SangHyun Chi, Seokho School of Civil and Environmental Engineering Engineering::Civil engineering Visual Scene Understanding Deep Neural Network Although deep neural network (DNN)-powered visual scene understanding is a driving factor in a transition toward construction digitalization and robotic automation, a shortage of construction training images has been a roadblock to achieving DNNs' maximum performance potential. This data shortage becomes more problematic in digitally monitoring field workers who perform a variety of activities in an unstructured outdoor construction environment. To address this issue, the authors present a construction worker-centric image synthetization approach that can automatically synthesize and label limitless artificial human images with diverse poses, activities, and outdoor imaging conditions. Using synthesized construction worker-centric images, the authors conduct training experiments to characterize the effects of synthetic images on DNN-powered worker detection. In addition, the authors explore the hybrid effects of synthetic and real images on DNN performance. Results showed that a synthetic image-trained model potentially performs well in diverse field conditions and can even detect construction workers who are missed by a real image-trained model. It was also shown that a hybrid use of synthetic and real images can reduce the number of necessary real training images by 50% and improve DNN performance by 16% on average, compared to when only one of the two data sources are adopted. Moreover, the data hybridity enabled DNNs to reach its near-maximum performance while scaling down the size of a real training dataset by up to 80%. These findings indicate that synthetic images have promising potential for worker-centric DNN training in that they enable higher performance while reducing the human effort needed for real construction image collection and labeling. This capability can help to address the problem of data shortage in construction and enable the training of more accurate and scalable DNN models. Furthermore, this will stimulate the development and implementation of visual artificial intelligence for robotic automation and digitization. This research was conducted with the support of the “National R&D Project for Smart Construction Technology (No. 22SMIP-A158708-03)” funded by the Korea Agency for Infrastructure Technology Advancement under the Ministry of Land, Infrastructure and Transport. 2023-12-27T07:43:53Z 2023-12-27T07:43:53Z 2023 Journal Article Kim, J., Kim, D., Lee, S. & Chi, S. (2023). Hybrid DNN training using both synthetic and real construction images to overcome training data shortage. Automation in Construction, 149, 104771-. https://dx.doi.org/10.1016/j.autcon.2023.104771 0926-5805 https://hdl.handle.net/10356/172882 10.1016/j.autcon.2023.104771 2-s2.0-85148063258 149 104771 en Automation in Construction © 2023 Elsevier B.V. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Civil engineering Visual Scene Understanding Deep Neural Network |
spellingShingle |
Engineering::Civil engineering Visual Scene Understanding Deep Neural Network Kim, Jinwoo Kim, Daeho Lee, SangHyun Chi, Seokho Hybrid DNN training using both synthetic and real construction images to overcome training data shortage |
description |
Although deep neural network (DNN)-powered visual scene understanding is a driving factor in a transition toward construction digitalization and robotic automation, a shortage of construction training images has been a roadblock to achieving DNNs' maximum performance potential. This data shortage becomes more problematic in digitally monitoring field workers who perform a variety of activities in an unstructured outdoor construction environment. To address this issue, the authors present a construction worker-centric image synthetization approach that can automatically synthesize and label limitless artificial human images with diverse poses, activities, and outdoor imaging conditions. Using synthesized construction worker-centric images, the authors conduct training experiments to characterize the effects of synthetic images on DNN-powered worker detection. In addition, the authors explore the hybrid effects of synthetic and real images on DNN performance. Results showed that a synthetic image-trained model potentially performs well in diverse field conditions and can even detect construction workers who are missed by a real image-trained model. It was also shown that a hybrid use of synthetic and real images can reduce the number of necessary real training images by 50% and improve DNN performance by 16% on average, compared to when only one of the two data sources are adopted. Moreover, the data hybridity enabled DNNs to reach its near-maximum performance while scaling down the size of a real training dataset by up to 80%. These findings indicate that synthetic images have promising potential for worker-centric DNN training in that they enable higher performance while reducing the human effort needed for real construction image collection and labeling. This capability can help to address the problem of data shortage in construction and enable the training of more accurate and scalable DNN models. Furthermore, this will stimulate the development and implementation of visual artificial intelligence for robotic automation and digitization. |
author2 |
School of Civil and Environmental Engineering |
author_facet |
School of Civil and Environmental Engineering Kim, Jinwoo Kim, Daeho Lee, SangHyun Chi, Seokho |
format |
Article |
author |
Kim, Jinwoo Kim, Daeho Lee, SangHyun Chi, Seokho |
author_sort |
Kim, Jinwoo |
title |
Hybrid DNN training using both synthetic and real construction images to overcome training data shortage |
title_short |
Hybrid DNN training using both synthetic and real construction images to overcome training data shortage |
title_full |
Hybrid DNN training using both synthetic and real construction images to overcome training data shortage |
title_fullStr |
Hybrid DNN training using both synthetic and real construction images to overcome training data shortage |
title_full_unstemmed |
Hybrid DNN training using both synthetic and real construction images to overcome training data shortage |
title_sort |
hybrid dnn training using both synthetic and real construction images to overcome training data shortage |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/172882 |
_version_ |
1787136536892932096 |