Hybrid DNN training using both synthetic and real construction images to overcome training data shortage

Although deep neural network (DNN)-powered visual scene understanding is a driving factor in a transition toward construction digitalization and robotic automation, a shortage of construction training images has been a roadblock to achieving DNNs' maximum performance potential. This data shorta...

Full description

Saved in:
Bibliographic Details
Main Authors: Kim, Jinwoo, Kim, Daeho, Lee, SangHyun, Chi, Seokho
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172882
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172882
record_format dspace
spelling sg-ntu-dr.10356-1728822023-12-27T07:43:53Z Hybrid DNN training using both synthetic and real construction images to overcome training data shortage Kim, Jinwoo Kim, Daeho Lee, SangHyun Chi, Seokho School of Civil and Environmental Engineering Engineering::Civil engineering Visual Scene Understanding Deep Neural Network Although deep neural network (DNN)-powered visual scene understanding is a driving factor in a transition toward construction digitalization and robotic automation, a shortage of construction training images has been a roadblock to achieving DNNs' maximum performance potential. This data shortage becomes more problematic in digitally monitoring field workers who perform a variety of activities in an unstructured outdoor construction environment. To address this issue, the authors present a construction worker-centric image synthetization approach that can automatically synthesize and label limitless artificial human images with diverse poses, activities, and outdoor imaging conditions. Using synthesized construction worker-centric images, the authors conduct training experiments to characterize the effects of synthetic images on DNN-powered worker detection. In addition, the authors explore the hybrid effects of synthetic and real images on DNN performance. Results showed that a synthetic image-trained model potentially performs well in diverse field conditions and can even detect construction workers who are missed by a real image-trained model. It was also shown that a hybrid use of synthetic and real images can reduce the number of necessary real training images by 50% and improve DNN performance by 16% on average, compared to when only one of the two data sources are adopted. Moreover, the data hybridity enabled DNNs to reach its near-maximum performance while scaling down the size of a real training dataset by up to 80%. These findings indicate that synthetic images have promising potential for worker-centric DNN training in that they enable higher performance while reducing the human effort needed for real construction image collection and labeling. This capability can help to address the problem of data shortage in construction and enable the training of more accurate and scalable DNN models. Furthermore, this will stimulate the development and implementation of visual artificial intelligence for robotic automation and digitization. This research was conducted with the support of the “National R&D Project for Smart Construction Technology (No. 22SMIP-A158708-03)” funded by the Korea Agency for Infrastructure Technology Advancement under the Ministry of Land, Infrastructure and Transport. 2023-12-27T07:43:53Z 2023-12-27T07:43:53Z 2023 Journal Article Kim, J., Kim, D., Lee, S. & Chi, S. (2023). Hybrid DNN training using both synthetic and real construction images to overcome training data shortage. Automation in Construction, 149, 104771-. https://dx.doi.org/10.1016/j.autcon.2023.104771 0926-5805 https://hdl.handle.net/10356/172882 10.1016/j.autcon.2023.104771 2-s2.0-85148063258 149 104771 en Automation in Construction © 2023 Elsevier B.V. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Civil engineering
Visual Scene Understanding
Deep Neural Network
spellingShingle Engineering::Civil engineering
Visual Scene Understanding
Deep Neural Network
Kim, Jinwoo
Kim, Daeho
Lee, SangHyun
Chi, Seokho
Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
description Although deep neural network (DNN)-powered visual scene understanding is a driving factor in a transition toward construction digitalization and robotic automation, a shortage of construction training images has been a roadblock to achieving DNNs' maximum performance potential. This data shortage becomes more problematic in digitally monitoring field workers who perform a variety of activities in an unstructured outdoor construction environment. To address this issue, the authors present a construction worker-centric image synthetization approach that can automatically synthesize and label limitless artificial human images with diverse poses, activities, and outdoor imaging conditions. Using synthesized construction worker-centric images, the authors conduct training experiments to characterize the effects of synthetic images on DNN-powered worker detection. In addition, the authors explore the hybrid effects of synthetic and real images on DNN performance. Results showed that a synthetic image-trained model potentially performs well in diverse field conditions and can even detect construction workers who are missed by a real image-trained model. It was also shown that a hybrid use of synthetic and real images can reduce the number of necessary real training images by 50% and improve DNN performance by 16% on average, compared to when only one of the two data sources are adopted. Moreover, the data hybridity enabled DNNs to reach its near-maximum performance while scaling down the size of a real training dataset by up to 80%. These findings indicate that synthetic images have promising potential for worker-centric DNN training in that they enable higher performance while reducing the human effort needed for real construction image collection and labeling. This capability can help to address the problem of data shortage in construction and enable the training of more accurate and scalable DNN models. Furthermore, this will stimulate the development and implementation of visual artificial intelligence for robotic automation and digitization.
author2 School of Civil and Environmental Engineering
author_facet School of Civil and Environmental Engineering
Kim, Jinwoo
Kim, Daeho
Lee, SangHyun
Chi, Seokho
format Article
author Kim, Jinwoo
Kim, Daeho
Lee, SangHyun
Chi, Seokho
author_sort Kim, Jinwoo
title Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
title_short Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
title_full Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
title_fullStr Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
title_full_unstemmed Hybrid DNN training using both synthetic and real construction images to overcome training data shortage
title_sort hybrid dnn training using both synthetic and real construction images to overcome training data shortage
publishDate 2023
url https://hdl.handle.net/10356/172882
_version_ 1787136536892932096