Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
Semantic segmentation, which aims to acquire pixel-level understanding about images, is among the key components in computer vision. To train a good segmentation model for real-world images, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of re...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/160522 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-160522 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1605222022-07-26T05:03:23Z Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation Lv, Fengmao Lin, Guosheng Liu, Peng Yang, Guowu Pan, Sinno Jialin Duan, Lixin School of Computer Science and Engineering Engineering::Computer science and engineering Semantic Segmentation Domain Adaptation Semantic segmentation, which aims to acquire pixel-level understanding about images, is among the key components in computer vision. To train a good segmentation model for real-world images, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of real-world images beforehand. To get rid of such a nontrivial burden, one can use simulators to automatically generate synthetic images that inherently contain full pixel-level annotations and use them to train a segmentation model for the real-world images. However, training with synthetic images usually cannot lead to good performance due to the domain difference between the synthetic images (i.e., source domain) and the real-world images (i.e., target domain). To deal with this issue, a number of unsupervised domain adaptation (UDA) approaches have been proposed, where no labeled real-world images are available. Different from those methods, in this work, we conduct a pioneer attempt by using easy-to-collect image-level annotations for target images to improve the performance of cross-domain segmentation. Specifically, we leverage those image-level annotations to construct curriculums for the domain adaptation problem. The curriculums describe multi-level properties of the target domain, including label distributions over full images, local regions and single pixels. Since image annotations are 'weak' labels compared to pixel annotations for segmentation, we coin this new problem as weakly-supervised cross-domain segmentation. Comprehensive experiments on the GTA5 -> Cityscapes and SYNTHIA -> Cityscapes settings demonstrate the effectiveness of our method over the existing state-of-the-art baselines. This work was supported in part by the Major Project for New Generation of AI under Grant 2018AAA0100400; in part by the National Natural Science Foundation of China under Grant 11829101, Grant 11931014, and Grant 61772118; and in part by the Fundamental Research Funds for the Central Universities of China under Grant JBK1806002. 2022-07-26T05:03:23Z 2022-07-26T05:03:23Z 2020 Journal Article Lv, F., Lin, G., Liu, P., Yang, G., Pan, S. J. & Duan, L. (2020). Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation. IEEE Transactions On Circuits and Systems for Video Technology, 31(9), 3493-3503. https://dx.doi.org/10.1109/TCSVT.2020.3040343 1051-8215 https://hdl.handle.net/10356/160522 10.1109/TCSVT.2020.3040343 2-s2.0-85097186183 9 31 3493 3503 en IEEE Transactions on Circuits and Systems for Video Technology © 2020 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Semantic Segmentation Domain Adaptation |
spellingShingle |
Engineering::Computer science and engineering Semantic Segmentation Domain Adaptation Lv, Fengmao Lin, Guosheng Liu, Peng Yang, Guowu Pan, Sinno Jialin Duan, Lixin Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
description |
Semantic segmentation, which aims to acquire pixel-level understanding about images, is among the key components in computer vision. To train a good segmentation model for real-world images, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of real-world images beforehand. To get rid of such a nontrivial burden, one can use simulators to automatically generate synthetic images that inherently contain full pixel-level annotations and use them to train a segmentation model for the real-world images. However, training with synthetic images usually cannot lead to good performance due to the domain difference between the synthetic images (i.e., source domain) and the real-world images (i.e., target domain). To deal with this issue, a number of unsupervised domain adaptation (UDA) approaches have been proposed, where no labeled real-world images are available. Different from those methods, in this work, we conduct a pioneer attempt by using easy-to-collect image-level annotations for target images to improve the performance of cross-domain segmentation. Specifically, we leverage those image-level annotations to construct curriculums for the domain adaptation problem. The curriculums describe multi-level properties of the target domain, including label distributions over full images, local regions and single pixels. Since image annotations are 'weak' labels compared to pixel annotations for segmentation, we coin this new problem as weakly-supervised cross-domain segmentation. Comprehensive experiments on the GTA5 -> Cityscapes and SYNTHIA -> Cityscapes settings demonstrate the effectiveness of our method over the existing state-of-the-art baselines. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Lv, Fengmao Lin, Guosheng Liu, Peng Yang, Guowu Pan, Sinno Jialin Duan, Lixin |
format |
Article |
author |
Lv, Fengmao Lin, Guosheng Liu, Peng Yang, Guowu Pan, Sinno Jialin Duan, Lixin |
author_sort |
Lv, Fengmao |
title |
Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
title_short |
Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
title_full |
Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
title_fullStr |
Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
title_full_unstemmed |
Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
title_sort |
weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/160522 |
_version_ |
1739837398928326656 |