Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation

Semantic segmentation, which aims to acquire pixel-level understanding about images, is among the key components in computer vision. To train a good segmentation model for real-world images, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of re...

Full description

Saved in:
Bibliographic Details
Main Authors: Lv, Fengmao, Lin, Guosheng, Liu, Peng, Yang, Guowu, Pan, Sinno Jialin, Duan, Lixin
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160522
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-160522
record_format dspace
spelling sg-ntu-dr.10356-1605222022-07-26T05:03:23Z Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation Lv, Fengmao Lin, Guosheng Liu, Peng Yang, Guowu Pan, Sinno Jialin Duan, Lixin School of Computer Science and Engineering Engineering::Computer science and engineering Semantic Segmentation Domain Adaptation Semantic segmentation, which aims to acquire pixel-level understanding about images, is among the key components in computer vision. To train a good segmentation model for real-world images, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of real-world images beforehand. To get rid of such a nontrivial burden, one can use simulators to automatically generate synthetic images that inherently contain full pixel-level annotations and use them to train a segmentation model for the real-world images. However, training with synthetic images usually cannot lead to good performance due to the domain difference between the synthetic images (i.e., source domain) and the real-world images (i.e., target domain). To deal with this issue, a number of unsupervised domain adaptation (UDA) approaches have been proposed, where no labeled real-world images are available. Different from those methods, in this work, we conduct a pioneer attempt by using easy-to-collect image-level annotations for target images to improve the performance of cross-domain segmentation. Specifically, we leverage those image-level annotations to construct curriculums for the domain adaptation problem. The curriculums describe multi-level properties of the target domain, including label distributions over full images, local regions and single pixels. Since image annotations are 'weak' labels compared to pixel annotations for segmentation, we coin this new problem as weakly-supervised cross-domain segmentation. Comprehensive experiments on the GTA5 -> Cityscapes and SYNTHIA -> Cityscapes settings demonstrate the effectiveness of our method over the existing state-of-the-art baselines. This work was supported in part by the Major Project for New Generation of AI under Grant 2018AAA0100400; in part by the National Natural Science Foundation of China under Grant 11829101, Grant 11931014, and Grant 61772118; and in part by the Fundamental Research Funds for the Central Universities of China under Grant JBK1806002. 2022-07-26T05:03:23Z 2022-07-26T05:03:23Z 2020 Journal Article Lv, F., Lin, G., Liu, P., Yang, G., Pan, S. J. & Duan, L. (2020). Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation. IEEE Transactions On Circuits and Systems for Video Technology, 31(9), 3493-3503. https://dx.doi.org/10.1109/TCSVT.2020.3040343 1051-8215 https://hdl.handle.net/10356/160522 10.1109/TCSVT.2020.3040343 2-s2.0-85097186183 9 31 3493 3503 en IEEE Transactions on Circuits and Systems for Video Technology © 2020 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Semantic Segmentation
Domain Adaptation
spellingShingle Engineering::Computer science and engineering
Semantic Segmentation
Domain Adaptation
Lv, Fengmao
Lin, Guosheng
Liu, Peng
Yang, Guowu
Pan, Sinno Jialin
Duan, Lixin
Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
description Semantic segmentation, which aims to acquire pixel-level understanding about images, is among the key components in computer vision. To train a good segmentation model for real-world images, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of real-world images beforehand. To get rid of such a nontrivial burden, one can use simulators to automatically generate synthetic images that inherently contain full pixel-level annotations and use them to train a segmentation model for the real-world images. However, training with synthetic images usually cannot lead to good performance due to the domain difference between the synthetic images (i.e., source domain) and the real-world images (i.e., target domain). To deal with this issue, a number of unsupervised domain adaptation (UDA) approaches have been proposed, where no labeled real-world images are available. Different from those methods, in this work, we conduct a pioneer attempt by using easy-to-collect image-level annotations for target images to improve the performance of cross-domain segmentation. Specifically, we leverage those image-level annotations to construct curriculums for the domain adaptation problem. The curriculums describe multi-level properties of the target domain, including label distributions over full images, local regions and single pixels. Since image annotations are 'weak' labels compared to pixel annotations for segmentation, we coin this new problem as weakly-supervised cross-domain segmentation. Comprehensive experiments on the GTA5 -> Cityscapes and SYNTHIA -> Cityscapes settings demonstrate the effectiveness of our method over the existing state-of-the-art baselines.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Lv, Fengmao
Lin, Guosheng
Liu, Peng
Yang, Guowu
Pan, Sinno Jialin
Duan, Lixin
format Article
author Lv, Fengmao
Lin, Guosheng
Liu, Peng
Yang, Guowu
Pan, Sinno Jialin
Duan, Lixin
author_sort Lv, Fengmao
title Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
title_short Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
title_full Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
title_fullStr Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
title_full_unstemmed Weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
title_sort weakly-supervised cross-domain road scene segmentation via multi-level curriculum adaptation
publishDate 2022
url https://hdl.handle.net/10356/160522
_version_ 1739837398928326656