Semantic segmentation with less annotation efforts

Semantic segmentation is a pixel-wise classification task that predicts a class label for every pixel in an image. However, one obstacle limiting the development of semantic segmentation is that pixel-wise segmentation annotations for training images are quite difficult and e...

Bibliographic Details
Main Author: Zhang, Tianyi
Other Authors: Lin Guosheng
Format: Thesis-Doctor of Philosophy
Language: English
Published: Nanyang Technological University 2020
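Subjects: Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision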
Online Access: https://hdl.handle.net/10356/140292
Institution: Nanyang Technological University
id sg-ntu-dr.10356-140292
record_format dspace
spelling sg-ntu-dr.10356-140292 2020-11-01T05:03:21Z Semantic segmentation with less annotation efforts Zhang, Tianyi Lin Guosheng Interdisciplinary Graduate School (IGS) gslin@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Doctor of Philosophy 2020-05-28T00:30:22Z 2020-05-28T00:30:22Z 2020 Thesis-Doctor of Philosophy Zhang, T. (2020). Semantic segmentation with less annotation efforts. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/140292 10.32657/10356/140292 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Zhang, Tianyi
Semantic segmentation with less annotation efforts
description Semantic segmentation is a pixel-wise classification task: it predicts a class label for every pixel in an image. One obstacle limiting its development is that pixel-wise annotations for training images are difficult and expensive to obtain. Applying segmentation methods to new datasets or semantic classes is therefore hard, because producing new annotated data is labor-intensive. The goal of this thesis is to reduce the annotation workload for semantic segmentation of images. From the perspective of the data domain, approaches can be roughly divided into intra-domain and inter-domain approaches. Intra-domain approaches use weaker forms of supervision within datasets from the same data domain. This thesis investigates image-level labels as the supervision format for weakly supervised semantic segmentation. To recover pixel-wise annotations from image-level labels, region-mining models are trained to approximate the target object regions. The aim is to train region-mining models that highlight the whole object region rather than only its most discriminative parts. In this thesis, I investigate regularizing the region-mining model in both the forward pass and the backward pass of training. Inter-domain approaches transfer pixel-wise knowledge from another domain, whose data and pixel-wise annotations are easier to generate, to the target data domain. This thesis investigates transfer from synthetic source data with pixel-wise annotations to unlabeled real-world target data. Adversarial learning is applied to narrow the domain gap between synthetic and real-world data, but such approaches usually suffer from content misalignment. To alleviate this problem, the thesis proposes two ways to regularize adversarial learning: the first embeds global structure knowledge into the feature-level adversarial learning step, and the second back-propagates the final task loss into the pixel-wise adversarial learning step. The thesis thus presents methods from both categories to reduce the need for pixel-wise annotations in semantic image segmentation. Experiments show that the proposed methods achieve promising segmentation performance without using pixel-wise annotations.
author2 Lin Guosheng
author_facet Lin Guosheng
Zhang, Tianyi
format Thesis-Doctor of Philosophy
author Zhang, Tianyi
author_sort Zhang, Tianyi
title Semantic segmentation with less annotation efforts
title_short Semantic segmentation with less annotation efforts
title_full Semantic segmentation with less annotation efforts
title_fullStr Semantic segmentation with less annotation efforts
title_full_unstemmed Semantic segmentation with less annotation efforts
title_sort semantic segmentation with less annotation efforts
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/140292
_version_ 1683494598582730752