Semantic segmentation with less annotation efforts

Semantic segmentation is a pixel-wise classification task: predicting a class label for every pixel in an image. However, one obstacle limiting the development of semantic segmentation is that pixel-wise annotations for training images are difficult and e...

Bibliographic Details
Main Author:	Zhang, Tianyi
Other Authors:	Lin Guosheng
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Online Access:	https://hdl.handle.net/10356/140292
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-140292
record_format	dspace
spelling	sg-ntu-dr.10356-140292 2020-11-01T05:03:21Z Semantic segmentation with less annotation efforts Zhang, Tianyi Lin Guosheng Interdisciplinary Graduate School (IGS) gslin@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Semantic segmentation is a pixel-wise classification task: predicting a class label for every pixel in an image. However, one obstacle limiting the development of semantic segmentation is that pixel-wise annotations for training images are difficult and expensive to obtain. It is therefore difficult to apply segmentation methods to new datasets or semantic classes, since obtaining new annotated data is labor-intensive. The goal of this thesis is to reduce the annotation workload for semantic segmentation of images. From the viewpoint of the data domain, the approaches can be roughly classified into intra-domain and inter-domain approaches. Intra-domain approaches use weaker forms of supervision within datasets of the same data domain. This thesis investigates image-level labels as the supervision format for weakly supervised semantic segmentation. To recover pixel-wise annotations from image-level labels, region-mining models are trained to approximate the target object regions. The goal is to train region-mining models that highlight entire object regions rather than only the most discriminative ones. In this thesis, I investigate regularizing the region-mining model in both the forward pass and the backward pass of training. Inter-domain approaches transfer pixel-wise knowledge from another domain, whose data and pixel-wise annotations are easier to generate, to the target data domain. This thesis investigates transfer from synthetic source data with pixel-wise annotations to unlabeled real-world target data. Adversarial learning approaches are applied to narrow the domain gap between synthetic and real-world data, but they usually suffer from content misalignment. To alleviate this problem, this thesis proposes two ways to regularize adversarial learning: the first embeds global structure knowledge into the feature-level adversarial learning step, and the second back-propagates the final task loss into the pixel-wise adversarial learning step. This thesis presents methods from both categories to reduce the need for pixel-wise annotations in semantic image segmentation. Experiments show that the proposed methods achieve promising segmentation performance without using pixel-wise annotations. Doctor of Philosophy 2020-05-28T00:30:22Z 2020-05-28T00:30:22Z 2020 Thesis-Doctor of Philosophy Zhang, T. (2020). Semantic segmentation with less annotation efforts. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/140292 10.32657/10356/140292 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Zhang, Tianyi Semantic segmentation with less annotation efforts
description	Semantic segmentation is a pixel-wise classification task: predicting a class label for every pixel in an image. However, one obstacle limiting the development of semantic segmentation is that pixel-wise annotations for training images are difficult and expensive to obtain. It is therefore difficult to apply segmentation methods to new datasets or semantic classes, since obtaining new annotated data is labor-intensive. The goal of this thesis is to reduce the annotation workload for semantic segmentation of images. From the viewpoint of the data domain, the approaches can be roughly classified into intra-domain and inter-domain approaches. Intra-domain approaches use weaker forms of supervision within datasets of the same data domain. This thesis investigates image-level labels as the supervision format for weakly supervised semantic segmentation. To recover pixel-wise annotations from image-level labels, region-mining models are trained to approximate the target object regions. The goal is to train region-mining models that highlight entire object regions rather than only the most discriminative ones. In this thesis, I investigate regularizing the region-mining model in both the forward pass and the backward pass of training. Inter-domain approaches transfer pixel-wise knowledge from another domain, whose data and pixel-wise annotations are easier to generate, to the target data domain. This thesis investigates transfer from synthetic source data with pixel-wise annotations to unlabeled real-world target data. Adversarial learning approaches are applied to narrow the domain gap between synthetic and real-world data, but they usually suffer from content misalignment. To alleviate this problem, this thesis proposes two ways to regularize adversarial learning: the first embeds global structure knowledge into the feature-level adversarial learning step, and the second back-propagates the final task loss into the pixel-wise adversarial learning step. This thesis presents methods from both categories to reduce the need for pixel-wise annotations in semantic image segmentation. Experiments show that the proposed methods achieve promising segmentation performance without using pixel-wise annotations.
author2	Lin Guosheng
author_facet	Lin Guosheng Zhang, Tianyi
format	Thesis-Doctor of Philosophy
author	Zhang, Tianyi
author_sort	Zhang, Tianyi
title	Semantic segmentation with less annotation efforts
title_short	Semantic segmentation with less annotation efforts
title_full	Semantic segmentation with less annotation efforts
title_fullStr	Semantic segmentation with less annotation efforts
title_full_unstemmed	Semantic segmentation with less annotation efforts
title_sort	semantic segmentation with less annotation efforts
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/140292
_version_	1683494598582730752
