Deep learning approaches for object co-segmentation and one-shot segmentation
Image co-segmentation is an active computer vision task that aims to discover and segment the shared objects given multiple images. Recently, researchers design various learning-based algorithms to handle the co-segmentation task. The main difficulty in this task is how to effectively transfer infor...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/149937 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-149937 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1499372021-07-08T16:01:19Z Deep learning approaches for object co-segmentation and one-shot segmentation Li, Guankai Lin Guosheng School of Computer Science and Engineering Hardware & Embedded Systems Lab (HESL) gslin@ntu.edu.sg Engineering::Computer science and engineering Image co-segmentation is an active computer vision task that aims to discover and segment the shared objects given multiple images. Recently, researchers design various learning-based algorithms to handle the co-segmentation task. The main difficulty in this task is how to effectively transfer information between images to infer the common object regions. In this thesis, we present CycleSegNet, an effective and novel approach for the co-segmentation task. Our network design has two key components: a region correspondence module which is the basic operation for exchanging information between local image regions, and a cycle refinement module which utilizes ConvLSTMs to progressively update image embeddings and exchange information in a cycle manner. Experiment results on four popular benchmark datasets --- PASCAL VOC dataset, MSRC dataset, Internet dataset, and iCoseg dataset indicate that our proposed approach greatly outperforms the existing networks and achieves new state-of-the-art performance. In addition to image co-segmentation, we also explore a method to solve one-shot segmentation with only weak supervision (bounding box). One-shot semantic segmentation has recently gained attention for its strong generalization ability to segment unseen-class images given only limited annotated image. However, existing methods in one-shot object segmentation have mainly relied on manually pixel-wise labeled segmentation masks. The main challenge in this task is limited data and weak supervision. In this thesis, we present an effective approach, which utilizes the recent weakly-supervised semantic segmentation method to generate pseudo mask labels in the bounding box regions and then integrates the detailed information and correlation between support image and query image for solving one-shot image segmentation. Extensive experiments on the PASCAL-5i dataset show that our weakly-supervised method narrows down the performance gap between bounding box supervision and pixel-wise annotations, and performs comparably with the state-of-the-art fully-supervised one-shot methods. Master of Engineering 2021-06-22T09:20:51Z 2021-06-22T09:20:51Z 2021 Thesis-Master by Research Li, G. (2021). Deep learning approaches for object co-segmentation and one-shot segmentation. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149937 https://hdl.handle.net/10356/149937 10.32657/10356/149937 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Li, Guankai Deep learning approaches for object co-segmentation and one-shot segmentation |
description |
Image co-segmentation is an active computer vision task that aims to discover and segment the shared objects given multiple images. Recently, researchers design various learning-based algorithms to handle the co-segmentation task. The main difficulty in this task is how to effectively transfer information between images to infer the common object regions. In this thesis, we present CycleSegNet, an effective and novel approach for the co-segmentation task. Our network design has two key components: a region correspondence module which is the basic operation for exchanging information between local image regions, and a cycle refinement module which utilizes ConvLSTMs to progressively update image embeddings and exchange information in a cycle manner. Experiment results on four popular benchmark datasets --- PASCAL VOC dataset, MSRC dataset, Internet dataset, and iCoseg dataset indicate that our proposed approach greatly outperforms the existing networks and achieves new state-of-the-art performance.
In addition to image co-segmentation, we also explore a method to solve one-shot segmentation with only weak supervision (bounding box). One-shot semantic segmentation has recently gained attention for its strong generalization ability to segment unseen-class images given only limited annotated image. However, existing methods in one-shot object segmentation have mainly relied on manually pixel-wise labeled segmentation masks.
The main challenge in this task is limited data and weak supervision. In this thesis, we present an effective approach, which utilizes the recent weakly-supervised semantic segmentation method to generate pseudo mask labels in the bounding box regions and then integrates the detailed information and correlation between support image and query image for solving one-shot image segmentation. Extensive experiments on the PASCAL-5i dataset show that our weakly-supervised method narrows down the performance gap between bounding box supervision and pixel-wise annotations, and performs comparably with the state-of-the-art fully-supervised one-shot methods. |
author2 |
Lin Guosheng |
author_facet |
Lin Guosheng Li, Guankai |
format |
Thesis-Master by Research |
author |
Li, Guankai |
author_sort |
Li, Guankai |
title |
Deep learning approaches for object co-segmentation and one-shot segmentation |
title_short |
Deep learning approaches for object co-segmentation and one-shot segmentation |
title_full |
Deep learning approaches for object co-segmentation and one-shot segmentation |
title_fullStr |
Deep learning approaches for object co-segmentation and one-shot segmentation |
title_full_unstemmed |
Deep learning approaches for object co-segmentation and one-shot segmentation |
title_sort |
deep learning approaches for object co-segmentation and one-shot segmentation |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/149937 |
_version_ |
1705151279558295552 |