Semantic correlation promoted shape-variant context for segmentation
Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context informa...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/140371 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-140371 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1403712020-09-26T21:52:49Z Semantic correlation promoted shape-variant context for segmentation Ding, Henghui Jiang, Xudong Shuai, Bing Liu, Ai Qun Wang, Gang School of Electrical and Electronic Engineering 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Institute for Media Innovation (IMI) Engineering::Computer science and engineering Segmentation Computer Vision Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context information from a predefined fixed region. In this work, we propose to generate a scale- and shape-variant semantic mask for each pixel to confine its contextual region. To this end, we first propose a novel paired convolution to infer the semantic correlation of the pair and based on that to generate a shape mask. Using the inferred spatial scope of the contextual region, we propose a shape-variant convolution, of which the receptive field is controlled by the shape mask that varies with the appearance of input. In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region. Furthermore, this work also proposes a labeling denoising model to reduce wrong predictions caused by the noisy low-level features. Without bells and whistles, the proposed segmentation network achieves new state-of-the-arts consistently on the six public segmentation datasets. NRF (Natl Research Foundation, S’pore) MOE (Min. of Education, S’pore) Accepted version 2020-05-28T06:01:29Z 2020-05-28T06:01:29Z 2020 Conference Paper Ding, H., Jiang, X., Shuai, B., Liu, A. Q., & Wang, G. (2020). Semantic correlation promoted shape-variant context for segmentation. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8877-8886. doi:10.1109/CVPR.2019.00909 978-1-7281-3294-5 https://hdl.handle.net/10356/140371 10.1109/CVPR.2019.00909 2-s2.0-85076823241 8877 8886 en © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/CVPR.2019.00909 application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Segmentation Computer Vision |
spellingShingle |
Engineering::Computer science and engineering Segmentation Computer Vision Ding, Henghui Jiang, Xudong Shuai, Bing Liu, Ai Qun Wang, Gang Semantic correlation promoted shape-variant context for segmentation |
description |
Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context information from a predefined fixed region. In this work, we propose to generate a scale- and shape-variant semantic mask for each pixel to confine its contextual region. To this end, we first propose a novel paired convolution to infer the semantic correlation of the pair and based on that to generate a shape mask. Using the inferred spatial scope of the contextual region, we propose a shape-variant convolution, of which the receptive field is controlled by the shape mask that varies with the appearance of input. In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region. Furthermore, this work also proposes a labeling denoising model to reduce wrong predictions caused by the noisy low-level features. Without bells and whistles, the proposed segmentation network achieves new state-of-the-arts consistently on the six public segmentation datasets. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Ding, Henghui Jiang, Xudong Shuai, Bing Liu, Ai Qun Wang, Gang |
format |
Conference or Workshop Item |
author |
Ding, Henghui Jiang, Xudong Shuai, Bing Liu, Ai Qun Wang, Gang |
author_sort |
Ding, Henghui |
title |
Semantic correlation promoted shape-variant context for segmentation |
title_short |
Semantic correlation promoted shape-variant context for segmentation |
title_full |
Semantic correlation promoted shape-variant context for segmentation |
title_fullStr |
Semantic correlation promoted shape-variant context for segmentation |
title_full_unstemmed |
Semantic correlation promoted shape-variant context for segmentation |
title_sort |
semantic correlation promoted shape-variant context for segmentation |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/140371 |
_version_ |
1681057292452102144 |