Semantic correlation promoted shape-variant context for segmentation

Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context informa...

Full description

Saved in:
Bibliographic Details
Main Authors: Ding, Henghui, Jiang, Xudong, Shuai, Bing, Liu, Ai Qun, Wang, Gang
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/140371
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-140371
record_format dspace
spelling sg-ntu-dr.10356-1403712020-09-26T21:52:49Z Semantic correlation promoted shape-variant context for segmentation Ding, Henghui Jiang, Xudong Shuai, Bing Liu, Ai Qun Wang, Gang School of Electrical and Electronic Engineering 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Institute for Media Innovation (IMI) Engineering::Computer science and engineering Segmentation Computer Vision Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context information from a predefined fixed region. In this work, we propose to generate a scale- and shape-variant semantic mask for each pixel to confine its contextual region. To this end, we first propose a novel paired convolution to infer the semantic correlation of the pair and based on that to generate a shape mask. Using the inferred spatial scope of the contextual region, we propose a shape-variant convolution, of which the receptive field is controlled by the shape mask that varies with the appearance of input. In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region. Furthermore, this work also proposes a labeling denoising model to reduce wrong predictions caused by the noisy low-level features. Without bells and whistles, the proposed segmentation network achieves new state-of-the-arts consistently on the six public segmentation datasets. NRF (Natl Research Foundation, S’pore) MOE (Min. of Education, S’pore) Accepted version 2020-05-28T06:01:29Z 2020-05-28T06:01:29Z 2020 Conference Paper Ding, H., Jiang, X., Shuai, B., Liu, A. Q., & Wang, G. (2020). Semantic correlation promoted shape-variant context for segmentation. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8877-8886. doi:10.1109/CVPR.2019.00909 978-1-7281-3294-5 https://hdl.handle.net/10356/140371 10.1109/CVPR.2019.00909 2-s2.0-85076823241 8877 8886 en © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/CVPR.2019.00909 application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Segmentation
Computer Vision
spellingShingle Engineering::Computer science and engineering
Segmentation
Computer Vision
Ding, Henghui
Jiang, Xudong
Shuai, Bing
Liu, Ai Qun
Wang, Gang
Semantic correlation promoted shape-variant context for segmentation
description Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context information from a predefined fixed region. In this work, we propose to generate a scale- and shape-variant semantic mask for each pixel to confine its contextual region. To this end, we first propose a novel paired convolution to infer the semantic correlation of the pair and based on that to generate a shape mask. Using the inferred spatial scope of the contextual region, we propose a shape-variant convolution, of which the receptive field is controlled by the shape mask that varies with the appearance of input. In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region. Furthermore, this work also proposes a labeling denoising model to reduce wrong predictions caused by the noisy low-level features. Without bells and whistles, the proposed segmentation network achieves new state-of-the-arts consistently on the six public segmentation datasets.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Ding, Henghui
Jiang, Xudong
Shuai, Bing
Liu, Ai Qun
Wang, Gang
format Conference or Workshop Item
author Ding, Henghui
Jiang, Xudong
Shuai, Bing
Liu, Ai Qun
Wang, Gang
author_sort Ding, Henghui
title Semantic correlation promoted shape-variant context for segmentation
title_short Semantic correlation promoted shape-variant context for segmentation
title_full Semantic correlation promoted shape-variant context for segmentation
title_fullStr Semantic correlation promoted shape-variant context for segmentation
title_full_unstemmed Semantic correlation promoted shape-variant context for segmentation
title_sort semantic correlation promoted shape-variant context for segmentation
publishDate 2020
url https://hdl.handle.net/10356/140371
_version_ 1681057292452102144