Exploring context with deep structured models for semantic segmentation

State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore patch-patch context and patch-background context in...

Full description

Saved in:
Bibliographic Details
Main Authors: van den Hengel, Anton, Lin, Guosheng, Shen, Chunhua, Reid, Ian
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/10356/106417
http://hdl.handle.net/10220/47945
http://dx.doi.org/10.1109/TPAMI.2017.2708714
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-106417
record_format dspace
spelling sg-ntu-dr.10356-1064172019-12-06T22:11:14Z Exploring context with deep structured models for semantic segmentation van den Hengel, Anton Lin, Guosheng Shen, Chunhua Reid, Ian School of Computer Science and Engineering Convolutional Neural Networks DRNTU::Engineering::Computer science and engineering Semantic Segmentation State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore patch-patch context and patch-background context in deep CNNs. We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions. Specifically, we formulate CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied in order to avoid repeated expensive CRF inference during the course of back propagation.For capturing the patch-background context, we show that a network design with traditional multi-scale image inputs and sliding pyramid pooling is very effective for improving performance. We perform comprehensive evaluation of the proposed method. We achieve new state-of-the-art performance on a number of challenging semantic segmentation datasets. Accepted version 2019-04-01T02:58:34Z 2019-12-06T22:11:14Z 2019-04-01T02:58:34Z 2019-12-06T22:11:14Z 2017 Journal Article Lin, G., Shen, C., van den Hengel, A., & Reid, I. (2018). Exploring context with deep structured models for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(6), 1352-1366. doi:10.1109/TPAMI.2017.2708714 0162-8828 https://hdl.handle.net/10356/106417 http://hdl.handle.net/10220/47945 http://dx.doi.org/10.1109/TPAMI.2017.2708714 en IEEE Transactions on Pattern Analysis and Machine Intelligence © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TPAMI.2017.2708714 16 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Convolutional Neural Networks
DRNTU::Engineering::Computer science and engineering
Semantic Segmentation
spellingShingle Convolutional Neural Networks
DRNTU::Engineering::Computer science and engineering
Semantic Segmentation
van den Hengel, Anton
Lin, Guosheng
Shen, Chunhua
Reid, Ian
Exploring context with deep structured models for semantic segmentation
description State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore patch-patch context and patch-background context in deep CNNs. We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions. Specifically, we formulate CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied in order to avoid repeated expensive CRF inference during the course of back propagation.For capturing the patch-background context, we show that a network design with traditional multi-scale image inputs and sliding pyramid pooling is very effective for improving performance. We perform comprehensive evaluation of the proposed method. We achieve new state-of-the-art performance on a number of challenging semantic segmentation datasets.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
van den Hengel, Anton
Lin, Guosheng
Shen, Chunhua
Reid, Ian
format Article
author van den Hengel, Anton
Lin, Guosheng
Shen, Chunhua
Reid, Ian
author_sort van den Hengel, Anton
title Exploring context with deep structured models for semantic segmentation
title_short Exploring context with deep structured models for semantic segmentation
title_full Exploring context with deep structured models for semantic segmentation
title_fullStr Exploring context with deep structured models for semantic segmentation
title_full_unstemmed Exploring context with deep structured models for semantic segmentation
title_sort exploring context with deep structured models for semantic segmentation
publishDate 2019
url https://hdl.handle.net/10356/106417
http://hdl.handle.net/10220/47945
http://dx.doi.org/10.1109/TPAMI.2017.2708714
_version_ 1681040572952870912