Exploring context with deep structured models for semantic segmentation
State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore patch-patch context and patch-background context in...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/106417 http://hdl.handle.net/10220/47945 http://dx.doi.org/10.1109/TPAMI.2017.2708714 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-106417 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1064172019-12-06T22:11:14Z Exploring context with deep structured models for semantic segmentation van den Hengel, Anton Lin, Guosheng Shen, Chunhua Reid, Ian School of Computer Science and Engineering Convolutional Neural Networks DRNTU::Engineering::Computer science and engineering Semantic Segmentation State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore patch-patch context and patch-background context in deep CNNs. We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions. Specifically, we formulate CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied in order to avoid repeated expensive CRF inference during the course of back propagation.For capturing the patch-background context, we show that a network design with traditional multi-scale image inputs and sliding pyramid pooling is very effective for improving performance. We perform comprehensive evaluation of the proposed method. We achieve new state-of-the-art performance on a number of challenging semantic segmentation datasets. Accepted version 2019-04-01T02:58:34Z 2019-12-06T22:11:14Z 2019-04-01T02:58:34Z 2019-12-06T22:11:14Z 2017 Journal Article Lin, G., Shen, C., van den Hengel, A., & Reid, I. (2018). Exploring context with deep structured models for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(6), 1352-1366. doi:10.1109/TPAMI.2017.2708714 0162-8828 https://hdl.handle.net/10356/106417 http://hdl.handle.net/10220/47945 http://dx.doi.org/10.1109/TPAMI.2017.2708714 en IEEE Transactions on Pattern Analysis and Machine Intelligence © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TPAMI.2017.2708714 16 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Convolutional Neural Networks DRNTU::Engineering::Computer science and engineering Semantic Segmentation |
spellingShingle |
Convolutional Neural Networks DRNTU::Engineering::Computer science and engineering Semantic Segmentation van den Hengel, Anton Lin, Guosheng Shen, Chunhua Reid, Ian Exploring context with deep structured models for semantic segmentation |
description |
State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore patch-patch context and patch-background context in deep CNNs. We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions. Specifically, we formulate CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied in order to avoid repeated expensive CRF inference during the course of back propagation.For capturing the patch-background context, we show that a network design with traditional multi-scale image inputs and sliding pyramid pooling is very effective for improving performance. We perform comprehensive evaluation of the proposed method. We achieve new state-of-the-art performance on a number of challenging semantic segmentation datasets. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering van den Hengel, Anton Lin, Guosheng Shen, Chunhua Reid, Ian |
format |
Article |
author |
van den Hengel, Anton Lin, Guosheng Shen, Chunhua Reid, Ian |
author_sort |
van den Hengel, Anton |
title |
Exploring context with deep structured models for semantic segmentation |
title_short |
Exploring context with deep structured models for semantic segmentation |
title_full |
Exploring context with deep structured models for semantic segmentation |
title_fullStr |
Exploring context with deep structured models for semantic segmentation |
title_full_unstemmed |
Exploring context with deep structured models for semantic segmentation |
title_sort |
exploring context with deep structured models for semantic segmentation |
publishDate |
2019 |
url |
https://hdl.handle.net/10356/106417 http://hdl.handle.net/10220/47945 http://dx.doi.org/10.1109/TPAMI.2017.2708714 |
_version_ |
1681040572952870912 |