SmooSeg: smoothness prior for unsupervised semantic segmentation
Unsupervised semantic segmentation is a challenging task that segments images into semantic groups without manual annotation. Prior works have primarily focused on leveraging prior knowledge of semantic consistency or priori concepts from self-supervised learning methods, which often overlook the co...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/173324 https://neurips.cc/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-173324 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1733242024-02-02T15:34:59Z SmooSeg: smoothness prior for unsupervised semantic segmentation Lan, Mengcheng Wang, Xinjiang Ke, Yiping Xu, Jiaxing Feng, Litong Zhang, Wayne School of Computer Science and Engineering 37th Conference on Neural Information Processing Systems (NeurIPS 2023) SenseTime Research S-Lab Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Unsupervised Semantic Segmentation Smoothness Prior Unsupervised semantic segmentation is a challenging task that segments images into semantic groups without manual annotation. Prior works have primarily focused on leveraging prior knowledge of semantic consistency or priori concepts from self-supervised learning methods, which often overlook the coherence property of image segments. In this paper, we demonstrate that the smoothness prior, asserting that close features in a metric space share the same semantics, can significantly simplify segmentation by casting unsupervised semantic segmentation as an energy minimization problem. Under this paradigm, we propose a novel approach called SmooSeg that harnesses self-supervised learning methods to model the closeness relationships among observations as smoothness signals. To effectively discover coherent semantic segments, we introduce a novel smoothness loss that promotes piecewise smoothness within segments while preserving discontinuities across different segments. Additionally, to further enhance segmentation quality, we design an asymmetric teacher-student style predictor that generates smoothly updated pseudo labels, facilitating an optimal fit between observations and labeling outputs. Thanks to the rich supervision cues of the smoothness prior, our SmooSeg significantly outperforms STEGO in terms of pixel accuracy on three datasets: COCOStuff (+14.9%), Cityscapes (+13.0%), and Potsdam-3 (+5.7%). Ministry of Education (MOE) National Research Foundation (NRF) Submitted/Accepted version This research is supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s), by the National Research Foundation, Singapore under its Industry Alignment Fund – Pre-positioning (IAF-PP) Funding Initiative, and by the Ministry of Education, Singapore under its MOE Academic Research Fund Tier 2 (STEM RIE2025 Award MOE-T2EP20220- 0006). 2024-01-29T01:49:34Z 2024-01-29T01:49:34Z 2023 Conference Paper Lan, M., Wang, X., Ke, Y., Xu, J., Feng, L. & Zhang, W. (2023). SmooSeg: smoothness prior for unsupervised semantic segmentation. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 1-21. https://hdl.handle.net/10356/173324 https://neurips.cc/ 1 21 en IAF-ICP MOE-T2EP20220- 0006 © 2023 The Author(s). Published by Neural Information Processing Systems. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Unsupervised Semantic Segmentation Smoothness Prior |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Unsupervised Semantic Segmentation Smoothness Prior Lan, Mengcheng Wang, Xinjiang Ke, Yiping Xu, Jiaxing Feng, Litong Zhang, Wayne SmooSeg: smoothness prior for unsupervised semantic segmentation |
description |
Unsupervised semantic segmentation is a challenging task that segments images into semantic groups without manual annotation. Prior works have primarily focused on leveraging prior knowledge of semantic consistency or priori concepts from self-supervised learning methods, which often overlook the coherence property of image segments. In this paper, we demonstrate that the smoothness prior, asserting that close features in a metric space share the same semantics, can significantly simplify segmentation by casting unsupervised semantic segmentation as an energy minimization problem. Under this paradigm, we propose a novel approach called SmooSeg that harnesses self-supervised learning methods to model the
closeness relationships among observations as smoothness signals. To effectively discover coherent semantic segments, we introduce a novel smoothness loss that promotes piecewise smoothness within segments while preserving discontinuities across different segments. Additionally, to further enhance segmentation quality, we design an asymmetric teacher-student style predictor that generates smoothly
updated pseudo labels, facilitating an optimal fit between observations and labeling outputs. Thanks to the rich supervision cues of the smoothness prior, our SmooSeg significantly outperforms STEGO in terms of pixel accuracy on three datasets: COCOStuff (+14.9%), Cityscapes (+13.0%), and Potsdam-3 (+5.7%). |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Lan, Mengcheng Wang, Xinjiang Ke, Yiping Xu, Jiaxing Feng, Litong Zhang, Wayne |
format |
Conference or Workshop Item |
author |
Lan, Mengcheng Wang, Xinjiang Ke, Yiping Xu, Jiaxing Feng, Litong Zhang, Wayne |
author_sort |
Lan, Mengcheng |
title |
SmooSeg: smoothness prior for unsupervised semantic segmentation |
title_short |
SmooSeg: smoothness prior for unsupervised semantic segmentation |
title_full |
SmooSeg: smoothness prior for unsupervised semantic segmentation |
title_fullStr |
SmooSeg: smoothness prior for unsupervised semantic segmentation |
title_full_unstemmed |
SmooSeg: smoothness prior for unsupervised semantic segmentation |
title_sort |
smooseg: smoothness prior for unsupervised semantic segmentation |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/173324 https://neurips.cc/ |
_version_ |
1789968695748984832 |