Visiting the Invisible: layer-by-layer completed scene decomposition

Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input....

Bibliographic Details
Main Authors:	Zheng, Chuanxia; Dao, Duy-Son; Song, Guoxian; Cham, Tat-Jen; Cai, Jianfei
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision; Layered Scene Decomposition; Scene Completion
Online Access:	https://hdl.handle.net/10356/172650
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-172650
record_format	dspace
spelling	sg-ntu-dr.10356-172650 2023-12-19T02:15:46Z Visiting the Invisible: layer-by-layer completed scene decomposition Zheng, Chuanxia; Dao, Duy-Son; Song, Guoxian; Cham, Tat-Jen; Cai, Jianfei School of Computer Science and Engineering Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision; Layered Scene Decomposition; Scene Completion
This study is supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from Singapore Telecommunications Limited (Singtel), through Singtel Cognitive and Artificial Intelligence Lab for Enterprises (SCALE@NTU). This research is also supported by the Monash FIT Start-up Grant. 2023-12-19T02:15:45Z 2023-12-19T02:15:45Z 2021 Journal Article Zheng, C., Dao, D., Song, G., Cham, T. & Cai, J. (2021). Visiting the Invisible: layer-by-layer completed scene decomposition. International Journal of Computer Vision, 129(12), 3195-3215. https://dx.doi.org/10.1007/s11263-021-01517-0 0920-5691 https://hdl.handle.net/10356/172650 10.1007/s11263-021-01517-0 2-s2.0-85115825345 12 129 3195 3215 en IAF-ICP International Journal of Computer Vision © 2021 The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision; Layered Scene Decomposition; Scene Completion
description	Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input. In this work, we propose a higher-level scene understanding system to tackle both visible and invisible parts of objects and backgrounds in a given scene. In particular, we build a system to decompose a scene into individual objects, infer their underlying occlusion relationships, and even automatically learn which parts of the objects are occluded and need to be completed. To disentangle the occlusion relationships of all objects in a complex scene, we use the fact that an unoccluded front object is easy to identify, detect, and segment. Our system interleaves the two tasks of instance segmentation and scene completion through multiple iterations, solving for objects layer-by-layer. We first provide a thorough experiment using a new realistically rendered dataset with ground-truths for all invisible regions. To bridge the domain gap to real imagery where ground-truths are unavailable, we then train another model with the pseudo-ground-truths generated from our trained synthesis model. We demonstrate results on a wide variety of datasets and show significant improvement over the state-of-the-art.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering; Zheng, Chuanxia; Dao, Duy-Son; Song, Guoxian; Cham, Tat-Jen; Cai, Jianfei
format	Article
author	Zheng, Chuanxia; Dao, Duy-Son; Song, Guoxian; Cham, Tat-Jen; Cai, Jianfei
author_sort	Zheng, Chuanxia
title	Visiting the Invisible: layer-by-layer completed scene decomposition
title_sort	visiting the invisible: layer-by-layer completed scene decomposition
publishDate	2023
url	https://hdl.handle.net/10356/172650
_version_	1787136690410749952
