Visiting the Invisible: layer-by-layer completed scene decomposition

Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real-world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input....

Full description

Saved in:
Bibliographic Details
Main Authors: Zheng, Chuanxia, Dao, Duy-Son, Song, Guoxian, Cham, Tat-Jen, Cai, Jianfei
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172650
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172650
record_format dspace
spelling sg-ntu-dr.10356-1726502023-12-19T02:15:46Z Visiting the Invisible: layer-by-layer completed scene decomposition Zheng, Chuanxia Dao, Duy-Son Song, Guoxian Cham, Tat-Jen Cai, Jianfei School of Computer Science and Engineering Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Layered Scene Decomposition Scene Completion Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real-world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input. In this work, we propose a higher-level scene understanding system to tackle both visible and invisible parts of objects and backgrounds in a given scene. Particularly, we built a system to decompose a scene into individual objects, infer their underlying occlusion relationships, and even automatically learn which parts of the objects are occluded that need to be completed. In order to disentangle the occluded relationships of all objects in a complex scene, we use the fact that the front object without being occluded is easy to be identified, detected, and segmented. Our system interleaves the two tasks of instance segmentation and scene completion through multiple iterations, solving for objects layer-by-layer. We first provide a thorough experiment using a new realistically rendered dataset with ground-truths for all invisible regions. To bridge the domain gap to real imagery where ground-truths are unavailable, we then train another model with the pseudo-ground-truths generated from our trained synthesis model. We demonstrate results on a wide variety of datasets and show significant improvement over the state-of-the-art. This study is supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from Singapore Telecommunications Limited (Singtel), through Singtel Cognitive and Artificial Intelligence Lab for Enterprises (SCALE@NTU). This research is also supported by the Monash FIT Start-up Grant. 2023-12-19T02:15:45Z 2023-12-19T02:15:45Z 2021 Journal Article Zheng, C., Dao, D., Song, G., Cham, T. & Cai, J. (2021). Visiting the Invisible: layer-by-layer completed scene decomposition. International Journal of Computer Vision, 129(12), 3195-3215. https://dx.doi.org/10.1007/s11263-021-01517-0 0920-5691 https://hdl.handle.net/10356/172650 10.1007/s11263-021-01517-0 2-s2.0-85115825345 12 129 3195 3215 en IAF-ICP International Journal of Computer Vision © 2021 The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Layered Scene Decomposition
Scene Completion
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Layered Scene Decomposition
Scene Completion
Zheng, Chuanxia
Dao, Duy-Son
Song, Guoxian
Cham, Tat-Jen
Cai, Jianfei
Visiting the Invisible: layer-by-layer completed scene decomposition
description Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real-world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input. In this work, we propose a higher-level scene understanding system to tackle both visible and invisible parts of objects and backgrounds in a given scene. Particularly, we built a system to decompose a scene into individual objects, infer their underlying occlusion relationships, and even automatically learn which parts of the objects are occluded that need to be completed. In order to disentangle the occluded relationships of all objects in a complex scene, we use the fact that the front object without being occluded is easy to be identified, detected, and segmented. Our system interleaves the two tasks of instance segmentation and scene completion through multiple iterations, solving for objects layer-by-layer. We first provide a thorough experiment using a new realistically rendered dataset with ground-truths for all invisible regions. To bridge the domain gap to real imagery where ground-truths are unavailable, we then train another model with the pseudo-ground-truths generated from our trained synthesis model. We demonstrate results on a wide variety of datasets and show significant improvement over the state-of-the-art.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Zheng, Chuanxia
Dao, Duy-Son
Song, Guoxian
Cham, Tat-Jen
Cai, Jianfei
format Article
author Zheng, Chuanxia
Dao, Duy-Son
Song, Guoxian
Cham, Tat-Jen
Cai, Jianfei
author_sort Zheng, Chuanxia
title Visiting the Invisible: layer-by-layer completed scene decomposition
title_short Visiting the Invisible: layer-by-layer completed scene decomposition
title_full Visiting the Invisible: layer-by-layer completed scene decomposition
title_fullStr Visiting the Invisible: layer-by-layer completed scene decomposition
title_full_unstemmed Visiting the Invisible: layer-by-layer completed scene decomposition
title_sort visiting the invisible: layer-by-layer completed scene decomposition
publishDate 2023
url https://hdl.handle.net/10356/172650
_version_ 1787136690410749952