Visiting the Invisible: layer-by-layer completed scene decomposition
Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real-world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input....
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/172650 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-172650 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1726502023-12-19T02:15:46Z Visiting the Invisible: layer-by-layer completed scene decomposition Zheng, Chuanxia Dao, Duy-Son Song, Guoxian Cham, Tat-Jen Cai, Jianfei School of Computer Science and Engineering Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Layered Scene Decomposition Scene Completion Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real-world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input. In this work, we propose a higher-level scene understanding system to tackle both visible and invisible parts of objects and backgrounds in a given scene. Particularly, we built a system to decompose a scene into individual objects, infer their underlying occlusion relationships, and even automatically learn which parts of the objects are occluded that need to be completed. In order to disentangle the occluded relationships of all objects in a complex scene, we use the fact that the front object without being occluded is easy to be identified, detected, and segmented. Our system interleaves the two tasks of instance segmentation and scene completion through multiple iterations, solving for objects layer-by-layer. We first provide a thorough experiment using a new realistically rendered dataset with ground-truths for all invisible regions. To bridge the domain gap to real imagery where ground-truths are unavailable, we then train another model with the pseudo-ground-truths generated from our trained synthesis model. We demonstrate results on a wide variety of datasets and show significant improvement over the state-of-the-art. This study is supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from Singapore Telecommunications Limited (Singtel), through Singtel Cognitive and Artificial Intelligence Lab for Enterprises (SCALE@NTU). This research is also supported by the Monash FIT Start-up Grant. 2023-12-19T02:15:45Z 2023-12-19T02:15:45Z 2021 Journal Article Zheng, C., Dao, D., Song, G., Cham, T. & Cai, J. (2021). Visiting the Invisible: layer-by-layer completed scene decomposition. International Journal of Computer Vision, 129(12), 3195-3215. https://dx.doi.org/10.1007/s11263-021-01517-0 0920-5691 https://hdl.handle.net/10356/172650 10.1007/s11263-021-01517-0 2-s2.0-85115825345 12 129 3195 3215 en IAF-ICP International Journal of Computer Vision © 2021 The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Layered Scene Decomposition Scene Completion |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Layered Scene Decomposition Scene Completion Zheng, Chuanxia Dao, Duy-Son Song, Guoxian Cham, Tat-Jen Cai, Jianfei Visiting the Invisible: layer-by-layer completed scene decomposition |
description |
Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real-world. Concurrently, image completion has aimed to create plausible appearance for the invisible regions, but requires a manual mask as input. In this work, we propose a higher-level scene understanding system to tackle both visible and invisible parts of objects and backgrounds in a given scene. Particularly, we built a system to decompose a scene into individual objects, infer their underlying occlusion relationships, and even automatically learn which parts of the objects are occluded that need to be completed. In order to disentangle the occluded relationships of all objects in a complex scene, we use the fact that the front object without being occluded is easy to be identified, detected, and segmented. Our system interleaves the two tasks of instance segmentation and scene completion through multiple iterations, solving for objects layer-by-layer. We first provide a thorough experiment using a new realistically rendered dataset with ground-truths for all invisible regions. To bridge the domain gap to real imagery where ground-truths are unavailable, we then train another model with the pseudo-ground-truths generated from our trained synthesis model. We demonstrate results on a wide variety of datasets and show significant improvement over the state-of-the-art. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Zheng, Chuanxia Dao, Duy-Son Song, Guoxian Cham, Tat-Jen Cai, Jianfei |
format |
Article |
author |
Zheng, Chuanxia Dao, Duy-Son Song, Guoxian Cham, Tat-Jen Cai, Jianfei |
author_sort |
Zheng, Chuanxia |
title |
Visiting the Invisible: layer-by-layer completed scene decomposition |
title_short |
Visiting the Invisible: layer-by-layer completed scene decomposition |
title_full |
Visiting the Invisible: layer-by-layer completed scene decomposition |
title_fullStr |
Visiting the Invisible: layer-by-layer completed scene decomposition |
title_full_unstemmed |
Visiting the Invisible: layer-by-layer completed scene decomposition |
title_sort |
visiting the invisible: layer-by-layer completed scene decomposition |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/172650 |
_version_ |
1787136690410749952 |