Panoramic image outpainting

Currently, Latent Diffusion Models (LDMs) are very adept at generating completely novel images. However, they tend to be lacking in generating images following specific conditions. This final year research project addresses the challenge of enhancing image generation using LDMs by incorporating cond...

Full description

Saved in:
Bibliographic Details
Main Author: Teo, Sydney Wen Xuen
Other Authors: Cham Tat Jen
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/174917
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-174917
record_format dspace
spelling sg-ntu-dr.10356-1749172024-04-19T15:45:39Z Panoramic image outpainting Teo, Sydney Wen Xuen Cham Tat Jen School of Computer Science and Engineering ASTJCham@ntu.edu.sg Computer and Information Science Generative AI Panoramic image outpainting Diffusion models Currently, Latent Diffusion Models (LDMs) are very adept at generating completely novel images. However, they tend to be lacking in generating images following specific conditions. This final year research project addresses the challenge of enhancing image generation using LDMs by incorporating conditional control. The purpose of the project is to explore the potential of conditional LDMs in facilitating light editing on images. This allows users to create realistic modifications without deep technical knowledge of the underlying processes. There is an increasingly large artistic community growing around generative AI, primarily developed with text prompts. However, there is a gap in light editing capabilities, hence expanding in this area can provide additional creative options. The project schedule breaks down the project into planning, researching, development, testing and evaluation stages. The method involved preparation of indoor dataset and light mask dataset, research of LDM functionality, and exploration of techniques for integrating conditioning mechanisms into LDMs. We developed a conditional LDM utilizing concatenation mechanism and binary light mask dataset which is able to produce high fidelity panoramic image outpainting. Thus, the results of the project demonstrate the feasibility and effectiveness of utilizing conditional LDMs for light editing tasks. Recommendations include the exploration of dynamic light masks dataset and the development of an intuitive user interface. Bachelor's degree 2024-04-16T04:44:23Z 2024-04-16T04:44:23Z 2024 Final Year Project (FYP) Teo, S. W. X. (2024). Panoramic image outpainting. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/174917 https://hdl.handle.net/10356/174917 en SCSE23-0032 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Generative AI
Panoramic image outpainting
Diffusion models
spellingShingle Computer and Information Science
Generative AI
Panoramic image outpainting
Diffusion models
Teo, Sydney Wen Xuen
Panoramic image outpainting
description Currently, Latent Diffusion Models (LDMs) are very adept at generating completely novel images. However, they tend to be lacking in generating images following specific conditions. This final year research project addresses the challenge of enhancing image generation using LDMs by incorporating conditional control. The purpose of the project is to explore the potential of conditional LDMs in facilitating light editing on images. This allows users to create realistic modifications without deep technical knowledge of the underlying processes. There is an increasingly large artistic community growing around generative AI, primarily developed with text prompts. However, there is a gap in light editing capabilities, hence expanding in this area can provide additional creative options. The project schedule breaks down the project into planning, researching, development, testing and evaluation stages. The method involved preparation of indoor dataset and light mask dataset, research of LDM functionality, and exploration of techniques for integrating conditioning mechanisms into LDMs. We developed a conditional LDM utilizing concatenation mechanism and binary light mask dataset which is able to produce high fidelity panoramic image outpainting. Thus, the results of the project demonstrate the feasibility and effectiveness of utilizing conditional LDMs for light editing tasks. Recommendations include the exploration of dynamic light masks dataset and the development of an intuitive user interface.
author2 Cham Tat Jen
author_facet Cham Tat Jen
Teo, Sydney Wen Xuen
format Final Year Project
author Teo, Sydney Wen Xuen
author_sort Teo, Sydney Wen Xuen
title Panoramic image outpainting
title_short Panoramic image outpainting
title_full Panoramic image outpainting
title_fullStr Panoramic image outpainting
title_full_unstemmed Panoramic image outpainting
title_sort panoramic image outpainting
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/174917
_version_ 1806059895542251520