Panoramic image outpainting
Currently, Latent Diffusion Models (LDMs) are very adept at generating completely novel images. However, they tend to be lacking in generating images following specific conditions. This final year research project addresses the challenge of enhancing image generation using LDMs by incorporating cond...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/174917 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-174917 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1749172024-04-19T15:45:39Z Panoramic image outpainting Teo, Sydney Wen Xuen Cham Tat Jen School of Computer Science and Engineering ASTJCham@ntu.edu.sg Computer and Information Science Generative AI Panoramic image outpainting Diffusion models Currently, Latent Diffusion Models (LDMs) are very adept at generating completely novel images. However, they tend to be lacking in generating images following specific conditions. This final year research project addresses the challenge of enhancing image generation using LDMs by incorporating conditional control. The purpose of the project is to explore the potential of conditional LDMs in facilitating light editing on images. This allows users to create realistic modifications without deep technical knowledge of the underlying processes. There is an increasingly large artistic community growing around generative AI, primarily developed with text prompts. However, there is a gap in light editing capabilities, hence expanding in this area can provide additional creative options. The project schedule breaks down the project into planning, researching, development, testing and evaluation stages. The method involved preparation of indoor dataset and light mask dataset, research of LDM functionality, and exploration of techniques for integrating conditioning mechanisms into LDMs. We developed a conditional LDM utilizing concatenation mechanism and binary light mask dataset which is able to produce high fidelity panoramic image outpainting. Thus, the results of the project demonstrate the feasibility and effectiveness of utilizing conditional LDMs for light editing tasks. Recommendations include the exploration of dynamic light masks dataset and the development of an intuitive user interface. Bachelor's degree 2024-04-16T04:44:23Z 2024-04-16T04:44:23Z 2024 Final Year Project (FYP) Teo, S. W. X. (2024). Panoramic image outpainting. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/174917 https://hdl.handle.net/10356/174917 en SCSE23-0032 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Generative AI Panoramic image outpainting Diffusion models |
spellingShingle |
Computer and Information Science Generative AI Panoramic image outpainting Diffusion models Teo, Sydney Wen Xuen Panoramic image outpainting |
description |
Currently, Latent Diffusion Models (LDMs) are very adept at generating completely novel images. However, they tend to be lacking in generating images following specific conditions. This final year research project addresses the challenge of enhancing image generation using LDMs by incorporating conditional control.
The purpose of the project is to explore the potential of conditional LDMs in facilitating light editing on images. This allows users to create realistic modifications without deep technical knowledge of the underlying processes. There is an increasingly large artistic community growing around generative AI, primarily developed with text prompts. However, there is a gap in light editing capabilities, hence expanding in this area can provide additional creative options.
The project schedule breaks down the project into planning, researching, development, testing and evaluation stages. The method involved preparation of indoor dataset and light mask dataset, research of LDM functionality, and exploration of techniques for integrating conditioning mechanisms into LDMs. We developed a conditional LDM utilizing concatenation mechanism and binary light mask dataset which is able to produce high fidelity panoramic image outpainting. Thus, the results of the project demonstrate the feasibility and effectiveness of utilizing conditional LDMs for light editing tasks. Recommendations include the exploration of dynamic light masks dataset and the development of an intuitive user interface. |
author2 |
Cham Tat Jen |
author_facet |
Cham Tat Jen Teo, Sydney Wen Xuen |
format |
Final Year Project |
author |
Teo, Sydney Wen Xuen |
author_sort |
Teo, Sydney Wen Xuen |
title |
Panoramic image outpainting |
title_short |
Panoramic image outpainting |
title_full |
Panoramic image outpainting |
title_fullStr |
Panoramic image outpainting |
title_full_unstemmed |
Panoramic image outpainting |
title_sort |
panoramic image outpainting |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/174917 |
_version_ |
1806059895542251520 |