Visual recognition using artificial intelligence (image inpainting with transformers)

As a major task in Computer Vision area, image inpainting is the process of filling in the missing part of an image. The traditional methods for image inpainting always struggle to address complex or large missing parts. In the last decade, the deep learning methods have made significant progress in...

Full description

Saved in:
Bibliographic Details
Main Author: Dou, Yuxiao
Other Authors: Yap Kim Hui
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/167299
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-167299
record_format dspace
spelling sg-ntu-dr.10356-1672992023-07-07T15:47:11Z Visual recognition using artificial intelligence (image inpainting with transformers) Dou, Yuxiao Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering::Electrical and electronic engineering As a major task in Computer Vision area, image inpainting is the process of filling in the missing part of an image. The traditional methods for image inpainting always struggle to address complex or large missing parts. In the last decade, the deep learning methods have made significant progress in this area. In this project, the potential of transformers in image inpainting is explored. Transformers have already demonstrated their outstanding global structure understanding ability in NLP, which could be quite useful in image inpainting as well. However, transformers’ computational inefficiency would be magnified when dealing with image data type. To overcome this weakness, a method combing both transformers and CNNs are explored and researched on. We achieve high performance in the Places2 and FFHQ datasets. Since the FFHQ dataset contains limited number of Asian-face images, an Asian-face dataset AFD-dataset was used to extend the application of the proposed method as well. To conclude, this project helps to further explore the possibility of transformers in image inpainting area and provides some useful data and information for the future research. Bachelor of Engineering (Electrical and Electronic Engineering) 2023-05-25T07:59:37Z 2023-05-25T07:59:37Z 2023 Final Year Project (FYP) Dou, Y. (2023). Visual recognition using artificial intelligence (image inpainting with transformers). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167299 https://hdl.handle.net/10356/167299 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Dou, Yuxiao
Visual recognition using artificial intelligence (image inpainting with transformers)
description As a major task in Computer Vision area, image inpainting is the process of filling in the missing part of an image. The traditional methods for image inpainting always struggle to address complex or large missing parts. In the last decade, the deep learning methods have made significant progress in this area. In this project, the potential of transformers in image inpainting is explored. Transformers have already demonstrated their outstanding global structure understanding ability in NLP, which could be quite useful in image inpainting as well. However, transformers’ computational inefficiency would be magnified when dealing with image data type. To overcome this weakness, a method combing both transformers and CNNs are explored and researched on. We achieve high performance in the Places2 and FFHQ datasets. Since the FFHQ dataset contains limited number of Asian-face images, an Asian-face dataset AFD-dataset was used to extend the application of the proposed method as well. To conclude, this project helps to further explore the possibility of transformers in image inpainting area and provides some useful data and information for the future research.
author2 Yap Kim Hui
author_facet Yap Kim Hui
Dou, Yuxiao
format Final Year Project
author Dou, Yuxiao
author_sort Dou, Yuxiao
title Visual recognition using artificial intelligence (image inpainting with transformers)
title_short Visual recognition using artificial intelligence (image inpainting with transformers)
title_full Visual recognition using artificial intelligence (image inpainting with transformers)
title_fullStr Visual recognition using artificial intelligence (image inpainting with transformers)
title_full_unstemmed Visual recognition using artificial intelligence (image inpainting with transformers)
title_sort visual recognition using artificial intelligence (image inpainting with transformers)
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/167299
_version_ 1772827248543399936