Image-to-image translation based on generative models

Bibliographic Details
Main Author: Tang, Mengxiao
Other Authors: Ponnuthurai Nagaratnam Suganthan
Format: Thesis-Master by Coursework
Language: English
Published: Nanyang Technological University 2022
Subjects:
Online Access: https://hdl.handle.net/10356/154672
Description
Summary: Image-to-image translation has become a widely studied topic in computer vision. It aims to find a model that takes an input image and generates the corresponding desired output image. Previous studies based on deep neural networks were mostly built upon encoder-decoder architectures, which learn a direct mapping from the input to the target output without exploring the distribution of images. In this thesis, generative models are used to capture the distribution of images, and their potential for image-to-image translation tasks is explored. Specifically, an improved CycleGAN is proposed for the style transfer task, and a DDPM-based conditional generative model is used for image colorization. Empirical results show that generative models can achieve competitive results in image-to-image translation tasks.