GP-UNIT: generative prior for versatile unsupervised image-to-image translation
Recent advances in deep learning have witnessed many successful unsupervised image-to-image translation models that learn correspondences between two visual domains without paired data. However, it is still a great challenge to build robust mappings between various domains especially for those with...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171788 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-171788 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1717882023-11-08T01:51:49Z GP-UNIT: generative prior for versatile unsupervised image-to-image translation Yang, Shuai Jiang, Liming Liu, Ziwei Loy, Chen Change School of Computer Science and Engineering S-Lab Engineering::Computer science and engineering Multi-level Correspondence Prior Distillation Recent advances in deep learning have witnessed many successful unsupervised image-to-image translation models that learn correspondences between two visual domains without paired data. However, it is still a great challenge to build robust mappings between various domains especially for those with drastic visual discrepancies. In this paper, we introduce a novel versatile framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), that improves the quality, applicability and controllability of the existing translation models. The key idea of GP-UNIT is to distill the generative prior from pre-trained class-conditional GANs to build coarse-level cross-domain correspondences, and to apply the learned prior to adversarial translations to excavate fine-level correspondences. With the learned multi-level content correspondences, GP-UNIT is able to perform valid translations between both close domains and distant domains. For close domains, GP-UNIT can be conditioned on a parameter to determine the intensity of the content correspondences during translation, allowing users to balance between content and style consistency. For distant domains, semi-supervised learning is explored to guide GP-UNIT to discover accurate semantic correspondences that are hard to learn solely from the appearance. We validate the superiority of GP-UNIT over state-of-the-art translation models in robust, high-quality and diversified translations between various domains through extensive experiments. Agency for Science, Technology and Research (A*STAR) Ministry of Education (MOE) Nanyang Technological University This work was supported in part by the RIE2020 Industry Alignment Fund - Industry Collaboration Projects under Grant IAF-ICP Funding Initiative, as well as cash and in-kind contribution from the industry partner(s) and in part by MOE AcRF Tier 2 under Grants T2EP20221-0011 and T2EP20221-0012 and NTU NAP grant. 2023-11-08T01:51:49Z 2023-11-08T01:51:49Z 2023 Journal Article Yang, S., Jiang, L., Liu, Z. & Loy, C. C. (2023). GP-UNIT: generative prior for versatile unsupervised image-to-image translation. IEEE Transactions On Pattern Analysis and Machine Intelligence, 45(10), 11869-11883. https://dx.doi.org/10.1109/TPAMI.2023.3284003 0162-8828 https://hdl.handle.net/10356/171788 10.1109/TPAMI.2023.3284003 37289604 2-s2.0-85162652296 10 45 11869 11883 en T2EP20221-0011 T2EP20221-0012 IEEE Transactions on Pattern Analysis and Machine Intelligence © 2023 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Multi-level Correspondence Prior Distillation |
spellingShingle |
Engineering::Computer science and engineering Multi-level Correspondence Prior Distillation Yang, Shuai Jiang, Liming Liu, Ziwei Loy, Chen Change GP-UNIT: generative prior for versatile unsupervised image-to-image translation |
description |
Recent advances in deep learning have witnessed many successful unsupervised image-to-image translation models that learn correspondences between two visual domains without paired data. However, it is still a great challenge to build robust mappings between various domains especially for those with drastic visual discrepancies. In this paper, we introduce a novel versatile framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), that improves the quality, applicability and controllability of the existing translation models. The key idea of GP-UNIT is to distill the generative prior from pre-trained class-conditional GANs to build coarse-level cross-domain correspondences, and to apply the learned prior to adversarial translations to excavate fine-level correspondences. With the learned multi-level content correspondences, GP-UNIT is able to perform valid translations between both close domains and distant domains. For close domains, GP-UNIT can be conditioned on a parameter to determine the intensity of the content correspondences during translation, allowing users to balance between content and style consistency. For distant domains, semi-supervised learning is explored to guide GP-UNIT to discover accurate semantic correspondences that are hard to learn solely from the appearance. We validate the superiority of GP-UNIT over state-of-the-art translation models in robust, high-quality and diversified translations between various domains through extensive experiments. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Yang, Shuai Jiang, Liming Liu, Ziwei Loy, Chen Change |
format |
Article |
author |
Yang, Shuai Jiang, Liming Liu, Ziwei Loy, Chen Change |
author_sort |
Yang, Shuai |
title |
GP-UNIT: generative prior for versatile unsupervised image-to-image translation |
title_short |
GP-UNIT: generative prior for versatile unsupervised image-to-image translation |
title_full |
GP-UNIT: generative prior for versatile unsupervised image-to-image translation |
title_fullStr |
GP-UNIT: generative prior for versatile unsupervised image-to-image translation |
title_full_unstemmed |
GP-UNIT: generative prior for versatile unsupervised image-to-image translation |
title_sort |
gp-unit: generative prior for versatile unsupervised image-to-image translation |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/171788 |
_version_ |
1783955612620554240 |