GP-UNIT: generative prior for versatile unsupervised image-to-image translation

Recent advances in deep learning have witnessed many successful unsupervised image-to-image translation models that learn correspondences between two visual domains without paired data. However, it is still a great challenge to build robust mappings between various domains especially for those with...

Full description

Saved in:
Bibliographic Details
Main Authors: Yang, Shuai, Jiang, Liming, Liu, Ziwei, Loy, Chen Change
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/171788
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-171788
record_format dspace
spelling sg-ntu-dr.10356-1717882023-11-08T01:51:49Z GP-UNIT: generative prior for versatile unsupervised image-to-image translation Yang, Shuai Jiang, Liming Liu, Ziwei Loy, Chen Change School of Computer Science and Engineering S-Lab Engineering::Computer science and engineering Multi-level Correspondence Prior Distillation Recent advances in deep learning have witnessed many successful unsupervised image-to-image translation models that learn correspondences between two visual domains without paired data. However, it is still a great challenge to build robust mappings between various domains especially for those with drastic visual discrepancies. In this paper, we introduce a novel versatile framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), that improves the quality, applicability and controllability of the existing translation models. The key idea of GP-UNIT is to distill the generative prior from pre-trained class-conditional GANs to build coarse-level cross-domain correspondences, and to apply the learned prior to adversarial translations to excavate fine-level correspondences. With the learned multi-level content correspondences, GP-UNIT is able to perform valid translations between both close domains and distant domains. For close domains, GP-UNIT can be conditioned on a parameter to determine the intensity of the content correspondences during translation, allowing users to balance between content and style consistency. For distant domains, semi-supervised learning is explored to guide GP-UNIT to discover accurate semantic correspondences that are hard to learn solely from the appearance. We validate the superiority of GP-UNIT over state-of-the-art translation models in robust, high-quality and diversified translations between various domains through extensive experiments. Agency for Science, Technology and Research (A*STAR) Ministry of Education (MOE) Nanyang Technological University This work was supported in part by the RIE2020 Industry Alignment Fund - Industry Collaboration Projects under Grant IAF-ICP Funding Initiative, as well as cash and in-kind contribution from the industry partner(s) and in part by MOE AcRF Tier 2 under Grants T2EP20221-0011 and T2EP20221-0012 and NTU NAP grant. 2023-11-08T01:51:49Z 2023-11-08T01:51:49Z 2023 Journal Article Yang, S., Jiang, L., Liu, Z. & Loy, C. C. (2023). GP-UNIT: generative prior for versatile unsupervised image-to-image translation. IEEE Transactions On Pattern Analysis and Machine Intelligence, 45(10), 11869-11883. https://dx.doi.org/10.1109/TPAMI.2023.3284003 0162-8828 https://hdl.handle.net/10356/171788 10.1109/TPAMI.2023.3284003 37289604 2-s2.0-85162652296 10 45 11869 11883 en T2EP20221-0011 T2EP20221-0012 IEEE Transactions on Pattern Analysis and Machine Intelligence © 2023 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Multi-level Correspondence
Prior Distillation
spellingShingle Engineering::Computer science and engineering
Multi-level Correspondence
Prior Distillation
Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
GP-UNIT: generative prior for versatile unsupervised image-to-image translation
description Recent advances in deep learning have witnessed many successful unsupervised image-to-image translation models that learn correspondences between two visual domains without paired data. However, it is still a great challenge to build robust mappings between various domains especially for those with drastic visual discrepancies. In this paper, we introduce a novel versatile framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), that improves the quality, applicability and controllability of the existing translation models. The key idea of GP-UNIT is to distill the generative prior from pre-trained class-conditional GANs to build coarse-level cross-domain correspondences, and to apply the learned prior to adversarial translations to excavate fine-level correspondences. With the learned multi-level content correspondences, GP-UNIT is able to perform valid translations between both close domains and distant domains. For close domains, GP-UNIT can be conditioned on a parameter to determine the intensity of the content correspondences during translation, allowing users to balance between content and style consistency. For distant domains, semi-supervised learning is explored to guide GP-UNIT to discover accurate semantic correspondences that are hard to learn solely from the appearance. We validate the superiority of GP-UNIT over state-of-the-art translation models in robust, high-quality and diversified translations between various domains through extensive experiments.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
format Article
author Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
author_sort Yang, Shuai
title GP-UNIT: generative prior for versatile unsupervised image-to-image translation
title_short GP-UNIT: generative prior for versatile unsupervised image-to-image translation
title_full GP-UNIT: generative prior for versatile unsupervised image-to-image translation
title_fullStr GP-UNIT: generative prior for versatile unsupervised image-to-image translation
title_full_unstemmed GP-UNIT: generative prior for versatile unsupervised image-to-image translation
title_sort gp-unit: generative prior for versatile unsupervised image-to-image translation
publishDate 2023
url https://hdl.handle.net/10356/171788
_version_ 1783955612620554240