Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16(2)) directly to an HR image (1024(2)) is too...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7877 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8880 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-88802023-06-15T09:00:05Z Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16(2)) directly to an HR image (1024(2)) is too ambiguous to preserve faithful local facial semantics. In this paper, we propose an enhanced upsampling approach, Pro-PULSE, that addresses the issues of semantic inconsistency and optimization complexity. Our idea is to learn an encoder that progressively constructs the HR latent codes in the extended W+ latent space of StyleGAN. This design divides the complex 64x upsampling problem into several steps, and therefore small-scale facial semantics can be inherited from one end to the other. In particular, we train two encoders, the base encoder maps latent vectors in W space and serves as a foundation of the HR latent vector, while the second scale-specific encoder performed in W+ space gradually replaces the previous vector produced by the base encoder at each scale. This process produces intermediate side-outputs, which injects deep supervision into the training of encoder. Extensive experiments demonstrate superiorities over the latest latent space exploration methods, in terms of efficiency, quantitative quality metrics, and qualitative visual results. 2022-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/7877 info:doi/10.1109/TIP.2022.3140603 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Photo upsampling GANs progressive learning latent space Information Security |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Photo upsampling GANs progressive learning latent space Information Security |
spellingShingle |
Photo upsampling GANs progressive learning latent space Information Security ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling |
description |
The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16(2)) directly to an HR image (1024(2)) is too ambiguous to preserve faithful local facial semantics. In this paper, we propose an enhanced upsampling approach, Pro-PULSE, that addresses the issues of semantic inconsistency and optimization complexity. Our idea is to learn an encoder that progressively constructs the HR latent codes in the extended W+ latent space of StyleGAN. This design divides the complex 64x upsampling problem into several steps, and therefore small-scale facial semantics can be inherited from one end to the other. In particular, we train two encoders, the base encoder maps latent vectors in W space and serves as a foundation of the HR latent vector, while the second scale-specific encoder performed in W+ space gradually replaces the previous vector produced by the base encoder at each scale. This process produces intermediate side-outputs, which injects deep supervision into the training of encoder. Extensive experiments demonstrate superiorities over the latest latent space exploration methods, in terms of efficiency, quantitative quality metrics, and qualitative visual results. |
format |
text |
author |
ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng |
author_facet |
ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng |
author_sort |
ZHOU, Yang |
title |
Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling |
title_short |
Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling |
title_full |
Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling |
title_fullStr |
Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling |
title_full_unstemmed |
Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling |
title_sort |
pro-pulse: learning progressive encoders of latent semantics in gans for photo upsampling |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7877 |
_version_ |
1770576574562697216 |