Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling

The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16(2)) directly to an HR image (1024(2)) is too...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHOU, Yang, XU, Yangyang, DU, Yong, WEN, Qiang, HE, Shengfeng
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Photo upsampling GANs progressive learning latent space Information Security
Online Access:	https://ink.library.smu.edu.sg/sis_research/7877
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8880
record_format	dspace
spelling	sg-smu-ink.sis_research-88802023-06-15T09:00:05Z Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16(2)) directly to an HR image (1024(2)) is too ambiguous to preserve faithful local facial semantics. In this paper, we propose an enhanced upsampling approach, Pro-PULSE, that addresses the issues of semantic inconsistency and optimization complexity. Our idea is to learn an encoder that progressively constructs the HR latent codes in the extended W+ latent space of StyleGAN. This design divides the complex 64x upsampling problem into several steps, and therefore small-scale facial semantics can be inherited from one end to the other. In particular, we train two encoders, the base encoder maps latent vectors in W space and serves as a foundation of the HR latent vector, while the second scale-specific encoder performed in W+ space gradually replaces the previous vector produced by the base encoder at each scale. This process produces intermediate side-outputs, which injects deep supervision into the training of encoder. Extensive experiments demonstrate superiorities over the latest latent space exploration methods, in terms of efficiency, quantitative quality metrics, and qualitative visual results. 2022-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/7877 info:doi/10.1109/TIP.2022.3140603 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Photo upsampling GANs progressive learning latent space Information Security
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Photo upsampling GANs progressive learning latent space Information Security
spellingShingle	Photo upsampling GANs progressive learning latent space Information Security ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
description	The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16(2)) directly to an HR image (1024(2)) is too ambiguous to preserve faithful local facial semantics. In this paper, we propose an enhanced upsampling approach, Pro-PULSE, that addresses the issues of semantic inconsistency and optimization complexity. Our idea is to learn an encoder that progressively constructs the HR latent codes in the extended W+ latent space of StyleGAN. This design divides the complex 64x upsampling problem into several steps, and therefore small-scale facial semantics can be inherited from one end to the other. In particular, we train two encoders, the base encoder maps latent vectors in W space and serves as a foundation of the HR latent vector, while the second scale-specific encoder performed in W+ space gradually replaces the previous vector produced by the base encoder at each scale. This process produces intermediate side-outputs, which injects deep supervision into the training of encoder. Extensive experiments demonstrate superiorities over the latest latent space exploration methods, in terms of efficiency, quantitative quality metrics, and qualitative visual results.
format	text
author	ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng
author_facet	ZHOU, Yang XU, Yangyang DU, Yong WEN, Qiang HE, Shengfeng
author_sort	ZHOU, Yang
title	Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
title_short	Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
title_full	Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
title_fullStr	Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
title_full_unstemmed	Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling
title_sort	pro-pulse: learning progressive encoders of latent semantics in gans for photo upsampling
publisher	Institutional Knowledge at Singapore Management University
publishDate	2022
url	https://ink.library.smu.edu.sg/sis_research/7877
_version_	1770576574562697216

Pro-pulse: Learning progressive encoders of latent semantics in gans for photo upsampling

Similar Items