Reference-based image and video super-resolution via C²-matching

Reference-based Super-Resolution (Ref-SR) has recently emerged as a promising paradigm to enhance a low-resolution (LR) input image or video by introducing an additional high-resolution (HR) reference image. Existing Ref-SR methods mostly rely on implicit correspondence matching to borrow HR texture...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiang, Yuming, Chan, Kelvin C. K., Wang, Xintao, Loy, Chen Change, Liu, Ziwei
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172175
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172175
record_format dspace
spelling sg-ntu-dr.10356-1721752023-11-28T05:19:31Z Reference-based image and video super-resolution via C²-matching Jiang, Yuming Chan, Kelvin C. K. Wang, Xintao Loy, Chen Change Liu, Ziwei School of Computer Science and Engineering S-Lab Engineering::Computer science and engineering Image Super-Resolution Reference-Based Super-Resolution Reference-based Super-Resolution (Ref-SR) has recently emerged as a promising paradigm to enhance a low-resolution (LR) input image or video by introducing an additional high-resolution (HR) reference image. Existing Ref-SR methods mostly rely on implicit correspondence matching to borrow HR textures from reference images to compensate for the information loss in input images. However, performing local transfer is difficult because of two gaps between input and reference images: the transformation gap (e.g., scale and rotation) and the resolution gap (e.g., HR and LR). To tackle these challenges, we propose C2-Matching in this work, which performs explicit robust matching crossing transformation and resolution. 1) To bridge the transformation gap, we propose a contrastive correspondence network, which learns transformation-robust correspondences using augmented views of the input image. 2) To address the resolution gap, we adopt teacher-student correlation distillation, which distills knowledge from the easier HR-HR matching to guide the more ambiguous LR-HR matching. 3) Finally, we design a dynamic aggregation module to address the potential misalignment issue between input images and reference images. In addition, to faithfully evaluate the performance of Reference-based Image Super-Resolution (Ref Image SR) under a realistic setting, we contribute the Webly-Referenced SR (WR-SR) dataset, mimicking the practical usage scenario. We also extend C2-Matching to Reference-based Video Super-Resolution (Ref VSR) task, where an image taken in a similar scene serves as the HR reference image. Extensive experiments demonstrate that our proposed C2-Matching significantly outperforms state of the arts by up to 0.7 dB on the standard CUFED5 benchmark and also boosts the performance of video super-resolution by incorporating the C2-Matching component into Video SR pipelines. Notably, C2-Matching also shows great generalizability on WR-SR dataset as well as robustness across large scale and rotation transformations. Codes and datasets are available at https://github.com/yumingj/C2-Matching. Ministry of Education (MOE) Nanyang Technological University This work was supported by in part by NTU NAP, MOE AcRF Tier 1 (2021- T1-001-088), and in part by the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s). 2023-11-28T05:19:31Z 2023-11-28T05:19:31Z 2023 Journal Article Jiang, Y., Chan, K. C. K., Wang, X., Loy, C. C. & Liu, Z. (2023). Reference-based image and video super-resolution via C²-matching. IEEE Transactions On Pattern Analysis and Machine Intelligence, 45(7), 8874-8887. https://dx.doi.org/10.1109/TPAMI.2022.3231089 0162-8828 https://hdl.handle.net/10356/172175 10.1109/TPAMI.2022.3231089 37015431 2-s2.0-85146222873 7 45 8874 8887 en 2021- T1-001-088 IEEE Transactions on Pattern Analysis and Machine Intelligence © 2022 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Image Super-Resolution
Reference-Based Super-Resolution
spellingShingle Engineering::Computer science and engineering
Image Super-Resolution
Reference-Based Super-Resolution
Jiang, Yuming
Chan, Kelvin C. K.
Wang, Xintao
Loy, Chen Change
Liu, Ziwei
Reference-based image and video super-resolution via C²-matching
description Reference-based Super-Resolution (Ref-SR) has recently emerged as a promising paradigm to enhance a low-resolution (LR) input image or video by introducing an additional high-resolution (HR) reference image. Existing Ref-SR methods mostly rely on implicit correspondence matching to borrow HR textures from reference images to compensate for the information loss in input images. However, performing local transfer is difficult because of two gaps between input and reference images: the transformation gap (e.g., scale and rotation) and the resolution gap (e.g., HR and LR). To tackle these challenges, we propose C2-Matching in this work, which performs explicit robust matching crossing transformation and resolution. 1) To bridge the transformation gap, we propose a contrastive correspondence network, which learns transformation-robust correspondences using augmented views of the input image. 2) To address the resolution gap, we adopt teacher-student correlation distillation, which distills knowledge from the easier HR-HR matching to guide the more ambiguous LR-HR matching. 3) Finally, we design a dynamic aggregation module to address the potential misalignment issue between input images and reference images. In addition, to faithfully evaluate the performance of Reference-based Image Super-Resolution (Ref Image SR) under a realistic setting, we contribute the Webly-Referenced SR (WR-SR) dataset, mimicking the practical usage scenario. We also extend C2-Matching to Reference-based Video Super-Resolution (Ref VSR) task, where an image taken in a similar scene serves as the HR reference image. Extensive experiments demonstrate that our proposed C2-Matching significantly outperforms state of the arts by up to 0.7 dB on the standard CUFED5 benchmark and also boosts the performance of video super-resolution by incorporating the C2-Matching component into Video SR pipelines. Notably, C2-Matching also shows great generalizability on WR-SR dataset as well as robustness across large scale and rotation transformations. Codes and datasets are available at https://github.com/yumingj/C2-Matching.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Jiang, Yuming
Chan, Kelvin C. K.
Wang, Xintao
Loy, Chen Change
Liu, Ziwei
format Article
author Jiang, Yuming
Chan, Kelvin C. K.
Wang, Xintao
Loy, Chen Change
Liu, Ziwei
author_sort Jiang, Yuming
title Reference-based image and video super-resolution via C²-matching
title_short Reference-based image and video super-resolution via C²-matching
title_full Reference-based image and video super-resolution via C²-matching
title_fullStr Reference-based image and video super-resolution via C²-matching
title_full_unstemmed Reference-based image and video super-resolution via C²-matching
title_sort reference-based image and video super-resolution via c²-matching
publishDate 2023
url https://hdl.handle.net/10356/172175
_version_ 1783955579439415296