Exploring the effectiveness of video perceptual representation in blind video quality assessment

With the rapid growth of in-The-wild videos taken by non-specialists, blind video quality assessment (VQA) has become a challenging and demanding problem. Although lots of efforts have been made to solve this problem, it remains unclear how the human visual system (HVS) relates to the temporal quali...

Full description

Saved in:
Bibliographic Details
Main Authors: Liao, Liang, Xu, Kangmin, Wu, Haoning, Chen, Chaofeng, Sun, Wenxiu, Yan, Qiong, Lin, Weisi
Other Authors: College of Computing and Data Science
Format: Conference or Workshop Item
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/178457
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-178457
record_format dspace
spelling sg-ntu-dr.10356-1784572024-06-21T01:03:41Z Exploring the effectiveness of video perceptual representation in blind video quality assessment Liao, Liang Xu, Kangmin Wu, Haoning Chen, Chaofeng Sun, Wenxiu Yan, Qiong Lin, Weisi College of Computing and Data Science School of Computer Science and Engineering 30th ACM International Conference on Multimedia (MM '22) S-Lab Computer and Information Science Perceptual trajectories Primary visual cortex With the rapid growth of in-The-wild videos taken by non-specialists, blind video quality assessment (VQA) has become a challenging and demanding problem. Although lots of efforts have been made to solve this problem, it remains unclear how the human visual system (HVS) relates to the temporal quality of videos. Meanwhile, recent work has found that the frames of natural video transformed into the perceptual domain of the HVS tend to form a straight trajectory of the representations. With the obtained insight that distortion impairs the perceived video quality and results in a curved trajectory of the perceptual representation, we propose a temporal perceptual quality index (TPQI) to measure the temporal distortion by describing the graphic morphology of the representation. Specifically, we first extract the video perceptual representations from the lateral geniculate nucleus (LGN) and primary visual area (V1) of the HVS, and then measure the straightness and compactness of their trajectories to quantify the degradation in naturalness and content continuity of video. Experiments show that the perceptual representation in the HVS is an effective way of predicting subjective temporal quality, and thus TPQI can, for the first time, achieve comparable performance to the spatial quality metric and be even more effective in assessing videos with large temporal variations. We further demonstrate that by combining with NIQE, a spatial quality metric, TPQI can achieve top performance over popular in-The-wild video datasets. More importantly, TPQI does not require any additional information beyond the video being evaluated and thus can be applied to any datasets without parameter tuning. Source code is available at https://github.com/UoLMM/TPQI-VQA. This study is supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s). 2024-06-21T01:03:41Z 2024-06-21T01:03:41Z 2022 Conference Paper Liao, L., Xu, K., Wu, H., Chen, C., Sun, W., Yan, Q. & Lin, W. (2022). Exploring the effectiveness of video perceptual representation in blind video quality assessment. 30th ACM International Conference on Multimedia (MM '22), 837-846. https://dx.doi.org/10.1145/3503161.3547849 9781450392037 https://hdl.handle.net/10356/178457 10.1145/3503161.3547849 2-s2.0-85140428186 837 846 en © 2022 Association for Computing Machinery. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Perceptual trajectories
Primary visual cortex
spellingShingle Computer and Information Science
Perceptual trajectories
Primary visual cortex
Liao, Liang
Xu, Kangmin
Wu, Haoning
Chen, Chaofeng
Sun, Wenxiu
Yan, Qiong
Lin, Weisi
Exploring the effectiveness of video perceptual representation in blind video quality assessment
description With the rapid growth of in-The-wild videos taken by non-specialists, blind video quality assessment (VQA) has become a challenging and demanding problem. Although lots of efforts have been made to solve this problem, it remains unclear how the human visual system (HVS) relates to the temporal quality of videos. Meanwhile, recent work has found that the frames of natural video transformed into the perceptual domain of the HVS tend to form a straight trajectory of the representations. With the obtained insight that distortion impairs the perceived video quality and results in a curved trajectory of the perceptual representation, we propose a temporal perceptual quality index (TPQI) to measure the temporal distortion by describing the graphic morphology of the representation. Specifically, we first extract the video perceptual representations from the lateral geniculate nucleus (LGN) and primary visual area (V1) of the HVS, and then measure the straightness and compactness of their trajectories to quantify the degradation in naturalness and content continuity of video. Experiments show that the perceptual representation in the HVS is an effective way of predicting subjective temporal quality, and thus TPQI can, for the first time, achieve comparable performance to the spatial quality metric and be even more effective in assessing videos with large temporal variations. We further demonstrate that by combining with NIQE, a spatial quality metric, TPQI can achieve top performance over popular in-The-wild video datasets. More importantly, TPQI does not require any additional information beyond the video being evaluated and thus can be applied to any datasets without parameter tuning. Source code is available at https://github.com/UoLMM/TPQI-VQA.
author2 College of Computing and Data Science
author_facet College of Computing and Data Science
Liao, Liang
Xu, Kangmin
Wu, Haoning
Chen, Chaofeng
Sun, Wenxiu
Yan, Qiong
Lin, Weisi
format Conference or Workshop Item
author Liao, Liang
Xu, Kangmin
Wu, Haoning
Chen, Chaofeng
Sun, Wenxiu
Yan, Qiong
Lin, Weisi
author_sort Liao, Liang
title Exploring the effectiveness of video perceptual representation in blind video quality assessment
title_short Exploring the effectiveness of video perceptual representation in blind video quality assessment
title_full Exploring the effectiveness of video perceptual representation in blind video quality assessment
title_fullStr Exploring the effectiveness of video perceptual representation in blind video quality assessment
title_full_unstemmed Exploring the effectiveness of video perceptual representation in blind video quality assessment
title_sort exploring the effectiveness of video perceptual representation in blind video quality assessment
publishDate 2024
url https://hdl.handle.net/10356/178457
_version_ 1806059890327683072