WebRTC based video quality assessment

The video quality assessment (VQA) methods have been the default mechanism to evaluate the quality of a video. Notably for online video consumption, the video quality might be inconsistent due to many variables that could affect the quality of experience (QOE) for the end user. Therefore, a VQA m...

Full description

Saved in:
Bibliographic Details
Main Author: Muhammad Ezzuddin Jamaluddin
Other Authors: Lin Weisi
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156466
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156466
record_format dspace
spelling sg-ntu-dr.10356-1564662022-04-17T10:35:12Z WebRTC based video quality assessment Muhammad Ezzuddin Jamaluddin Lin Weisi School of Computer Science and Engineering WSLin@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision The video quality assessment (VQA) methods have been the default mechanism to evaluate the quality of a video. Notably for online video consumption, the video quality might be inconsistent due to many variables that could affect the quality of experience (QOE) for the end user. Therefore, a VQA metric is important to provide a better QOE for the end user. One of the metrics that can be used to determine the video quality is temporal masking. Temporal masking has been researched on since the early days and is a technique to delude individuals into thinking they can identify any differences with a complex spatial and temporal background. Research have been done to evaluate the spatio-temporal in a video using 3D CNN and the results are better compared to other state-of-the-arts technology. In this paper, we aim to explore how Transformers model will perform compared to traditional method of using Convolutional Neural Network (CNN). Transformers have recently started to show effectiveness in solving computer vision tasks. The Transformers' capability to model long-range relationships makes it an obvious choice for learning temporal information over numerous frames for video comprehension problems. Bachelor of Engineering (Computer Science) 2022-04-17T10:35:12Z 2022-04-17T10:35:12Z 2022 Final Year Project (FYP) Muhammad Ezzuddin Jamaluddin (2022). WebRTC based video quality assessment. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156466 https://hdl.handle.net/10356/156466 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Muhammad Ezzuddin Jamaluddin
WebRTC based video quality assessment
description The video quality assessment (VQA) methods have been the default mechanism to evaluate the quality of a video. Notably for online video consumption, the video quality might be inconsistent due to many variables that could affect the quality of experience (QOE) for the end user. Therefore, a VQA metric is important to provide a better QOE for the end user. One of the metrics that can be used to determine the video quality is temporal masking. Temporal masking has been researched on since the early days and is a technique to delude individuals into thinking they can identify any differences with a complex spatial and temporal background. Research have been done to evaluate the spatio-temporal in a video using 3D CNN and the results are better compared to other state-of-the-arts technology. In this paper, we aim to explore how Transformers model will perform compared to traditional method of using Convolutional Neural Network (CNN). Transformers have recently started to show effectiveness in solving computer vision tasks. The Transformers' capability to model long-range relationships makes it an obvious choice for learning temporal information over numerous frames for video comprehension problems.
author2 Lin Weisi
author_facet Lin Weisi
Muhammad Ezzuddin Jamaluddin
format Final Year Project
author Muhammad Ezzuddin Jamaluddin
author_sort Muhammad Ezzuddin Jamaluddin
title WebRTC based video quality assessment
title_short WebRTC based video quality assessment
title_full WebRTC based video quality assessment
title_fullStr WebRTC based video quality assessment
title_full_unstemmed WebRTC based video quality assessment
title_sort webrtc based video quality assessment
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156466
_version_ 1731235784526135296