WebRTC based video quality assessment
The video quality assessment (VQA) methods have been the default mechanism to evaluate the quality of a video. Notably for online video consumption, the video quality might be inconsistent due to many variables that could affect the quality of experience (QOE) for the end user. Therefore, a VQA m...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156466 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156466 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1564662022-04-17T10:35:12Z WebRTC based video quality assessment Muhammad Ezzuddin Jamaluddin Lin Weisi School of Computer Science and Engineering WSLin@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision The video quality assessment (VQA) methods have been the default mechanism to evaluate the quality of a video. Notably for online video consumption, the video quality might be inconsistent due to many variables that could affect the quality of experience (QOE) for the end user. Therefore, a VQA metric is important to provide a better QOE for the end user. One of the metrics that can be used to determine the video quality is temporal masking. Temporal masking has been researched on since the early days and is a technique to delude individuals into thinking they can identify any differences with a complex spatial and temporal background. Research have been done to evaluate the spatio-temporal in a video using 3D CNN and the results are better compared to other state-of-the-arts technology. In this paper, we aim to explore how Transformers model will perform compared to traditional method of using Convolutional Neural Network (CNN). Transformers have recently started to show effectiveness in solving computer vision tasks. The Transformers' capability to model long-range relationships makes it an obvious choice for learning temporal information over numerous frames for video comprehension problems. Bachelor of Engineering (Computer Science) 2022-04-17T10:35:12Z 2022-04-17T10:35:12Z 2022 Final Year Project (FYP) Muhammad Ezzuddin Jamaluddin (2022). WebRTC based video quality assessment. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156466 https://hdl.handle.net/10356/156466 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Muhammad Ezzuddin Jamaluddin WebRTC based video quality assessment |
description |
The video quality assessment (VQA) methods have been the default mechanism to evaluate
the quality of a video. Notably for online video consumption, the video quality might be
inconsistent due to many variables that could affect the quality of experience (QOE) for the
end user. Therefore, a VQA metric is important to provide a better QOE for the end user. One
of the metrics that can be used to determine the video quality is temporal masking. Temporal
masking has been researched on since the early days and is a technique to delude
individuals into thinking they can identify any differences with a complex spatial and
temporal background. Research have been done to evaluate the spatio-temporal in a video
using 3D CNN and the results are better compared to other state-of-the-arts technology. In
this paper, we aim to explore how Transformers model will perform compared to traditional
method of using Convolutional Neural Network (CNN). Transformers have recently started
to show effectiveness in solving computer vision tasks. The Transformers' capability to
model long-range relationships makes it an obvious choice for learning temporal information
over numerous frames for video comprehension problems. |
author2 |
Lin Weisi |
author_facet |
Lin Weisi Muhammad Ezzuddin Jamaluddin |
format |
Final Year Project |
author |
Muhammad Ezzuddin Jamaluddin |
author_sort |
Muhammad Ezzuddin Jamaluddin |
title |
WebRTC based video quality assessment |
title_short |
WebRTC based video quality assessment |
title_full |
WebRTC based video quality assessment |
title_fullStr |
WebRTC based video quality assessment |
title_full_unstemmed |
WebRTC based video quality assessment |
title_sort |
webrtc based video quality assessment |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156466 |
_version_ |
1731235784526135296 |