Vision-language-model-based video quality assessment

This work introduces a comprehensive approach to video quality assessment (VQA) using both traditional deep-learning-based methods and vision-language-model-based methods. Through the development of the DIVIDE-3k database and the DOVER model, we offer nuanced insights into the multifaceted natur...

Bibliographic Details
Main Author: Zhang, Erli
Other Authors: Lin, Weisi
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2024
Subjects:
Online Access: https://hdl.handle.net/10356/175035
Institution: Nanyang Technological University