Improving Natural Language Person Description Search from Videos with Language Model Fine-Tuning and Approximate Nearest Neighbor

Due to the ubiquitous nature of CCTV cameras that record continuously, there is a large amount of video data that are unstructured. Often, when these recordings have to be reviewed, it is to look for a specific person that fits a certain description. Currently, this is achieved by manual inspection...

Full description

Saved in:
Bibliographic Details
Main Author: Yuenyong S.
Other Authors: Mahidol University
Format: Article
Published: 2023
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/83958
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
Be the first to leave a comment!
You must be logged in first