Efficient near-duplicate keyframe retrieval with visual language models

Near-duplicate keyframe retrieval is a critical task for video similarity measure, video threading and tracking. In this paper, instead of using expensive point-to-point matching on keypoints, we investigate the visual language models built on visual keywords to speed up the near-duplicate keyframe...

Full description

Saved in:
Bibliographic Details
Main Authors: WU, Xiao, ZHAO, Wan-Lei, NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6603
https://ink.library.smu.edu.sg/context/sis_research/article/7606/viewcontent/Efficient_Near_Duplicate_Keyframe_Retrie.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7606
record_format dspace
spelling sg-smu-ink.sis_research-76062022-01-13T08:19:16Z Efficient near-duplicate keyframe retrieval with visual language models WU, Xiao ZHAO, Wan-Lei NGO, Chong-wah Near-duplicate keyframe retrieval is a critical task for video similarity measure, video threading and tracking. In this paper, instead of using expensive point-to-point matching on keypoints, we investigate the visual language models built on visual keywords to speed up the near-duplicate keyframe retrieval. The main idea is to estimate a visual language model on visual keywords for each keyframe and compare keyframes by the likelihood of their visual language models. Experiments on a subset of TRECVID-2004 video corpus show that visual language models built on visual keywords demonstrate promising performance for near-duplicate keyframe retrieval, which greatly speed up the retrieval speed although sacrifice a little performance compared to expensive point-to-point matching. 2007-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6603 info:doi/10.1109/icme.2007.4284696 https://ink.library.smu.edu.sg/context/sis_research/article/7606/viewcontent/Efficient_Near_Duplicate_Keyframe_Retrie.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Databases and Information Systems
Graphics and Human Computer Interfaces
WU, Xiao
ZHAO, Wan-Lei
NGO, Chong-wah
Efficient near-duplicate keyframe retrieval with visual language models
description Near-duplicate keyframe retrieval is a critical task for video similarity measure, video threading and tracking. In this paper, instead of using expensive point-to-point matching on keypoints, we investigate the visual language models built on visual keywords to speed up the near-duplicate keyframe retrieval. The main idea is to estimate a visual language model on visual keywords for each keyframe and compare keyframes by the likelihood of their visual language models. Experiments on a subset of TRECVID-2004 video corpus show that visual language models built on visual keywords demonstrate promising performance for near-duplicate keyframe retrieval, which greatly speed up the retrieval speed although sacrifice a little performance compared to expensive point-to-point matching.
format text
author WU, Xiao
ZHAO, Wan-Lei
NGO, Chong-wah
author_facet WU, Xiao
ZHAO, Wan-Lei
NGO, Chong-wah
author_sort WU, Xiao
title Efficient near-duplicate keyframe retrieval with visual language models
title_short Efficient near-duplicate keyframe retrieval with visual language models
title_full Efficient near-duplicate keyframe retrieval with visual language models
title_fullStr Efficient near-duplicate keyframe retrieval with visual language models
title_full_unstemmed Efficient near-duplicate keyframe retrieval with visual language models
title_sort efficient near-duplicate keyframe retrieval with visual language models
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/6603
https://ink.library.smu.edu.sg/context/sis_research/article/7606/viewcontent/Efficient_Near_Duplicate_Keyframe_Retrie.pdf
_version_ 1770575999598067712