Efficient near-duplicate keyframe retrieval with visual language models

Near-duplicate keyframe retrieval is a critical task for video similarity measure, video threading and tracking. In this paper, instead of using expensive point-to-point matching on keypoints, we investigate the visual language models built on visual keywords to speed up the near-duplicate keyframe...

Bibliographic Details
Main Authors: WU, Xiao, ZHAO, Wan-Lei, NGO, Chong-wah
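Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects: Databases and Information Systems; Graphics and Human Computer Interfaces
Online Access: https://ink.library.smu.edu.sg/sis_research/6603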
https://ink.library.smu.edu.sg/context/sis_research/article/7606/viewcontent/Efficient_Near_Duplicate_Keyframe_Retrie.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-7606
record_format dspace
spelling sg-smu-ink.sis_research-7606 2022-01-13T08:19:16Z 2007-07-01T07:00:00Z text application/pdf info:doi/10.1109/icme.2007.4284696 http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Graphics and Human Computer Interfaces
description Near-duplicate keyframe retrieval is a critical task for video similarity measurement, video threading, and tracking. In this paper, instead of using expensive point-to-point matching on keypoints, we investigate visual language models built on visual keywords to speed up near-duplicate keyframe retrieval. The main idea is to estimate a visual language model on visual keywords for each keyframe and to compare keyframes by the likelihood of their visual language models. Experiments on a subset of the TRECVID-2004 video corpus show that visual language models built on visual keywords perform promisingly for near-duplicate keyframe retrieval: they greatly speed up retrieval, at the cost of a small loss in performance compared to expensive point-to-point matching.
format text
author WU, Xiao
ZHAO, Wan-Lei
NGO, Chong-wah
title Efficient near-duplicate keyframe retrieval with visual language models
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
_version_ 1770575999598067712
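
The description field above summarizes the main idea as estimating a language model over visual keywords for each keyframe and comparing keyframes by likelihood. The record does not specify the model order or the smoothing scheme, so the following is only a minimal sketch under stated assumptions: each keyframe is already reduced to a list of integer visual-keyword ids, the model is a unigram model, and smoothing is a linear interpolation with a collection-wide background model. All function names, the `lam` parameter, and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def background_model(keyframes, vocab_size):
    """Collection-wide keyword distribution with add-one smoothing."""
    counts = Counter(w for words in keyframes.values() for w in words)
    total = sum(counts.values()) + vocab_size
    return [(counts[w] + 1) / total for w in range(vocab_size)]


def keyframe_model(words, background, lam=0.5):
    """Unigram model of one keyframe, interpolated with the background.

    lam is an assumed smoothing weight, not a value from the paper.
    """
    counts = Counter(words)
    n = len(words)
    return [
        (1 - lam) * (counts[w] / n if n else 0.0) + lam * background[w]
        for w in range(len(background))
    ]


def log_likelihood(query_words, model):
    """Log-probability of the query's keywords under a keyframe model."""
    return sum(math.log(model[w]) for w in query_words)


def rank(query_id, keyframes, vocab_size, lam=0.5):
    """Rank all other keyframes by how well their model explains the query."""
    bg = background_model(keyframes, vocab_size)
    query = keyframes[query_id]
    scores = {
        kid: log_likelihood(query, keyframe_model(words, bg, lam))
        for kid, words in keyframes.items()
        if kid != query_id
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy data (made up): "a" and "b" share most keywords, "c" shares none.
keyframes = {
    "a": [0, 0, 1, 2, 2, 2],
    "b": [0, 0, 1, 2, 2, 3],
    "c": [4, 5, 5, 6, 7, 7],
}
ranked = rank("a", keyframes, vocab_size=8)
print(ranked)
```

Scoring a candidate this way costs one pass over the query's keywords, which is the kind of saving over pairwise keypoint matching that the description refers to. The actual models and comparison measure used by the authors may differ from this sketch.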