Practical elimination of near-duplicates from Web video search

Current web video search results rely exclusively on text keywords or user-supplied tags. A search on typical popular video often returns many duplicate and near-duplicate videos in the top results. This paper outlines ways to cluster and filter out the nearduplicate video using a hierarchical appro...

Full description

Saved in:
Bibliographic Details
Main Authors: WU, Xiao, HAUPTMANN, Alexander G., NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6480
https://ink.library.smu.edu.sg/context/sis_research/article/7483/viewcontent/1291233.1291280.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7483
record_format dspace
spelling sg-smu-ink.sis_research-74832022-01-10T05:36:32Z Practical elimination of near-duplicates from Web video search WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah Current web video search results rely exclusively on text keywords or user-supplied tags. A search on typical popular video often returns many duplicate and near-duplicate videos in the top results. This paper outlines ways to cluster and filter out the nearduplicate video using a hierarchical approach. Initial triage is performed using fast signatures derived from color histograms. Only when a video cannot be clearly classified as novel or nearduplicate using global signatures, we apply a more expensive local feature based near-duplicate detection which provides very accurate duplicate analysis through more costly computation. The results of 24 queries in a data set of 12,790 videos retrieved from Google, Yahoo! and YouTube show that this hierarchical approach can dramatically reduce redundant video displayed to the user in the top result set, at relatively small computational cost. 2007-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6480 info:doi/10.1145/1291233.1291280 https://ink.library.smu.edu.sg/context/sis_research/article/7483/viewcontent/1291233.1291280.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Copy selection Filtering; Multimodality Near-duplicates Novelty and redundancy detection Similarity measure Web video Data Storage Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Copy selection
Filtering; Multimodality
Near-duplicates
Novelty and redundancy detection
Similarity measure
Web video
Data Storage Systems
Graphics and Human Computer Interfaces
spellingShingle Copy selection
Filtering; Multimodality
Near-duplicates
Novelty and redundancy detection
Similarity measure
Web video
Data Storage Systems
Graphics and Human Computer Interfaces
WU, Xiao
HAUPTMANN, Alexander G.
NGO, Chong-wah
Practical elimination of near-duplicates from Web video search
description Current web video search results rely exclusively on text keywords or user-supplied tags. A search on typical popular video often returns many duplicate and near-duplicate videos in the top results. This paper outlines ways to cluster and filter out the nearduplicate video using a hierarchical approach. Initial triage is performed using fast signatures derived from color histograms. Only when a video cannot be clearly classified as novel or nearduplicate using global signatures, we apply a more expensive local feature based near-duplicate detection which provides very accurate duplicate analysis through more costly computation. The results of 24 queries in a data set of 12,790 videos retrieved from Google, Yahoo! and YouTube show that this hierarchical approach can dramatically reduce redundant video displayed to the user in the top result set, at relatively small computational cost.
format text
author WU, Xiao
HAUPTMANN, Alexander G.
NGO, Chong-wah
author_facet WU, Xiao
HAUPTMANN, Alexander G.
NGO, Chong-wah
author_sort WU, Xiao
title Practical elimination of near-duplicates from Web video search
title_short Practical elimination of near-duplicates from Web video search
title_full Practical elimination of near-duplicates from Web video search
title_fullStr Practical elimination of near-duplicates from Web video search
title_full_unstemmed Practical elimination of near-duplicates from Web video search
title_sort practical elimination of near-duplicates from web video search
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/6480
https://ink.library.smu.edu.sg/context/sis_research/article/7483/viewcontent/1291233.1291280.pdf
_version_ 1770575973037637632