Practical elimination of near-duplicates from Web video search
Current web video search results rely exclusively on text keywords or user-supplied tags. A search on typical popular video often returns many duplicate and near-duplicate videos in the top results. This paper outlines ways to cluster and filter out the nearduplicate video using a hierarchical appro...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2007
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6480 https://ink.library.smu.edu.sg/context/sis_research/article/7483/viewcontent/1291233.1291280.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7483 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-74832022-01-10T05:36:32Z Practical elimination of near-duplicates from Web video search WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah Current web video search results rely exclusively on text keywords or user-supplied tags. A search on typical popular video often returns many duplicate and near-duplicate videos in the top results. This paper outlines ways to cluster and filter out the nearduplicate video using a hierarchical approach. Initial triage is performed using fast signatures derived from color histograms. Only when a video cannot be clearly classified as novel or nearduplicate using global signatures, we apply a more expensive local feature based near-duplicate detection which provides very accurate duplicate analysis through more costly computation. The results of 24 queries in a data set of 12,790 videos retrieved from Google, Yahoo! and YouTube show that this hierarchical approach can dramatically reduce redundant video displayed to the user in the top result set, at relatively small computational cost. 2007-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6480 info:doi/10.1145/1291233.1291280 https://ink.library.smu.edu.sg/context/sis_research/article/7483/viewcontent/1291233.1291280.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Copy selection Filtering; Multimodality Near-duplicates Novelty and redundancy detection Similarity measure Web video Data Storage Systems Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Copy selection Filtering; Multimodality Near-duplicates Novelty and redundancy detection Similarity measure Web video Data Storage Systems Graphics and Human Computer Interfaces |
spellingShingle |
Copy selection Filtering; Multimodality Near-duplicates Novelty and redundancy detection Similarity measure Web video Data Storage Systems Graphics and Human Computer Interfaces WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah Practical elimination of near-duplicates from Web video search |
description |
Current web video search results rely exclusively on text keywords or user-supplied tags. A search on typical popular video often returns many duplicate and near-duplicate videos in the top results. This paper outlines ways to cluster and filter out the nearduplicate video using a hierarchical approach. Initial triage is performed using fast signatures derived from color histograms. Only when a video cannot be clearly classified as novel or nearduplicate using global signatures, we apply a more expensive local feature based near-duplicate detection which provides very accurate duplicate analysis through more costly computation. The results of 24 queries in a data set of 12,790 videos retrieved from Google, Yahoo! and YouTube show that this hierarchical approach can dramatically reduce redundant video displayed to the user in the top result set, at relatively small computational cost. |
format |
text |
author |
WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah |
author_facet |
WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah |
author_sort |
WU, Xiao |
title |
Practical elimination of near-duplicates from Web video search |
title_short |
Practical elimination of near-duplicates from Web video search |
title_full |
Practical elimination of near-duplicates from Web video search |
title_fullStr |
Practical elimination of near-duplicates from Web video search |
title_full_unstemmed |
Practical elimination of near-duplicates from Web video search |
title_sort |
practical elimination of near-duplicates from web video search |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2007 |
url |
https://ink.library.smu.edu.sg/sis_research/6480 https://ink.library.smu.edu.sg/context/sis_research/article/7483/viewcontent/1291233.1291280.pdf |
_version_ |
1770575973037637632 |