Large-scale near-duplicate web video search: Challenge and opportunity

The massive amount of near-duplicate and duplicate web videos has presented both challenge and opportunity to multimedia computing. On one hand, browsing videos on Internet becomes highly inefficient for the need to repeatedly fast-forward videos of similar content. On the other hand, the tremendous...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHAO, Wan-Lei, TAN, Song, NGO, Chong-wah
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2009
Subjects:	Data Storage Systems Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/6641 https://ink.library.smu.edu.sg/context/sis_research/article/7644/viewcontent/icme09_wanlei.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-7644
record_format	dspace
spelling	sg-smu-ink.sis_research-76442022-01-14T03:33:14Z Large-scale near-duplicate web video search: Challenge and opportunity ZHAO, Wan-Lei TAN, Song NGO, Chong-wah The massive amount of near-duplicate and duplicate web videos has presented both challenge and opportunity to multimedia computing. On one hand, browsing videos on Internet becomes highly inefficient for the need to repeatedly fast-forward videos of similar content. On the other hand, the tremendous amount of somewhat duplicate content also makes some traditionally difficult vision tasks become simple and easy. For example, annotating pictures can be as simple as recycling the tags of Internet images retrieved from image search engines. Such tasks, of either to eliminate or to recycle near-duplicates, can usually be achieved by the nearest neighbor search of videos from Internet. The fundamental problem lies on the scalability of a search technique, in face of the intractable volume of videos which keep rolling on the web. In this paper, we investigate scalability of several well-known features including color signature and visual keywords for web-based retrieval. Indexing these features based on embedding technique for scalable retrieval is also presented. On an Internet video dataset of more than 700 hours collected during years 2006 to 2008, we show some preliminary insights to the challenge of scalable retrieval. 2009-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6641 info:doi/10.1109/ICME.2009.5202830 https://ink.library.smu.edu.sg/context/sis_research/article/7644/viewcontent/icme09_wanlei.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Data Storage Systems Graphics and Human Computer Interfaces
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Data Storage Systems Graphics and Human Computer Interfaces
spellingShingle	Data Storage Systems Graphics and Human Computer Interfaces ZHAO, Wan-Lei TAN, Song NGO, Chong-wah Large-scale near-duplicate web video search: Challenge and opportunity
description	The massive amount of near-duplicate and duplicate web videos has presented both challenge and opportunity to multimedia computing. On one hand, browsing videos on Internet becomes highly inefficient for the need to repeatedly fast-forward videos of similar content. On the other hand, the tremendous amount of somewhat duplicate content also makes some traditionally difficult vision tasks become simple and easy. For example, annotating pictures can be as simple as recycling the tags of Internet images retrieved from image search engines. Such tasks, of either to eliminate or to recycle near-duplicates, can usually be achieved by the nearest neighbor search of videos from Internet. The fundamental problem lies on the scalability of a search technique, in face of the intractable volume of videos which keep rolling on the web. In this paper, we investigate scalability of several well-known features including color signature and visual keywords for web-based retrieval. Indexing these features based on embedding technique for scalable retrieval is also presented. On an Internet video dataset of more than 700 hours collected during years 2006 to 2008, we show some preliminary insights to the challenge of scalable retrieval.
format	text
author	ZHAO, Wan-Lei TAN, Song NGO, Chong-wah
author_facet	ZHAO, Wan-Lei TAN, Song NGO, Chong-wah
author_sort	ZHAO, Wan-Lei
title	Large-scale near-duplicate web video search: Challenge and opportunity
title_short	Large-scale near-duplicate web video search: Challenge and opportunity
title_full	Large-scale near-duplicate web video search: Challenge and opportunity
title_fullStr	Large-scale near-duplicate web video search: Challenge and opportunity
title_full_unstemmed	Large-scale near-duplicate web video search: Challenge and opportunity
title_sort	large-scale near-duplicate web video search: challenge and opportunity
publisher	Institutional Knowledge at Singapore Management University
publishDate	2009
url	https://ink.library.smu.edu.sg/sis_research/6641 https://ink.library.smu.edu.sg/context/sis_research/article/7644/viewcontent/icme09_wanlei.pdf
_version_	1770576015040446464

Large-scale near-duplicate web video search: Challenge and opportunity

Similar Items