Probabilistic indexing of media sequences

Accurate and fast nearest neighbor search is often required in applications involving media sequences, such as duplicate detection in video collections, music retrieval in digital libraries, and event discovery in streaming documents. Among various related techniques, developing indexing scheme is p...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	SHEN, Jialie, WANG, Meng, YAN, Shuicheng, QI, TIAN
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2011
الموضوعات:	Databases and Information Systems Numerical Analysis and Scientific Computing
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/1445
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

id	sg-smu-ink.sis_research-2444
record_format	dspace
spelling	sg-smu-ink.sis_research-24442012-01-10T09:43:58Z Probabilistic indexing of media sequences SHEN, Jialie WANG, Meng YAN, Shuicheng QI, TIAN Accurate and fast nearest neighbor search is often required in applications involving media sequences, such as duplicate detection in video collections, music retrieval in digital libraries, and event discovery in streaming documents. Among various related techniques, developing indexing scheme is probably most challenging because of its complexity. This paper documents a novel scheme called HMMH (Hidden Markov Model based Hashing) to facilitate scalable and efficient media sequence retrieval based on advanced hashing algorithm. Main conjecture of our approach is that media sequence's content is complex and the associated dynamic characteristics cannot be ignored. As such, we propose to use hidden Markov model (HMM) for comprehensive media sequence modeling and calculate HMM supervector to represent segments of media sequence. With the novel scheme, more discriminative information about temporal structure can be captured. In addition, the difference of two media sequences is approximated by the Euclidean distance between the associated HMM supervectors. The statistical property enables the proposed HMMH to enjoy good system flexibility - various hashing algorithms (e.g., LSH and SPH) can be applied on HMM supervectors for effective binary code calculation. Our experimental results using both large scale video and music collections demonstrate that the proposed scheme various kinds of advantages over existing techniques. 2011-08-01T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/1445 info:doi/10.1145/2043674.2043705 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Databases and Information Systems Numerical Analysis and Scientific Computing
spellingShingle	Databases and Information Systems Numerical Analysis and Scientific Computing SHEN, Jialie WANG, Meng YAN, Shuicheng QI, TIAN Probabilistic indexing of media sequences
description	Accurate and fast nearest neighbor search is often required in applications involving media sequences, such as duplicate detection in video collections, music retrieval in digital libraries, and event discovery in streaming documents. Among various related techniques, developing indexing scheme is probably most challenging because of its complexity. This paper documents a novel scheme called HMMH (Hidden Markov Model based Hashing) to facilitate scalable and efficient media sequence retrieval based on advanced hashing algorithm. Main conjecture of our approach is that media sequence's content is complex and the associated dynamic characteristics cannot be ignored. As such, we propose to use hidden Markov model (HMM) for comprehensive media sequence modeling and calculate HMM supervector to represent segments of media sequence. With the novel scheme, more discriminative information about temporal structure can be captured. In addition, the difference of two media sequences is approximated by the Euclidean distance between the associated HMM supervectors. The statistical property enables the proposed HMMH to enjoy good system flexibility - various hashing algorithms (e.g., LSH and SPH) can be applied on HMM supervectors for effective binary code calculation. Our experimental results using both large scale video and music collections demonstrate that the proposed scheme various kinds of advantages over existing techniques.
format	text
author	SHEN, Jialie WANG, Meng YAN, Shuicheng QI, TIAN
author_facet	SHEN, Jialie WANG, Meng YAN, Shuicheng QI, TIAN
author_sort	SHEN, Jialie
title	Probabilistic indexing of media sequences
title_short	Probabilistic indexing of media sequences
title_full	Probabilistic indexing of media sequences
title_fullStr	Probabilistic indexing of media sequences
title_full_unstemmed	Probabilistic indexing of media sequences
title_sort	probabilistic indexing of media sequences
publisher	Institutional Knowledge at Singapore Management University
publishDate	2011
url	https://ink.library.smu.edu.sg/sis_research/1445
_version_	1770571150956429312

Probabilistic indexing of media sequences

مواد مشابهة