A vector-based approach to broadcast audio database indexing and retrieval
This paper proposes a novel framework to index and retrieve audio content from broadcast database that contains both speech and music. In this framework, we model the acoustic events using hidden Markov models, which are then used to decode the audio content. The decoding results in the form of acou...
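As a rough illustration of the indexing idea summarized above, the following minimal sketch treats each decoded audio clip as a sequence of acoustic tokens, turns it into a TF-IDF weighted n-gram vector, and ranks clips by cosine similarity to a query. It is not the authors' implementation: the token labels (`sp`, `mu`, `sil`), the clip names, the choice of unigrams plus bigrams, and the smoothed IDF formula are all assumptions made for the example, and lattice-derived features are not covered.

```python
import math
from collections import Counter


def ngram_counts(tokens, n_max=2):
    """Count all n-grams of length 1..n_max in a token sequence."""
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def build_index(docs, n_max=2):
    """Build sparse TF-IDF vectors for a dict of {doc_id: token list}."""
    raw = {doc_id: ngram_counts(toks, n_max) for doc_id, toks in docs.items()}
    # Document frequency: number of documents containing each n-gram.
    df = Counter()
    for counts in raw.values():
        df.update(counts.keys())
    n_docs = len(docs)
    # Smoothed IDF (an assumed weighting, chosen so no weight is zero).
    idf = {t: math.log((1 + n_docs) / (1 + d)) + 1.0 for t, d in df.items()}
    vectors = {
        doc_id: {t: c * idf[t] for t, c in counts.items()}
        for doc_id, counts in raw.items()
    }
    return idf, vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def retrieve(query_tokens, idf, vectors, n_max=2):
    """Rank indexed documents by similarity to the query token sequence."""
    # N-grams never seen at indexing time are dropped from the query.
    q = {
        t: c * idf[t]
        for t, c in ngram_counts(query_tokens, n_max).items()
        if t in idf
    }
    ranked = sorted(
        ((cosine(q, v), doc_id) for doc_id, v in vectors.items()),
        reverse=True,
    )
    return [(doc_id, score) for score, doc_id in ranked]


# Hypothetical decoder output: one acoustic token sequence per clip.
DOCS = {
    "clip_speech": ["sp", "sp", "sil", "sp"],
    "clip_music": ["mu", "mu", "mu", "sil"],
    "clip_mixed": ["sp", "mu", "sp", "mu"],
}
IDF, VECTORS = build_index(DOCS)
RANKING = retrieve(["mu", "mu", "sil"], IDF, VECTORS)
print(RANKING[0][0])
```

Including bigrams lets the vectors capture some local ordering of acoustic events, which a plain bag of single tokens would lose.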
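Main Authors: | Wang, Lei; Li, Haizhou; Chng, Eng Siong |
---|---|
Other Authors: | School of Computer Engineering |
Format: | Conference or Workshop Item |
Language: | English |
Published: | 2013 |
Subjects: | DRNTU::Engineering::Computer science and engineering |
Online Access: | https://hdl.handle.net/10356/97783 http://hdl.handle.net/10220/17345 |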
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-97783 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-977832023-02-28T19:17:40Z A vector-based approach to broadcast audio database indexing and retrieval Wang, Lei Li, Haizhou Chng, Eng Siong School of Computer Engineering School of Physical and Mathematical Sciences IEEE International Conference on Multimedia and Expo (2007 : Beijing, China) DRNTU::Engineering::Computer science and engineering This paper proposes a novel framework to index and retrieve audio content from broadcast database that contains both speech and music. In this framework, we model the acoustic events using hidden Markov models, which are then used to decode the audio content. The decoding results in the form of acoustic token sequence and acoustic lattice are used to generate features for indexing and retrieval with the vector space model. Experiments were carried out on the TRECVID database and the results showed that the proposed framework is effective in audio information retrieval. The results also showed that the features generated from the acoustic lattice provide more accurate information than token sequence. Accepted version 2013-11-06T06:15:17Z 2019-12-06T19:46:37Z 2013-11-06T06:15:17Z 2019-12-06T19:46:37Z 2007 2007 Conference Paper Wang, L., Li, H., & Chng, E. S. (2007). A vector-based approach to broadcast audio database indexing and retrieval. 2007 IEEE International Conference on Multimedia and Expo, pp512-515. https://hdl.handle.net/10356/97783 http://hdl.handle.net/10220/17345 10.1109/ICME.2007.4284699 en © 2007 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. Published version of this article is available at http://dx.doi.org/10.1109/ICME.2007.4284699 application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering |
spellingShingle |
DRNTU::Engineering::Computer science and engineering Wang, Lei Li, Haizhou Chng, Eng Siong A vector-based approach to broadcast audio database indexing and retrieval |
description |
This paper proposes a novel framework to index and retrieve audio content from broadcast database that contains both speech and music. In this framework, we model the acoustic events using hidden Markov models, which are then used to decode the audio content. The decoding results in the form of acoustic token sequence and acoustic lattice are used to generate features for indexing and retrieval with the vector space model. Experiments were carried out on the TRECVID database and the results showed that the proposed framework is effective in audio information retrieval. The results also showed that the features generated from the acoustic lattice provide more accurate information than token sequence. |
author2 |
School of Computer Engineering |
author_facet |
School of Computer Engineering; Wang, Lei; Li, Haizhou; Chng, Eng Siong |
format |
Conference or Workshop Item |
author |
Wang, Lei; Li, Haizhou; Chng, Eng Siong |
author_sort |
Wang, Lei |
title |
A vector-based approach to broadcast audio database indexing and retrieval |
title_sort |
vector-based approach to broadcast audio database indexing and retrieval |
publishDate |
2013 |
url |
https://hdl.handle.net/10356/97783 http://hdl.handle.net/10220/17345 |
_version_ |
1759856025374031872 |