Modality Mixture Projections for Semantic Video Event Detection

Event detection is one of the most fundamental components for various kinds of domain applications of video information system. In recent years, it has gained a considerable interest of practitioners and academics from different areas. While detecting video event has been the subject of extensive re...

Full description

Saved in:
Bibliographic Details
Main Authors: SHEN, Jialie, TAO, Dacheng, LI, Xuelong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2008
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/764
https://ink.library.smu.edu.sg/context/sis_research/article/1763/viewcontent/ModalityMixtureProjections_2008.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1763
record_format dspace
spelling sg-smu-ink.sis_research-17632017-03-23T03:26:50Z Modality Mixture Projections for Semantic Video Event Detection SHEN, Jialie TAO, Dacheng LI, Xuelong Event detection is one of the most fundamental components for various kinds of domain applications of video information system. In recent years, it has gained a considerable interest of practitioners and academics from different areas. While detecting video event has been the subject of extensive research efforts recently, much less existing approach has considered multimodal information and related efficiency issues. In this paper, we use a subspace selection technique to achieve fast and accurate video event detection using a subspace selection technique. The approach is capable of discriminating different classes and preserving the intramodal geometry of samples within an identical class. With the method, feature vectors presenting different kind of multi data can be easily projected from different identities and modalities onto a unified subspace, on which recognition process can be performed. Furthermore, the training stage is carried out once and we have a unified transformation matrix to project different modalities. Unlike existing multimodal detection systems, the new system works well when some modalities are not available. Experimental results based on soccer video and TRECVID news video collections demonstrate the effectiveness, efficiency and robustness of the proposed MMP for individual recognition tasks in comparison to the existing approaches. 2008-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/764 info:doi/10.1109/TCSVT.2008.2005607 https://ink.library.smu.edu.sg/context/sis_research/article/1763/viewcontent/ModalityMixtureProjections_2008.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Multimodule semantic event video detection Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Multimodule
semantic event video detection
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Multimodule
semantic event video detection
Databases and Information Systems
Numerical Analysis and Scientific Computing
SHEN, Jialie
TAO, Dacheng
LI, Xuelong
Modality Mixture Projections for Semantic Video Event Detection
description Event detection is one of the most fundamental components for various kinds of domain applications of video information system. In recent years, it has gained a considerable interest of practitioners and academics from different areas. While detecting video event has been the subject of extensive research efforts recently, much less existing approach has considered multimodal information and related efficiency issues. In this paper, we use a subspace selection technique to achieve fast and accurate video event detection using a subspace selection technique. The approach is capable of discriminating different classes and preserving the intramodal geometry of samples within an identical class. With the method, feature vectors presenting different kind of multi data can be easily projected from different identities and modalities onto a unified subspace, on which recognition process can be performed. Furthermore, the training stage is carried out once and we have a unified transformation matrix to project different modalities. Unlike existing multimodal detection systems, the new system works well when some modalities are not available. Experimental results based on soccer video and TRECVID news video collections demonstrate the effectiveness, efficiency and robustness of the proposed MMP for individual recognition tasks in comparison to the existing approaches.
format text
author SHEN, Jialie
TAO, Dacheng
LI, Xuelong
author_facet SHEN, Jialie
TAO, Dacheng
LI, Xuelong
author_sort SHEN, Jialie
title Modality Mixture Projections for Semantic Video Event Detection
title_short Modality Mixture Projections for Semantic Video Event Detection
title_full Modality Mixture Projections for Semantic Video Event Detection
title_fullStr Modality Mixture Projections for Semantic Video Event Detection
title_full_unstemmed Modality Mixture Projections for Semantic Video Event Detection
title_sort modality mixture projections for semantic video event detection
publisher Institutional Knowledge at Singapore Management University
publishDate 2008
url https://ink.library.smu.edu.sg/sis_research/764
https://ink.library.smu.edu.sg/context/sis_research/article/1763/viewcontent/ModalityMixtureProjections_2008.pdf
_version_ 1770570704821944320