Semantic reasoning in zero example video event retrieval

Searching in digital video data for high-level events, such as a parade or a car accident, is challenging when the query is textual and lacks visual example images or videos. Current research in deep neural networks is highly beneficial for the retrieval of high-level events using visual examples, b...

Full description

Saved in:
Bibliographic Details
Main Authors: DE BOER, M. H. T., LU, Yi-Jie, ZHANG, Hao, SCHUTTE, Klamer, NGO, Chong-wah, KRAAIJ, Wessel
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6307
https://ink.library.smu.edu.sg/context/sis_research/article/7310/viewcontent/178704.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7310
record_format dspace
spelling sg-smu-ink.sis_research-73102021-11-23T06:58:43Z Semantic reasoning in zero example video event retrieval DE BOER, M. H. T. LU, Yi-Jie ZHANG, Hao SCHUTTE, Klamer NGO, Chong-wah KRAAIJ, Wessel Searching in digital video data for high-level events, such as a parade or a car accident, is challenging when the query is textual and lacks visual example images or videos. Current research in deep neural networks is highly beneficial for the retrieval of high-level events using visual examples, but without examples it is still hard to (1) determine which concepts are useful to pre-train (Vocabulary challenge) and (2) which pre-trained concept detectors are relevant for a certain unseen high-level event (Concept Selection challenge). In our article, we present our Semantic Event Retrieval Systemwhich (1) shows the importance of high-level concepts in a vocabulary for the retrieval of complex and generic high-level events and (2) uses a novel concept selection method (i-w2v) based on semantic embeddings. Our experiments on the international TRECVID Multimedia Event Detection benchmark show that a diverse vocabulary including high-level concepts improves performance on the retrieval of high-level events in videos and that our novel method outperforms a knowledge-based concept selection method. 2017-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6307 info:doi/10.1145/3131288 https://ink.library.smu.edu.sg/context/sis_research/article/7310/viewcontent/178704.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Content-based visual information retrieval;multimedia event detection;zero shot;semantics Data Storage Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Content-based visual information retrieval;multimedia event detection;zero shot;semantics
Data Storage Systems
Graphics and Human Computer Interfaces
spellingShingle Content-based visual information retrieval;multimedia event detection;zero shot;semantics
Data Storage Systems
Graphics and Human Computer Interfaces
DE BOER, M. H. T.
LU, Yi-Jie
ZHANG, Hao
SCHUTTE, Klamer
NGO, Chong-wah
KRAAIJ, Wessel
Semantic reasoning in zero example video event retrieval
description Searching in digital video data for high-level events, such as a parade or a car accident, is challenging when the query is textual and lacks visual example images or videos. Current research in deep neural networks is highly beneficial for the retrieval of high-level events using visual examples, but without examples it is still hard to (1) determine which concepts are useful to pre-train (Vocabulary challenge) and (2) which pre-trained concept detectors are relevant for a certain unseen high-level event (Concept Selection challenge). In our article, we present our Semantic Event Retrieval Systemwhich (1) shows the importance of high-level concepts in a vocabulary for the retrieval of complex and generic high-level events and (2) uses a novel concept selection method (i-w2v) based on semantic embeddings. Our experiments on the international TRECVID Multimedia Event Detection benchmark show that a diverse vocabulary including high-level concepts improves performance on the retrieval of high-level events in videos and that our novel method outperforms a knowledge-based concept selection method.
format text
author DE BOER, M. H. T.
LU, Yi-Jie
ZHANG, Hao
SCHUTTE, Klamer
NGO, Chong-wah
KRAAIJ, Wessel
author_facet DE BOER, M. H. T.
LU, Yi-Jie
ZHANG, Hao
SCHUTTE, Klamer
NGO, Chong-wah
KRAAIJ, Wessel
author_sort DE BOER, M. H. T.
title Semantic reasoning in zero example video event retrieval
title_short Semantic reasoning in zero example video event retrieval
title_full Semantic reasoning in zero example video event retrieval
title_fullStr Semantic reasoning in zero example video event retrieval
title_full_unstemmed Semantic reasoning in zero example video event retrieval
title_sort semantic reasoning in zero example video event retrieval
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/6307
https://ink.library.smu.edu.sg/context/sis_research/article/7310/viewcontent/178704.pdf
_version_ 1770575931517173760