SQL-like interpretable interactive video search

Concept-free search, which embeds text and video signals in a joint space for retrieval, appears to be a new state-of-the-art. However, this new search paradigm suffers from two limitations. First, the search result is unpredictable and not interpretable. Second, the embedded features are in high-di...

Full description

Saved in:
Bibliographic Details
Main Authors: WU, Jiaxin, NGUYEN, Phuong Anh, MA, Zhixin, NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6831
https://ink.library.smu.edu.sg/context/sis_research/article/7834/viewcontent/978_3_030_67835_7_34.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7834
record_format dspace
spelling sg-smu-ink.sis_research-78342024-06-06T09:10:26Z SQL-like interpretable interactive video search WU, Jiaxin NGUYEN, Phuong Anh MA, Zhixin NGO, Chong-wah Concept-free search, which embeds text and video signals in a joint space for retrieval, appears to be a new state-of-the-art. However, this new search paradigm suffers from two limitations. First, the search result is unpredictable and not interpretable. Second, the embedded features are in high-dimensional space hindering real-time indexing and search. In this paper, we present a new implementation of the Vireo video search system (Vireo-VSS), which employs a dual-task model to index each video segment with an embedding feature in a low dimension and a concept list for retrieval. The concept list serves as a reference to interpret its associated embedded feature. With these changes, a SQL-like querying interface is designed such that a user can specify the search content (subject, predicate, object) and constraint (logical condition) in a semi-structured way. The system will decompose the SQL-like query into multiple sub-queries depending on the constraint being specified. Each sub-query is translated into an embedding feature and a concept list for video retrieval. The search result is compiled by union or pruning of the search lists from multiple sub-queries. The SQL-like interface is also extended for temporal querying, by providing multiple SQL templates for users to specify the temporal evolution of a query. 2021-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6831 info:doi/10.1007/978-3-030-67835-7_34 https://ink.library.smu.edu.sg/context/sis_research/article/7834/viewcontent/978_3_030_67835_7_34.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Concept-based search Concept-free search Interactive video search SQL-like interpretable search Video browser showdown Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Concept-based search
Concept-free search
Interactive video search
SQL-like interpretable search
Video browser showdown
Software Engineering
spellingShingle Concept-based search
Concept-free search
Interactive video search
SQL-like interpretable search
Video browser showdown
Software Engineering
WU, Jiaxin
NGUYEN, Phuong Anh
MA, Zhixin
NGO, Chong-wah
SQL-like interpretable interactive video search
description Concept-free search, which embeds text and video signals in a joint space for retrieval, appears to be a new state-of-the-art. However, this new search paradigm suffers from two limitations. First, the search result is unpredictable and not interpretable. Second, the embedded features are in high-dimensional space hindering real-time indexing and search. In this paper, we present a new implementation of the Vireo video search system (Vireo-VSS), which employs a dual-task model to index each video segment with an embedding feature in a low dimension and a concept list for retrieval. The concept list serves as a reference to interpret its associated embedded feature. With these changes, a SQL-like querying interface is designed such that a user can specify the search content (subject, predicate, object) and constraint (logical condition) in a semi-structured way. The system will decompose the SQL-like query into multiple sub-queries depending on the constraint being specified. Each sub-query is translated into an embedding feature and a concept list for video retrieval. The search result is compiled by union or pruning of the search lists from multiple sub-queries. The SQL-like interface is also extended for temporal querying, by providing multiple SQL templates for users to specify the temporal evolution of a query.
format text
author WU, Jiaxin
NGUYEN, Phuong Anh
MA, Zhixin
NGO, Chong-wah
author_facet WU, Jiaxin
NGUYEN, Phuong Anh
MA, Zhixin
NGO, Chong-wah
author_sort WU, Jiaxin
title SQL-like interpretable interactive video search
title_short SQL-like interpretable interactive video search
title_full SQL-like interpretable interactive video search
title_fullStr SQL-like interpretable interactive video search
title_full_unstemmed SQL-like interpretable interactive video search
title_sort sql-like interpretable interactive video search
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6831
https://ink.library.smu.edu.sg/context/sis_research/article/7834/viewcontent/978_3_030_67835_7_34.pdf
_version_ 1814047562277060608