SQL-like interpretable interactive video search
Concept-free search, which embeds text and video signals in a joint space for retrieval, appears to be a new state-of-the-art. However, this new search paradigm suffers from two limitations. First, the search result is unpredictable and not interpretable. Second, the embedded features are in high-di...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6831 https://ink.library.smu.edu.sg/context/sis_research/article/7834/viewcontent/978_3_030_67835_7_34.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7834 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-78342024-06-06T09:10:26Z SQL-like interpretable interactive video search WU, Jiaxin NGUYEN, Phuong Anh MA, Zhixin NGO, Chong-wah Concept-free search, which embeds text and video signals in a joint space for retrieval, appears to be a new state-of-the-art. However, this new search paradigm suffers from two limitations. First, the search result is unpredictable and not interpretable. Second, the embedded features are in high-dimensional space hindering real-time indexing and search. In this paper, we present a new implementation of the Vireo video search system (Vireo-VSS), which employs a dual-task model to index each video segment with an embedding feature in a low dimension and a concept list for retrieval. The concept list serves as a reference to interpret its associated embedded feature. With these changes, a SQL-like querying interface is designed such that a user can specify the search content (subject, predicate, object) and constraint (logical condition) in a semi-structured way. The system will decompose the SQL-like query into multiple sub-queries depending on the constraint being specified. Each sub-query is translated into an embedding feature and a concept list for video retrieval. The search result is compiled by union or pruning of the search lists from multiple sub-queries. The SQL-like interface is also extended for temporal querying, by providing multiple SQL templates for users to specify the temporal evolution of a query. 2021-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6831 info:doi/10.1007/978-3-030-67835-7_34 https://ink.library.smu.edu.sg/context/sis_research/article/7834/viewcontent/978_3_030_67835_7_34.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Concept-based search Concept-free search Interactive video search SQL-like interpretable search Video browser showdown Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Concept-based search Concept-free search Interactive video search SQL-like interpretable search Video browser showdown Software Engineering |
spellingShingle |
Concept-based search Concept-free search Interactive video search SQL-like interpretable search Video browser showdown Software Engineering WU, Jiaxin NGUYEN, Phuong Anh MA, Zhixin NGO, Chong-wah SQL-like interpretable interactive video search |
description |
Concept-free search, which embeds text and video signals in a joint space for retrieval, appears to be a new state-of-the-art. However, this new search paradigm suffers from two limitations. First, the search result is unpredictable and not interpretable. Second, the embedded features are in high-dimensional space hindering real-time indexing and search. In this paper, we present a new implementation of the Vireo video search system (Vireo-VSS), which employs a dual-task model to index each video segment with an embedding feature in a low dimension and a concept list for retrieval. The concept list serves as a reference to interpret its associated embedded feature. With these changes, a SQL-like querying interface is designed such that a user can specify the search content (subject, predicate, object) and constraint (logical condition) in a semi-structured way. The system will decompose the SQL-like query into multiple sub-queries depending on the constraint being specified. Each sub-query is translated into an embedding feature and a concept list for video retrieval. The search result is compiled by union or pruning of the search lists from multiple sub-queries. The SQL-like interface is also extended for temporal querying, by providing multiple SQL templates for users to specify the temporal evolution of a query. |
format |
text |
author |
WU, Jiaxin NGUYEN, Phuong Anh MA, Zhixin NGO, Chong-wah |
author_facet |
WU, Jiaxin NGUYEN, Phuong Anh MA, Zhixin NGO, Chong-wah |
author_sort |
WU, Jiaxin |
title |
SQL-like interpretable interactive video search |
title_short |
SQL-like interpretable interactive video search |
title_full |
SQL-like interpretable interactive video search |
title_fullStr |
SQL-like interpretable interactive video search |
title_full_unstemmed |
SQL-like interpretable interactive video search |
title_sort |
sql-like interpretable interactive video search |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6831 https://ink.library.smu.edu.sg/context/sis_research/article/7834/viewcontent/978_3_030_67835_7_34.pdf |
_version_ |
1814047562277060608 |