Beyond ranking loss : deep holographic networks for multi-label video search
In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning appr...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/144186 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-144186 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1441862020-10-20T01:52:53Z Beyond ranking loss : deep holographic networks for multi-label video search Chen, Zhuo Lin, Jie Wang, Zhe Chandrasekhar, Vijay Lin, Weisi School of Computer Science and Engineering 2019 IEEE International Conference on Image Processing (ICIP) Engineering::Computer science and engineering Video Retrieval Correlation In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning approaches driven by ranking losses. The holographic composition layer is parameter-free and enables less memory footprint compared with state-of-the-art. Towards multi-label video search at large scale, we present a new video benchmark built upon the YouTube-8M dataset. Extensive evaluations on this dataset demonstrate that DHN performs better than traditional deep metric learning approaches as well as other compositional networks. Agency for Science, Technology and Research (A*STAR) Ministry of Education (MOE) Accepted version This research was carried out at the Rapid- Rich Object Search (ROSE) Lab at the Nanyang Technological University and the Agency for Science, Technology and Research (A*STAR) under its Hardware-Software Cooptimisation for Deep Learning (Project No.A1892b0026). This work is also partially supported by Singapore Ministry of Education Tier-2 Fund MOE2016-T2-2-057(S). 2020-10-20T01:52:53Z 2020-10-20T01:52:53Z 2019 Conference Paper Chen, Z., Lin, J., Wang, Z., Chandrasekhar, V., & Lin, W. (2019). Beyond ranking loss : deep holographic networks for multi-label video search. 2019 IEEE International Conference on Image Processing (ICIP), 879-883. doi:10.1109/ICIP.2019.8802944 978-1-5386-6249-6 https://hdl.handle.net/10356/144186 10.1109/ICIP.2019.8802944 879 883 en © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/ICIP.2019.8802944 application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Video Retrieval Correlation |
spellingShingle |
Engineering::Computer science and engineering Video Retrieval Correlation Chen, Zhuo Lin, Jie Wang, Zhe Chandrasekhar, Vijay Lin, Weisi Beyond ranking loss : deep holographic networks for multi-label video search |
description |
In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning approaches driven by ranking losses. The holographic composition layer is parameter-free and enables less memory footprint compared with state-of-the-art. Towards multi-label video search at large scale, we present a new video benchmark built upon the YouTube-8M dataset. Extensive evaluations on this dataset demonstrate that DHN performs better than traditional deep metric learning approaches as well as other compositional networks. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Chen, Zhuo Lin, Jie Wang, Zhe Chandrasekhar, Vijay Lin, Weisi |
format |
Conference or Workshop Item |
author |
Chen, Zhuo Lin, Jie Wang, Zhe Chandrasekhar, Vijay Lin, Weisi |
author_sort |
Chen, Zhuo |
title |
Beyond ranking loss : deep holographic networks for multi-label video search |
title_short |
Beyond ranking loss : deep holographic networks for multi-label video search |
title_full |
Beyond ranking loss : deep holographic networks for multi-label video search |
title_fullStr |
Beyond ranking loss : deep holographic networks for multi-label video search |
title_full_unstemmed |
Beyond ranking loss : deep holographic networks for multi-label video search |
title_sort |
beyond ranking loss : deep holographic networks for multi-label video search |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/144186 |
_version_ |
1683492950674243584 |