Beyond ranking loss : deep holographic networks for multi-label video search

In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning appr...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Zhuo, Lin, Jie, Wang, Zhe, Chandrasekhar, Vijay, Lin, Weisi
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/144186
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-144186
record_format dspace
spelling sg-ntu-dr.10356-1441862020-10-20T01:52:53Z Beyond ranking loss : deep holographic networks for multi-label video search Chen, Zhuo Lin, Jie Wang, Zhe Chandrasekhar, Vijay Lin, Weisi School of Computer Science and Engineering 2019 IEEE International Conference on Image Processing (ICIP) Engineering::Computer science and engineering Video Retrieval Correlation In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning approaches driven by ranking losses. The holographic composition layer is parameter-free and enables less memory footprint compared with state-of-the-art. Towards multi-label video search at large scale, we present a new video benchmark built upon the YouTube-8M dataset. Extensive evaluations on this dataset demonstrate that DHN performs better than traditional deep metric learning approaches as well as other compositional networks. Agency for Science, Technology and Research (A*STAR) Ministry of Education (MOE) Accepted version This research was carried out at the Rapid- Rich Object Search (ROSE) Lab at the Nanyang Technological University and the Agency for Science, Technology and Research (A*STAR) under its Hardware-Software Cooptimisation for Deep Learning (Project No.A1892b0026). This work is also partially supported by Singapore Ministry of Education Tier-2 Fund MOE2016-T2-2-057(S). 2020-10-20T01:52:53Z 2020-10-20T01:52:53Z 2019 Conference Paper Chen, Z., Lin, J., Wang, Z., Chandrasekhar, V., & Lin, W. (2019). Beyond ranking loss : deep holographic networks for multi-label video search. 2019 IEEE International Conference on Image Processing (ICIP), 879-883. doi:10.1109/ICIP.2019.8802944 978-1-5386-6249-6 https://hdl.handle.net/10356/144186 10.1109/ICIP.2019.8802944 879 883 en © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/ICIP.2019.8802944 application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Video Retrieval
Correlation
spellingShingle Engineering::Computer science and engineering
Video Retrieval
Correlation
Chen, Zhuo
Lin, Jie
Wang, Zhe
Chandrasekhar, Vijay
Lin, Weisi
Beyond ranking loss : deep holographic networks for multi-label video search
description In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning approaches driven by ranking losses. The holographic composition layer is parameter-free and enables less memory footprint compared with state-of-the-art. Towards multi-label video search at large scale, we present a new video benchmark built upon the YouTube-8M dataset. Extensive evaluations on this dataset demonstrate that DHN performs better than traditional deep metric learning approaches as well as other compositional networks.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Chen, Zhuo
Lin, Jie
Wang, Zhe
Chandrasekhar, Vijay
Lin, Weisi
format Conference or Workshop Item
author Chen, Zhuo
Lin, Jie
Wang, Zhe
Chandrasekhar, Vijay
Lin, Weisi
author_sort Chen, Zhuo
title Beyond ranking loss : deep holographic networks for multi-label video search
title_short Beyond ranking loss : deep holographic networks for multi-label video search
title_full Beyond ranking loss : deep holographic networks for multi-label video search
title_fullStr Beyond ranking loss : deep holographic networks for multi-label video search
title_full_unstemmed Beyond ranking loss : deep holographic networks for multi-label video search
title_sort beyond ranking loss : deep holographic networks for multi-label video search
publishDate 2020
url https://hdl.handle.net/10356/144186
_version_ 1683492950674243584