Deep hashing by discriminating hard examples

This paper tackles a rarely explored but critical problem within learning to hash, i.e., to learn hash codes that effectively discriminate hard similar and dissimilar examples, to empower large-scale image retrieval. Hard similar examples refer to image pairs from the same semantic class that demons...

Full description

Saved in:
Bibliographic Details
Main Authors: YAN, Cheng, PANG, Guansong, BAI, Xiao, SHEN, Chunhua, ZHOU, Jun, HANCOCK, Edwin
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7139
https://ink.library.smu.edu.sg/context/sis_research/article/8142/viewcontent/3343031.3350927.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8142
record_format dspace
spelling sg-smu-ink.sis_research-81422022-04-22T04:22:33Z Deep hashing by discriminating hard examples YAN, Cheng PANG, Guansong BAI, Xiao SHEN, Chunhua ZHOU, Jun HANCOCK, Edwin This paper tackles a rarely explored but critical problem within learning to hash, i.e., to learn hash codes that effectively discriminate hard similar and dissimilar examples, to empower large-scale image retrieval. Hard similar examples refer to image pairs from the same semantic class that demonstrate some shared appearance but have different fine-grained appearance. Hard dissimilar examples are image pairs that come from different semantic classes but exhibit similar appearance. These hard examples generally have a small distance due to the shared appearance. Therefore, effective encoding of the hard examples can well discriminate the relevant images within a small Hamming distance, enabling more accurate retrieval in the top-ranked returned images. However, most existing hashing methods cannot capture this key information as their optimization is dominated by easy examples, i.e., distant similar/dissimilar pairs that share no or limited appearance. To address this problem, we introduce a novel Gamma distribution-enabled and symmetric Kullback-Leibler divergence-based loss, which is dubbed dual hinge loss because it works similarly as imposing two smoothed hinge losses on the respective similar and dissimilar pairs. Specifically, the loss enforces exponentially variant penalization on the hard similar (dissimilar) examples to emphasize and learn their fine-grained difference. It meanwhile imposes a bounding penalization on easy similar (dissimilar) examples to prevent the dominance of the easy examples in the optimization while preserving the high-level similarity (dissimilarity). This enables our model to well encode the key information carried by both easy and hard examples. Extensive empirical results on three widely-used image retrieval datasets show that (i) our method consistently and substantially outperforms state-of-the-art competing methods using hash codes of the same length and (ii) our method can use significantly (e.g., 50%-75%) shorter hash codes to perform substantially better than, or comparably well to, the competing methods. 2019-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7139 info:doi/10.1145/3343031.3350927 https://ink.library.smu.edu.sg/context/sis_research/article/8142/viewcontent/3343031.3350927.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Image Retrieval Deep Hashing Hard Examples Hinge Loss Databases and Information Systems Data Storage Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Image Retrieval
Deep Hashing
Hard Examples
Hinge Loss
Databases and Information Systems
Data Storage Systems
spellingShingle Image Retrieval
Deep Hashing
Hard Examples
Hinge Loss
Databases and Information Systems
Data Storage Systems
YAN, Cheng
PANG, Guansong
BAI, Xiao
SHEN, Chunhua
ZHOU, Jun
HANCOCK, Edwin
Deep hashing by discriminating hard examples
description This paper tackles a rarely explored but critical problem within learning to hash, i.e., to learn hash codes that effectively discriminate hard similar and dissimilar examples, to empower large-scale image retrieval. Hard similar examples refer to image pairs from the same semantic class that demonstrate some shared appearance but have different fine-grained appearance. Hard dissimilar examples are image pairs that come from different semantic classes but exhibit similar appearance. These hard examples generally have a small distance due to the shared appearance. Therefore, effective encoding of the hard examples can well discriminate the relevant images within a small Hamming distance, enabling more accurate retrieval in the top-ranked returned images. However, most existing hashing methods cannot capture this key information as their optimization is dominated by easy examples, i.e., distant similar/dissimilar pairs that share no or limited appearance. To address this problem, we introduce a novel Gamma distribution-enabled and symmetric Kullback-Leibler divergence-based loss, which is dubbed dual hinge loss because it works similarly as imposing two smoothed hinge losses on the respective similar and dissimilar pairs. Specifically, the loss enforces exponentially variant penalization on the hard similar (dissimilar) examples to emphasize and learn their fine-grained difference. It meanwhile imposes a bounding penalization on easy similar (dissimilar) examples to prevent the dominance of the easy examples in the optimization while preserving the high-level similarity (dissimilarity). This enables our model to well encode the key information carried by both easy and hard examples. Extensive empirical results on three widely-used image retrieval datasets show that (i) our method consistently and substantially outperforms state-of-the-art competing methods using hash codes of the same length and (ii) our method can use significantly (e.g., 50%-75%) shorter hash codes to perform substantially better than, or comparably well to, the competing methods.
format text
author YAN, Cheng
PANG, Guansong
BAI, Xiao
SHEN, Chunhua
ZHOU, Jun
HANCOCK, Edwin
author_facet YAN, Cheng
PANG, Guansong
BAI, Xiao
SHEN, Chunhua
ZHOU, Jun
HANCOCK, Edwin
author_sort YAN, Cheng
title Deep hashing by discriminating hard examples
title_short Deep hashing by discriminating hard examples
title_full Deep hashing by discriminating hard examples
title_fullStr Deep hashing by discriminating hard examples
title_full_unstemmed Deep hashing by discriminating hard examples
title_sort deep hashing by discriminating hard examples
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/7139
https://ink.library.smu.edu.sg/context/sis_research/article/8142/viewcontent/3343031.3350927.pdf
_version_ 1770576230404325376