Deep hashing by discriminating hard examples

This paper tackles a rarely explored but critical problem within learning to hash, i.e., to learn hash codes that effectively discriminate hard similar and dissimilar examples, to empower large-scale image retrieval. Hard similar examples refer to image pairs from the same semantic class that demons...

Full description

Saved in:

Bibliographic Details
Main Authors:	YAN, Cheng, PANG, Guansong, BAI, Xiao, SHEN, Chunhua, ZHOU, Jun, HANCOCK, Edwin
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2019
Subjects:	Image Retrieval Deep Hashing Hard Examples Hinge Loss Databases and Information Systems Data Storage Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/7139 https://ink.library.smu.edu.sg/context/sis_research/article/8142/viewcontent/3343031.3350927.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8142
record_format	dspace
spelling	sg-smu-ink.sis_research-81422022-04-22T04:22:33Z Deep hashing by discriminating hard examples YAN, Cheng PANG, Guansong BAI, Xiao SHEN, Chunhua ZHOU, Jun HANCOCK, Edwin This paper tackles a rarely explored but critical problem within learning to hash, i.e., to learn hash codes that effectively discriminate hard similar and dissimilar examples, to empower large-scale image retrieval. Hard similar examples refer to image pairs from the same semantic class that demonstrate some shared appearance but have different fine-grained appearance. Hard dissimilar examples are image pairs that come from different semantic classes but exhibit similar appearance. These hard examples generally have a small distance due to the shared appearance. Therefore, effective encoding of the hard examples can well discriminate the relevant images within a small Hamming distance, enabling more accurate retrieval in the top-ranked returned images. However, most existing hashing methods cannot capture this key information as their optimization is dominated by easy examples, i.e., distant similar/dissimilar pairs that share no or limited appearance. To address this problem, we introduce a novel Gamma distribution-enabled and symmetric Kullback-Leibler divergence-based loss, which is dubbed dual hinge loss because it works similarly as imposing two smoothed hinge losses on the respective similar and dissimilar pairs. Specifically, the loss enforces exponentially variant penalization on the hard similar (dissimilar) examples to emphasize and learn their fine-grained difference. It meanwhile imposes a bounding penalization on easy similar (dissimilar) examples to prevent the dominance of the easy examples in the optimization while preserving the high-level similarity (dissimilarity). This enables our model to well encode the key information carried by both easy and hard examples. Extensive empirical results on three widely-used image retrieval datasets show that (i) our method consistently and substantially outperforms state-of-the-art competing methods using hash codes of the same length and (ii) our method can use significantly (e.g., 50%-75%) shorter hash codes to perform substantially better than, or comparably well to, the competing methods. 2019-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7139 info:doi/10.1145/3343031.3350927 https://ink.library.smu.edu.sg/context/sis_research/article/8142/viewcontent/3343031.3350927.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Image Retrieval Deep Hashing Hard Examples Hinge Loss Databases and Information Systems Data Storage Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Image Retrieval Deep Hashing Hard Examples Hinge Loss Databases and Information Systems Data Storage Systems
spellingShingle	Image Retrieval Deep Hashing Hard Examples Hinge Loss Databases and Information Systems Data Storage Systems YAN, Cheng PANG, Guansong BAI, Xiao SHEN, Chunhua ZHOU, Jun HANCOCK, Edwin Deep hashing by discriminating hard examples
description	This paper tackles a rarely explored but critical problem within learning to hash, i.e., to learn hash codes that effectively discriminate hard similar and dissimilar examples, to empower large-scale image retrieval. Hard similar examples refer to image pairs from the same semantic class that demonstrate some shared appearance but have different fine-grained appearance. Hard dissimilar examples are image pairs that come from different semantic classes but exhibit similar appearance. These hard examples generally have a small distance due to the shared appearance. Therefore, effective encoding of the hard examples can well discriminate the relevant images within a small Hamming distance, enabling more accurate retrieval in the top-ranked returned images. However, most existing hashing methods cannot capture this key information as their optimization is dominated by easy examples, i.e., distant similar/dissimilar pairs that share no or limited appearance. To address this problem, we introduce a novel Gamma distribution-enabled and symmetric Kullback-Leibler divergence-based loss, which is dubbed dual hinge loss because it works similarly as imposing two smoothed hinge losses on the respective similar and dissimilar pairs. Specifically, the loss enforces exponentially variant penalization on the hard similar (dissimilar) examples to emphasize and learn their fine-grained difference. It meanwhile imposes a bounding penalization on easy similar (dissimilar) examples to prevent the dominance of the easy examples in the optimization while preserving the high-level similarity (dissimilarity). This enables our model to well encode the key information carried by both easy and hard examples. Extensive empirical results on three widely-used image retrieval datasets show that (i) our method consistently and substantially outperforms state-of-the-art competing methods using hash codes of the same length and (ii) our method can use significantly (e.g., 50%-75%) shorter hash codes to perform substantially better than, or comparably well to, the competing methods.
format	text
author	YAN, Cheng PANG, Guansong BAI, Xiao SHEN, Chunhua ZHOU, Jun HANCOCK, Edwin
author_facet	YAN, Cheng PANG, Guansong BAI, Xiao SHEN, Chunhua ZHOU, Jun HANCOCK, Edwin
author_sort	YAN, Cheng
title	Deep hashing by discriminating hard examples
title_short	Deep hashing by discriminating hard examples
title_full	Deep hashing by discriminating hard examples
title_fullStr	Deep hashing by discriminating hard examples
title_full_unstemmed	Deep hashing by discriminating hard examples
title_sort	deep hashing by discriminating hard examples
publisher	Institutional Knowledge at Singapore Management University
publishDate	2019
url	https://ink.library.smu.edu.sg/sis_research/7139 https://ink.library.smu.edu.sg/context/sis_research/article/8142/viewcontent/3343031.3350927.pdf
_version_	1770576230404325376

Deep hashing by discriminating hard examples

Similar Items