Fast covariant VLAD for image search

Vector of locally aggregated descriptor (VLAD) is a popular image encoding approach for its simplicity and better scalability over conventional bag-of-visual-word approach. In order to enhance its distinctiveness and geometric invariance, covariant VLAD (CVLAD) is proposed to pool local features bas...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHAO, Wan-Lei, NGO, Chong-wah, WANG, Hanzi
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6310
https://ink.library.smu.edu.sg/context/sis_research/article/7313/viewcontent/07499824.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7313
record_format dspace
spelling sg-smu-ink.sis_research-73132021-11-23T06:51:33Z Fast covariant VLAD for image search ZHAO, Wan-Lei NGO, Chong-wah WANG, Hanzi Vector of locally aggregated descriptor (VLAD) is a popular image encoding approach for its simplicity and better scalability over conventional bag-of-visual-word approach. In order to enhance its distinctiveness and geometric invariance, covariant VLAD (CVLAD) is proposed to pool local features based on their dominant orientations/characteristic scales, which leads to a geometric-aware representation. This representation achieves rotation/scale invariance when being associated with circular matching. However, the circular matching induces several times of computation overhead, which makes CVLAD hardly suitable for large-scale retrieval tasks. In this paper, the issue of computation overhead is alleviated by performing the circular matching in CVLAD's frequency domain. In addition, by operating PCA on CVLAD in its frequency domain, much better scalability is achieved than when it is undertaken in the original feature space. Furthermore, the high-dimensional CVLAD subvectors are converted to dozens of very low-dimensional subvectors, which is possible when transforming the feature into its frequency domain. Nearest neighbor search is therefore undertaken on very low-dimensional subspaces, which becomes easily tractable. The effectiveness of our approach is demonstrated in the retrieval scenario on popular benchmarks comprising up to 1 million database images. 2016-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6310 info:doi/10.1109/TMM.2016.2585023 https://ink.library.smu.edu.sg/context/sis_research/article/7313/viewcontent/07499824.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Circular matching covariant pooling covariant vector of locally aggregated descriptor (CVLAD) similar image search Computer Sciences Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Circular matching
covariant pooling
covariant vector of locally aggregated descriptor (CVLAD)
similar image search
Computer Sciences
Graphics and Human Computer Interfaces
spellingShingle Circular matching
covariant pooling
covariant vector of locally aggregated descriptor (CVLAD)
similar image search
Computer Sciences
Graphics and Human Computer Interfaces
ZHAO, Wan-Lei
NGO, Chong-wah
WANG, Hanzi
Fast covariant VLAD for image search
description Vector of locally aggregated descriptor (VLAD) is a popular image encoding approach for its simplicity and better scalability over conventional bag-of-visual-word approach. In order to enhance its distinctiveness and geometric invariance, covariant VLAD (CVLAD) is proposed to pool local features based on their dominant orientations/characteristic scales, which leads to a geometric-aware representation. This representation achieves rotation/scale invariance when being associated with circular matching. However, the circular matching induces several times of computation overhead, which makes CVLAD hardly suitable for large-scale retrieval tasks. In this paper, the issue of computation overhead is alleviated by performing the circular matching in CVLAD's frequency domain. In addition, by operating PCA on CVLAD in its frequency domain, much better scalability is achieved than when it is undertaken in the original feature space. Furthermore, the high-dimensional CVLAD subvectors are converted to dozens of very low-dimensional subvectors, which is possible when transforming the feature into its frequency domain. Nearest neighbor search is therefore undertaken on very low-dimensional subspaces, which becomes easily tractable. The effectiveness of our approach is demonstrated in the retrieval scenario on popular benchmarks comprising up to 1 million database images.
format text
author ZHAO, Wan-Lei
NGO, Chong-wah
WANG, Hanzi
author_facet ZHAO, Wan-Lei
NGO, Chong-wah
WANG, Hanzi
author_sort ZHAO, Wan-Lei
title Fast covariant VLAD for image search
title_short Fast covariant VLAD for image search
title_full Fast covariant VLAD for image search
title_fullStr Fast covariant VLAD for image search
title_full_unstemmed Fast covariant VLAD for image search
title_sort fast covariant vlad for image search
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/6310
https://ink.library.smu.edu.sg/context/sis_research/article/7313/viewcontent/07499824.pdf
_version_ 1770575932084453376