Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometri...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Published: |
Springer
2021
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/26950/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaya |
id |
my.um.eprints.26950 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.269502022-04-14T01:31:08Z http://eprints.um.edu.my/26950/ Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description Bandara, Ravimal Ranathunga, Lochandaka Abdullah, Nor Aniza QA75 Electronic computers. Computer science Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometrical variances which is crucial for accurate correspondence matching. To address this problem, we proposed a new learning approach using a lightweight DNN architecture via a stack of multiple multilayer perceptrons based on the network in network (NIN) architecture, and a restricted Boltzmann machine (RBM). The latter is used for mapping the features to binary codes, and carry out the geometrically invariant correspondence matching task. Our experimental results on several benchmark datasets (e.g., Brown, Oxford, Paris, INRIA Holidays, RomePatches, HPatches, and CIFAR-10) show that the proposed approach produces the learned binary descriptor that outperforms other baseline self-supervised binary descriptors in terms of correspondence matching despite the smaller size of its DNN. Most importantly, the proposed approach does not freeze the features that are obtained while pre-training the NIN model. Instead, it fine-tunes the features while learning the features needed for binary mapping through the RBM. Additionally, its lightweight architecture makes it suitable for resource-constrained devices. Springer 2021-02 Article PeerReviewed Bandara, Ravimal and Ranathunga, Lochandaka and Abdullah, Nor Aniza (2021) Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description. Visual Computer, 37 (2). pp. 275-290. ISSN 0178-2789, DOI https://doi.org/10.1007/s00371-020-01798-5 <https://doi.org/10.1007/s00371-020-01798-5>. 10.1007/s00371-020-01798-5 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Bandara, Ravimal Ranathunga, Lochandaka Abdullah, Nor Aniza Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
description |
Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometrical variances which is crucial for accurate correspondence matching. To address this problem, we proposed a new learning approach using a lightweight DNN architecture via a stack of multiple multilayer perceptrons based on the network in network (NIN) architecture, and a restricted Boltzmann machine (RBM). The latter is used for mapping the features to binary codes, and carry out the geometrically invariant correspondence matching task. Our experimental results on several benchmark datasets (e.g., Brown, Oxford, Paris, INRIA Holidays, RomePatches, HPatches, and CIFAR-10) show that the proposed approach produces the learned binary descriptor that outperforms other baseline self-supervised binary descriptors in terms of correspondence matching despite the smaller size of its DNN. Most importantly, the proposed approach does not freeze the features that are obtained while pre-training the NIN model. Instead, it fine-tunes the features while learning the features needed for binary mapping through the RBM. Additionally, its lightweight architecture makes it suitable for resource-constrained devices. |
format |
Article |
author |
Bandara, Ravimal Ranathunga, Lochandaka Abdullah, Nor Aniza |
author_facet |
Bandara, Ravimal Ranathunga, Lochandaka Abdullah, Nor Aniza |
author_sort |
Bandara, Ravimal |
title |
Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
title_short |
Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
title_full |
Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
title_fullStr |
Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
title_full_unstemmed |
Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
title_sort |
deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description |
publisher |
Springer |
publishDate |
2021 |
url |
http://eprints.um.edu.my/26950/ |
_version_ |
1735409479854325760 |