Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description

Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometri...

Full description

Saved in:
Bibliographic Details
Main Authors: Bandara, Ravimal, Ranathunga, Lochandaka, Abdullah, Nor Aniza
Format: Article
Published: Springer 2021
Subjects:
Online Access:http://eprints.um.edu.my/26950/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaya
id my.um.eprints.26950
record_format eprints
spelling my.um.eprints.269502022-04-14T01:31:08Z http://eprints.um.edu.my/26950/ Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description Bandara, Ravimal Ranathunga, Lochandaka Abdullah, Nor Aniza QA75 Electronic computers. Computer science Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometrical variances which is crucial for accurate correspondence matching. To address this problem, we proposed a new learning approach using a lightweight DNN architecture via a stack of multiple multilayer perceptrons based on the network in network (NIN) architecture, and a restricted Boltzmann machine (RBM). The latter is used for mapping the features to binary codes, and carry out the geometrically invariant correspondence matching task. Our experimental results on several benchmark datasets (e.g., Brown, Oxford, Paris, INRIA Holidays, RomePatches, HPatches, and CIFAR-10) show that the proposed approach produces the learned binary descriptor that outperforms other baseline self-supervised binary descriptors in terms of correspondence matching despite the smaller size of its DNN. Most importantly, the proposed approach does not freeze the features that are obtained while pre-training the NIN model. Instead, it fine-tunes the features while learning the features needed for binary mapping through the RBM. Additionally, its lightweight architecture makes it suitable for resource-constrained devices. Springer 2021-02 Article PeerReviewed Bandara, Ravimal and Ranathunga, Lochandaka and Abdullah, Nor Aniza (2021) Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description. Visual Computer, 37 (2). pp. 275-290. ISSN 0178-2789, DOI https://doi.org/10.1007/s00371-020-01798-5 <https://doi.org/10.1007/s00371-020-01798-5>. 10.1007/s00371-020-01798-5
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Bandara, Ravimal
Ranathunga, Lochandaka
Abdullah, Nor Aniza
Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
description Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometrical variances which is crucial for accurate correspondence matching. To address this problem, we proposed a new learning approach using a lightweight DNN architecture via a stack of multiple multilayer perceptrons based on the network in network (NIN) architecture, and a restricted Boltzmann machine (RBM). The latter is used for mapping the features to binary codes, and carry out the geometrically invariant correspondence matching task. Our experimental results on several benchmark datasets (e.g., Brown, Oxford, Paris, INRIA Holidays, RomePatches, HPatches, and CIFAR-10) show that the proposed approach produces the learned binary descriptor that outperforms other baseline self-supervised binary descriptors in terms of correspondence matching despite the smaller size of its DNN. Most importantly, the proposed approach does not freeze the features that are obtained while pre-training the NIN model. Instead, it fine-tunes the features while learning the features needed for binary mapping through the RBM. Additionally, its lightweight architecture makes it suitable for resource-constrained devices.
format Article
author Bandara, Ravimal
Ranathunga, Lochandaka
Abdullah, Nor Aniza
author_facet Bandara, Ravimal
Ranathunga, Lochandaka
Abdullah, Nor Aniza
author_sort Bandara, Ravimal
title Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
title_short Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
title_full Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
title_fullStr Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
title_full_unstemmed Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
title_sort deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description
publisher Springer
publishDate 2021
url http://eprints.um.edu.my/26950/
_version_ 1735409479854325760