Enhancing place recognition with deep convolutional neural network using bag-of-visual-words

This Computer Vision (CV) project has developed an unsupervised Convolutional Neural Network (CNN) solution for enhanced Visual Place Recognition (VPR), with the usage of Bag-of-Visual-Words (BoVW) for automatic image clustering. BoVW enables automatic generation of image clusters and automatic labe...

Full description

Saved in:
Bibliographic Details
Main Author: Soh, Wei Xin
Other Authors: Li Hua
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/150823
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:This Computer Vision (CV) project has developed an unsupervised Convolutional Neural Network (CNN) solution for enhanced Visual Place Recognition (VPR), with the usage of Bag-of-Visual-Words (BoVW) for automatic image clustering. BoVW enables automatic generation of image clusters and automatic labelling of training data. These labelled image clusters can be used as input data to train CNN models for VPR. Extraction of image frames was performed from videos of the public dataset, which were subsequently used to automatically generate image clusters. This proved more efficient than most well-known deep learning methods which often required time-consuming manual labelling, especially for extremely large quantities of images. Experiments were conducted on a public dataset to validate that the proposed solution was able to achieve better recognition performance compared to the traditional BoVW approach. This project can potentially be applied to the Advanced Remanufacturing & Technology Centre (ARTC) production shopfloor in various aspects such as Automated Guided Vehicle (AGV) localization with the proposed unsupervised deep learning solution.