Enhancing place recognition with deep convolutional neural network using bag-of-visual-words

This Computer Vision (CV) project has developed an unsupervised Convolutional Neural Network (CNN) solution for enhanced Visual Place Recognition (VPR), with the usage of Bag-of-Visual-Words (BoVW) for automatic image clustering. BoVW enables automatic generation of image clusters and automatic labe...

全面介紹

Saved in:
書目詳細資料
主要作者: Soh, Wei Xin
其他作者: Li Hua
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2021
主題:
在線閱讀:https://hdl.handle.net/10356/150823
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:This Computer Vision (CV) project has developed an unsupervised Convolutional Neural Network (CNN) solution for enhanced Visual Place Recognition (VPR), with the usage of Bag-of-Visual-Words (BoVW) for automatic image clustering. BoVW enables automatic generation of image clusters and automatic labelling of training data. These labelled image clusters can be used as input data to train CNN models for VPR. Extraction of image frames was performed from videos of the public dataset, which were subsequently used to automatically generate image clusters. This proved more efficient than most well-known deep learning methods which often required time-consuming manual labelling, especially for extremely large quantities of images. Experiments were conducted on a public dataset to validate that the proposed solution was able to achieve better recognition performance compared to the traditional BoVW approach. This project can potentially be applied to the Advanced Remanufacturing & Technology Centre (ARTC) production shopfloor in various aspects such as Automated Guided Vehicle (AGV) localization with the proposed unsupervised deep learning solution.