Learning descriptors for sequence-based hierarchical place recognition
Visual place recognition aims at making unmanned vehicles recognize a revisit place of their exact location and returning reasonable query information. Most researchers regard this kind of problem as an image retrieval task. There are mainly two categories in this task: the hand-crafted feature extr...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/157911 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-157911 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1579112023-07-04T17:50:16Z Learning descriptors for sequence-based hierarchical place recognition Lan, Xin Xie Lihua School of Electrical and Electronic Engineering ELHXIE@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Visual place recognition aims at making unmanned vehicles recognize a revisit place of their exact location and returning reasonable query information. Most researchers regard this kind of problem as an image retrieval task. There are mainly two categories in this task: the hand-crafted feature extraction method and the learning-based feature representation method. This dissertation will focus on the latter. In this dissertation, with the core of Vector of Locally Aggregated Descriptor (VLAD) part in NetVLAD, the features are represented as vectors. To use sequential information embedded in image series, a temporal convolution part is added to get a layer of sequential descriptors, which generate top K candi- dates for further similarity check with single descriptors of the same sequences of images. In analyzing phase, the dissertation compares different backbones of VGG-16 and AlexNet to select a satisfying CNN-based feature extractor. Also, a comparison of single and sequential descriptors is performed through Oxford, Nordland and Pittsburgh 250k that have different characteristics. The result shows that the sequential model with the hierarchical structure possesses greater performance facing changing lights and view angles, the best recall@20 is over 0.96. At last, several possible future works are listed like the efficiency of algorithms, the improved structure of models and variance-based robustness improvements. Master of Science (Computer Control and Automation) 2022-05-13T08:13:30Z 2022-05-13T08:13:30Z 2022 Thesis-Master by Coursework Lan, X. (2022). Learning descriptors for sequence-based hierarchical place recognition. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/157911 https://hdl.handle.net/10356/157911 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Lan, Xin Learning descriptors for sequence-based hierarchical place recognition |
description |
Visual place recognition aims at making unmanned vehicles recognize a revisit place of their exact location and returning reasonable query information. Most researchers regard this kind of problem as an image retrieval task. There are mainly two categories in this task: the hand-crafted feature extraction method and the learning-based feature representation method. This dissertation will focus on the latter.
In this dissertation, with the core of Vector of Locally Aggregated Descriptor (VLAD) part in NetVLAD, the features are represented as vectors. To use sequential information embedded in image series, a temporal convolution part is added to get a layer of sequential descriptors, which generate top K candi- dates for further similarity check with single descriptors of the same sequences of images. In analyzing phase, the dissertation compares different backbones of VGG-16 and AlexNet to select a satisfying CNN-based feature extractor. Also, a comparison of single and sequential descriptors is performed through Oxford, Nordland and Pittsburgh 250k that have different characteristics. The result shows that the sequential model with the hierarchical structure possesses greater performance facing changing lights and view angles, the best recall@20 is over 0.96. At last, several possible future works are listed like the efficiency of algorithms, the improved structure of models and variance-based robustness improvements. |
author2 |
Xie Lihua |
author_facet |
Xie Lihua Lan, Xin |
format |
Thesis-Master by Coursework |
author |
Lan, Xin |
author_sort |
Lan, Xin |
title |
Learning descriptors for sequence-based hierarchical place recognition |
title_short |
Learning descriptors for sequence-based hierarchical place recognition |
title_full |
Learning descriptors for sequence-based hierarchical place recognition |
title_fullStr |
Learning descriptors for sequence-based hierarchical place recognition |
title_full_unstemmed |
Learning descriptors for sequence-based hierarchical place recognition |
title_sort |
learning descriptors for sequence-based hierarchical place recognition |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/157911 |
_version_ |
1772825320384102400 |