Visual localization on NTU campus
This final year report presents research and development of a Visual Place Recognition (VPR) system whose innovation lies in leveraging semantic information to enhance image and information retrieval accuracy. VPR is a crucial task in computer vision and robotics, with the goal of localizing the ‘us...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171924 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-171924 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1719242023-11-17T15:37:23Z Visual localization on NTU campus Chen, Wei May Lin Weisi School of Computer Science and Engineering WSLin@ntu.edu.sg Engineering This final year report presents research and development of a Visual Place Recognition (VPR) system whose innovation lies in leveraging semantic information to enhance image and information retrieval accuracy. VPR is a crucial task in computer vision and robotics, with the goal of localizing the ‘user’ of the VPR system to enable machines to understand and navigate through environments accurately. Fairly accurate image retrieval has been attainable with NetVLAD image representation. However, the presence of dynamic objects in the visual scene as well as other varying perspective and lighting conditions pose challenges to the accurate image matching of physical locations. The project explores the integration of semantic labels generated from a segmentation model to improve the image retrieval model’s performance. The main novelty of this project lies in its leverage of semantic labels to assign cluster centroids with semantic labels in order to facilitate the filtering out of dynamic objects features to enable the model to concentrate its attention and learning on the relevant, useful features without being distracted by the noisy confusing features. The experiment results demonstrate a significant improvement in the image matching accuracy, hence providing support for establishing the effectiveness of this approach. Bachelor of Science in Data Science and Artificial Intelligence 2023-11-16T07:25:43Z 2023-11-16T07:25:43Z 2023 Final Year Project (FYP) Chen, W. M. (2023). Visual localization on NTU campus. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171924 https://hdl.handle.net/10356/171924 en SCSE22-0804 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering |
spellingShingle |
Engineering Chen, Wei May Visual localization on NTU campus |
description |
This final year report presents research and development of a Visual Place Recognition (VPR) system whose innovation lies in leveraging semantic information to enhance image and information retrieval accuracy. VPR is a crucial task in computer vision and robotics, with the goal of localizing the ‘user’ of the VPR system to enable machines to understand and navigate through environments accurately. Fairly accurate image retrieval has been attainable with NetVLAD image representation. However, the presence of dynamic objects in the visual scene as well as other varying perspective and lighting conditions pose challenges to the accurate image matching of physical locations. The project explores the integration of semantic labels generated from a segmentation model to improve the image retrieval model’s performance. The main novelty of this project lies in its leverage of semantic labels to assign cluster centroids with semantic labels in order to facilitate the filtering out of dynamic objects features to enable the model to concentrate its attention and learning on the relevant, useful features without being distracted by the noisy confusing features. The experiment results demonstrate a significant improvement in the image matching accuracy, hence providing support for establishing the effectiveness of this approach. |
author2 |
Lin Weisi |
author_facet |
Lin Weisi Chen, Wei May |
format |
Final Year Project |
author |
Chen, Wei May |
author_sort |
Chen, Wei May |
title |
Visual localization on NTU campus |
title_short |
Visual localization on NTU campus |
title_full |
Visual localization on NTU campus |
title_fullStr |
Visual localization on NTU campus |
title_full_unstemmed |
Visual localization on NTU campus |
title_sort |
visual localization on ntu campus |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/171924 |
_version_ |
1783955541432729600 |