Image-similarity-based convolutional neural network for robot visual relocalization
Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precisio...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/147613 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-147613 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1476132021-04-13T08:24:29Z Image-similarity-based convolutional neural network for robot visual relocalization Wang, Li Li, Ruifeng Sun, Jingwen Seah, Hock Soon Quah, Chee Kwang Zhao, Lijun Tandianus, Budianto School of Computer Science and Engineering Engineering::Computer science and engineering Visual Relocalization CNN Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precision becomes low when the test image is dissimilar to training images. In this paper, we propose a novel method, named image-similarity-based CNN, which considers the image similarity of an input image during the CNN training. The higher the similarity of the input image, the higher precision we can achieve. Therefore, we crop the input image into several small image blocks, and the similarity between each cropped image block and training dataset images is measured by employing a feature vector in a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is utilized to determine the cropped position. Experiments on both open-source dataset 7-Scenes and two actual indoor environments are conducted. The results show that the proposed algorithm leads to better results and reduces large regression errors effectively compared with existing solutions. National Research Foundation (NRF) Published version This work was supported by the National Key Research and Development Program “Intelligent Robot” Key Special Project (2018YFB1308900), the National Natural Science Foundation of China (61673136), the Self-Planned Task of State Key Laboratory of Robotics and System (HIT) (Nos. SKLRS201906B and SKLRS201715A), the Foundation for Innovative Research Groups of the National Natural Science Foundation of China (No. 51521003), ST Engineering-NTU Corporate Lab through the NRF corporate lab@university scheme, and the China Scholarship Council (No. 201706120137). 2021-04-13T08:24:29Z 2021-04-13T08:24:29Z 2020 Journal Article Wang, L., Li, R., Sun, J., Seah, H. S., Quah, C. K., Zhao, L. & Tandianus, B. (2020). Image-similarity-based convolutional neural network for robot visual relocalization. Sensors and Materials, 32(4), 1245-1259. https://dx.doi.org/10.18494/SAM.2020.2549 0914-4935 https://hdl.handle.net/10356/147613 10.18494/SAM.2020.2549 2-s2.0-85084051492 4 32 1245 1259 en Sensors and Materials © 2020 MYU K.K. This work is licensed under a Creative Commons Attribution 4.0 International License. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Visual Relocalization CNN |
spellingShingle |
Engineering::Computer science and engineering Visual Relocalization CNN Wang, Li Li, Ruifeng Sun, Jingwen Seah, Hock Soon Quah, Chee Kwang Zhao, Lijun Tandianus, Budianto Image-similarity-based convolutional neural network for robot visual relocalization |
description |
Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precision becomes low when the test image is dissimilar to training images. In this paper, we propose a novel method, named image-similarity-based CNN, which considers the image similarity of an input image during the CNN training. The higher the similarity of the input image, the higher precision we can achieve. Therefore, we crop the input image into several small image blocks, and the similarity between each cropped image block and training dataset images is measured by employing a feature vector in a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is utilized to determine the cropped position. Experiments on both open-source dataset 7-Scenes and two actual indoor environments are conducted. The results show that the proposed algorithm leads to better results and reduces large regression errors effectively compared with existing solutions. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Wang, Li Li, Ruifeng Sun, Jingwen Seah, Hock Soon Quah, Chee Kwang Zhao, Lijun Tandianus, Budianto |
format |
Article |
author |
Wang, Li Li, Ruifeng Sun, Jingwen Seah, Hock Soon Quah, Chee Kwang Zhao, Lijun Tandianus, Budianto |
author_sort |
Wang, Li |
title |
Image-similarity-based convolutional neural network for robot visual relocalization |
title_short |
Image-similarity-based convolutional neural network for robot visual relocalization |
title_full |
Image-similarity-based convolutional neural network for robot visual relocalization |
title_fullStr |
Image-similarity-based convolutional neural network for robot visual relocalization |
title_full_unstemmed |
Image-similarity-based convolutional neural network for robot visual relocalization |
title_sort |
image-similarity-based convolutional neural network for robot visual relocalization |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/147613 |
_version_ |
1698713699454287872 |