Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception

Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abst...

Full description

Saved in:
Bibliographic Details
Main Authors: Wang, Li, Li, Ruifeng, Shi, Hezi, Sun, Jingwen, Zhao, Lijun, Tandianus, Budianto, Seah, Hock Soon, Quah, Chee Kwang
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/10356/105814
http://hdl.handle.net/10220/48782
http://dx.doi.org/10.3390/s19040893
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-105814
record_format dspace
spelling sg-ntu-dr.10356-1058142019-12-06T21:58:29Z Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception Wang, Li Li, Ruifeng Shi, Hezi Sun, Jingwen Zhao, Lijun Tandianus, Budianto Seah, Hock Soon Quah, Chee Kwang School of Computer Science and Engineering School of Electrical and Electronic Engineering ST Engineering-NTU Corporate Laboratory Multi-Channel CNN 3D Object Detection DRNTU::Engineering::Computer science and engineering Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot. NRF (Natl Research Foundation, S’pore) Published version 2019-06-14T08:11:06Z 2019-12-06T21:58:29Z 2019-06-14T08:11:06Z 2019-12-06T21:58:29Z 2019 Journal Article Wang, L., Li, R., Shi, H., Sun, J., Zhao, L., Seah, H. S., . . . Tandianus, B. (2019). Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception. Sensors, 19(4), 893-. doi:10.3390/s19040893 1424-8220 https://hdl.handle.net/10356/105814 http://hdl.handle.net/10220/48782 http://dx.doi.org/10.3390/s19040893 en Sensors © 2019 The Authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). 14 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Multi-Channel CNN
3D Object Detection
DRNTU::Engineering::Computer science and engineering
spellingShingle Multi-Channel CNN
3D Object Detection
DRNTU::Engineering::Computer science and engineering
Wang, Li
Li, Ruifeng
Shi, Hezi
Sun, Jingwen
Zhao, Lijun
Tandianus, Budianto
Seah, Hock Soon
Quah, Chee Kwang
Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception
description Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Wang, Li
Li, Ruifeng
Shi, Hezi
Sun, Jingwen
Zhao, Lijun
Tandianus, Budianto
Seah, Hock Soon
Quah, Chee Kwang
format Article
author Wang, Li
Li, Ruifeng
Shi, Hezi
Sun, Jingwen
Zhao, Lijun
Tandianus, Budianto
Seah, Hock Soon
Quah, Chee Kwang
author_sort Wang, Li
title Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception
title_short Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception
title_full Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception
title_fullStr Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception
title_full_unstemmed Multi-channel convolutional neural network based 3D object detection for indoor robot environmental perception
title_sort multi-channel convolutional neural network based 3d object detection for indoor robot environmental perception
publishDate 2019
url https://hdl.handle.net/10356/105814
http://hdl.handle.net/10220/48782
http://dx.doi.org/10.3390/s19040893
_version_ 1681043328582287360