Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks

As autonomous vehicles are poised to enter the mainstream in the automobile industry, an important requirement for these platforms is the ability to robustly recognize and react to objects in the real world. This is further compounded by the fact that other autonomous platforms like delivery robots...

Full description

Saved in:

Bibliographic Details
Main Author:	Pangottil Shanoop
Other Authors:	Justin Dauwels
Format:	Theses and Dissertations
Language:	English
Published:	2018
Subjects:	DRNTU::Engineering::Electrical and electronic engineering
Online Access:	http://hdl.handle.net/10356/73132
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-73132
record_format	dspace
spelling	sg-ntu-dr.10356-731322023-07-04T15:05:50Z Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks Pangottil Shanoop Justin Dauwels School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering As autonomous vehicles are poised to enter the mainstream in the automobile industry, an important requirement for these platforms is the ability to robustly recognize and react to objects in the real world. This is further compounded by the fact that other autonomous platforms like delivery robots and industrial collaborative systems would have to actively make decisions based on the visual feedback from their sensors. Range sensors such as LiDAR and RGBD are commonly found sensors in modern robotic platforms, providing a richer dataset than any other single sensor platform. Most of the current algorithms for classification and segmentation do not however use the depth data from the 3D data or employ work arounds, often sacrificing classification performance. This thesis is a study into the classification capabilities of 3D convolutional neural networks and evaluates the performance on a 3D CNN implementation [1] in a publicly available dataset [3] and compares it to the state of the art performance metrics as put forward by [2]. This thesis also attempts to find the optimal grid for a voxelization problem by comparing three approaches as mentioned by [1] and verifies the results put forward by the authors. To study these, a 7-layer 3D convolutional neural network based on [1] is used. Slight modifications of the hyper-parameters to accommodate the new dataset is also discussed in this thesis. Finally, the limitations of 3D CNN networks is discussed and its effect on the results of this thesis and improvements as suggested by [15] are also discussed. Master of Science (Computer Control and Automation) 2018-01-03T07:17:06Z 2018-01-03T07:17:06Z 2018 Thesis http://hdl.handle.net/10356/73132 en 63 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering Pangottil Shanoop Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks
description	As autonomous vehicles are poised to enter the mainstream in the automobile industry, an important requirement for these platforms is the ability to robustly recognize and react to objects in the real world. This is further compounded by the fact that other autonomous platforms like delivery robots and industrial collaborative systems would have to actively make decisions based on the visual feedback from their sensors. Range sensors such as LiDAR and RGBD are commonly found sensors in modern robotic platforms, providing a richer dataset than any other single sensor platform. Most of the current algorithms for classification and segmentation do not however use the depth data from the 3D data or employ work arounds, often sacrificing classification performance. This thesis is a study into the classification capabilities of 3D convolutional neural networks and evaluates the performance on a 3D CNN implementation [1] in a publicly available dataset [3] and compares it to the state of the art performance metrics as put forward by [2]. This thesis also attempts to find the optimal grid for a voxelization problem by comparing three approaches as mentioned by [1] and verifies the results put forward by the authors. To study these, a 7-layer 3D convolutional neural network based on [1] is used. Slight modifications of the hyper-parameters to accommodate the new dataset is also discussed in this thesis. Finally, the limitations of 3D CNN networks is discussed and its effect on the results of this thesis and improvements as suggested by [15] are also discussed.
author2	Justin Dauwels
author_facet	Justin Dauwels Pangottil Shanoop
format	Theses and Dissertations
author	Pangottil Shanoop
author_sort	Pangottil Shanoop
title	Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks
title_short	Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks
title_full	Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks
title_fullStr	Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks
title_full_unstemmed	Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks
title_sort	object classification from segmented lidar input using 3 dimensional convolutional neural networks
publishDate	2018
url	http://hdl.handle.net/10356/73132
_version_	1772827646769496064

Object classification from segmented LiDAR input using 3 dimensional convolutional neural networks

Similar Items