Deep learning based people detection using 3D point cloud
With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2018
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/74956 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-74956 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-749562023-07-07T17:19:26Z Deep learning based people detection using 3D point cloud Tan, Kye Min Teoh Eam Khwang School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the algorithms responsible for detection, tracking and guidance must be robust. Deep learning is a recent field of artificial intelligence which potentially provides such features. By harnessing large amounts of computational power and datasets, deep learning systems can achieve significantly better performance in computer vision tasks such as classification and detection compared to previous methods. The usage of 3D point cloud data allows spatial information to be obtained while overcoming adverse conditions such as poor illumination and complex texture information. This project combines the advantages of deep learning methods and 3D point cloud data to perform people detection, which is a task required of mobile service robots. Depth images from the Microsoft Kinect sensors are converted into 3D point cloud form before being used to train an advanced network known as DenseNet for the task of detecting the presence of people. DenseNet was chosen due to its very deep architecture which allows high performance while its dense connections mitigate the risk of the model overfitting on the limited data available. By training DenseNet on the Darknet framework, it is qualitatively shown that DenseNet can perform better than networks like You Only Look Once (YOLO) on new data while being sufficiently fast to process images in real time. Bachelor of Engineering 2018-05-25T05:02:26Z 2018-05-25T05:02:26Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74956 en Nanyang Technological University 96 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Tan, Kye Min Deep learning based people detection using 3D point cloud |
description |
With the advancement of computational devices and 3D sensor technology, it has become increasingly viable to develop a highly accurate detection system within the constraints of a mobile service robot. As such robots need to navigate in unfamiliar environments with less than optimal conditions, the algorithms responsible for detection, tracking and guidance must be robust. Deep learning is a recent field of artificial intelligence which potentially provides such features. By harnessing large amounts of computational power and datasets, deep learning systems can achieve significantly better performance in computer vision tasks such as classification and detection compared to previous methods. The usage of 3D point cloud data allows spatial information to be obtained while overcoming adverse conditions such as poor illumination and complex texture information.
This project combines the advantages of deep learning methods and 3D point cloud data to perform people detection, which is a task required of mobile service robots. Depth images from the Microsoft Kinect sensors are converted into 3D point cloud form before being used to train an advanced network known as DenseNet for the task of detecting the presence of people. DenseNet was chosen due to its very deep architecture which allows high performance while its dense connections mitigate the risk of the model overfitting on the limited data available.
By training DenseNet on the Darknet framework, it is qualitatively shown that DenseNet can perform better than networks like You Only Look Once (YOLO) on new data while being sufficiently fast to process images in real time. |
author2 |
Teoh Eam Khwang |
author_facet |
Teoh Eam Khwang Tan, Kye Min |
format |
Final Year Project |
author |
Tan, Kye Min |
author_sort |
Tan, Kye Min |
title |
Deep learning based people detection using 3D point cloud |
title_short |
Deep learning based people detection using 3D point cloud |
title_full |
Deep learning based people detection using 3D point cloud |
title_fullStr |
Deep learning based people detection using 3D point cloud |
title_full_unstemmed |
Deep learning based people detection using 3D point cloud |
title_sort |
deep learning based people detection using 3d point cloud |
publishDate |
2018 |
url |
http://hdl.handle.net/10356/74956 |
_version_ |
1772825542428459008 |