Deep object affordance learning for mobile robot applications

Deep learning is a subset of artificial intelligence that uses artificial neural networks that can learn and make decisions on their own. Numerous deep learning frameworks have been developed for object detection and classification. However, for mobile robots to work autonomously or col...

Bibliographic Details
Main Author: Teh, Han Wei
Other Authors: Teoh Eam Khwang
Format: Final Year Project
Language: English
Published: 2018
Subjects: DRNTU::Engineering
Online Access: http://hdl.handle.net/10356/74841
Institution: Nanyang Technological University
id sg-ntu-dr.10356-74841
record_format dspace
spelling sg-ntu-dr.10356-74841 2023-07-07T15:55:23Z Deep object affordance learning for mobile robot applications Teh, Han Wei Teoh Eam Khwang School of Electrical and Electronic Engineering DRNTU::Engineering Bachelor of Engineering 2018-05-24T05:44:45Z 2018-05-24T05:44:45Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74841 en Nanyang Technological University 88 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
description Deep learning is a subset of artificial intelligence that uses artificial neural networks that can learn and make decisions on their own. Numerous deep learning frameworks have been developed for object detection and classification. However, for mobile robots to work autonomously or collaborate with humans in daily workspaces, they should be able to recognize object affordances rather than merely identify objects. In this study, affordances are defined as the possible functions of the parts of a tool. Various methods for object affordance detection have been presented over the years, and most previous works relied on hand-designed geometric features to localize and identify object affordances. Deep learning has since become the state-of-the-art approach, having gained popularity for its ability to handle large amounts of data and to learn deep features automatically. This project aimed to develop an affordance detection system with higher accuracy than existing ones. The proposed method uses a deep learning approach to semantic segmentation to detect object affordances from RGB images. The input data are RGB images, which may represent multiple modalities, allowing the network to learn features more effectively during training. SegNet was chosen for implementation because it is among the most memory-efficient deep neural networks for segmentation. The dataset used in this project contains a diverse collection of everyday tools, such as knives and hammers, and the 7 affordances associated with these tools’ parts are grasp, cut, scoop, contain, pound, support and wrap-grasp. The trained model was validated through the inference process. The affordance detection system achieved an accuracy of 78.4%, which is about 2% higher than the existing system trained using a different deep learning architecture.
author2 Teoh Eam Khwang
format Final Year Project
author Teh, Han Wei
title Deep object affordance learning for mobile robot applications
publishDate 2018
url http://hdl.handle.net/10356/74841
_version_ 1772825484797673472