Investigation and simulation of transfer reinforcement learning-based for robotic manipulation

Reinforcement learning is a process of investigating the interaction between agents and the environment, making sequential decisions, optimizing policies and maximizing cumulative returns. Reinforcement learning has great research value and application potential, which is a key step to realize gener...

Full description

Saved in:

Bibliographic Details
Main Author:	Zhang, Mengxia
Other Authors:	Soong Boon Hee
Format:	Thesis-Master by Coursework
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/155421
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-155421
record_format	dspace
spelling	sg-ntu-dr.10356-1554212023-07-04T17:43:13Z Investigation and simulation of transfer reinforcement learning-based for robotic manipulation Zhang, Mengxia Soong Boon Hee School of Electrical and Electronic Engineering EBHSOONG@ntu.edu.sg Engineering::Electrical and electronic engineering Reinforcement learning is a process of investigating the interaction between agents and the environment, making sequential decisions, optimizing policies and maximizing cumulative returns. Reinforcement learning has great research value and application potential, which is a key step to realize general artificial intelligence. This project introduces the principles and methods of reinforcement learning. The DRL algorithms based on Actor-Critic framework and HRL algorithm based on Option-Critic framework are verified and compared in Mujoco and RLBench robot simulation environments to complete complex robot tasks. The robot tasks using Mujoco as the back-end physical engine of the robot simulator are mainly low dimensional tasks with discrete inputs, including Humanoid, Hopper, HalfCheetah and Ant. In RLBench robot simulation environment, robot tasks are mainly high-dimensional tasks, whose inputs are images, including Open Box, Close Box, Pick Up Cup. In low dimensional robotic tasks, the on-policy algorithm is far less efficient in data utilization than the off-policy algorithms that learn from experience replay. For the three off-policy algorithms, DDPG is far less effective than TD3 and SAC. Due to the lack of exploration ability of deterministic policy, the training variance of TD3 is large compared with stochastic policy algorithm SAC. From the convergence speed of reward, SAC has the best performance. For high dimensional robotic tasks, only Option-Critic algorithm can solve Open Box and Close Box task. Due to the high memory limit, the off-policy algorithms can not be well implemented when retaining images in experience replay, so the agent cannot learn well from experience replay. Because the agent cannot use random exploration to obtain sparse reward signals to solve the task, no algorithm can solve more complex operation tasks, such as Pick Up Cup. Master of Science (Communications Engineering) 2022-02-23T04:54:37Z 2022-02-23T04:54:37Z 2021 Thesis-Master by Coursework Zhang, M. (2021). Investigation and simulation of transfer reinforcement learning-based for robotic manipulation. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/155421 https://hdl.handle.net/10356/155421 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Zhang, Mengxia Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
description	Reinforcement learning is a process of investigating the interaction between agents and the environment, making sequential decisions, optimizing policies and maximizing cumulative returns. Reinforcement learning has great research value and application potential, which is a key step to realize general artificial intelligence. This project introduces the principles and methods of reinforcement learning. The DRL algorithms based on Actor-Critic framework and HRL algorithm based on Option-Critic framework are verified and compared in Mujoco and RLBench robot simulation environments to complete complex robot tasks. The robot tasks using Mujoco as the back-end physical engine of the robot simulator are mainly low dimensional tasks with discrete inputs, including Humanoid, Hopper, HalfCheetah and Ant. In RLBench robot simulation environment, robot tasks are mainly high-dimensional tasks, whose inputs are images, including Open Box, Close Box, Pick Up Cup. In low dimensional robotic tasks, the on-policy algorithm is far less efficient in data utilization than the off-policy algorithms that learn from experience replay. For the three off-policy algorithms, DDPG is far less effective than TD3 and SAC. Due to the lack of exploration ability of deterministic policy, the training variance of TD3 is large compared with stochastic policy algorithm SAC. From the convergence speed of reward, SAC has the best performance. For high dimensional robotic tasks, only Option-Critic algorithm can solve Open Box and Close Box task. Due to the high memory limit, the off-policy algorithms can not be well implemented when retaining images in experience replay, so the agent cannot learn well from experience replay. Because the agent cannot use random exploration to obtain sparse reward signals to solve the task, no algorithm can solve more complex operation tasks, such as Pick Up Cup.
author2	Soong Boon Hee
author_facet	Soong Boon Hee Zhang, Mengxia
format	Thesis-Master by Coursework
author	Zhang, Mengxia
author_sort	Zhang, Mengxia
title	Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
title_short	Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
title_full	Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
title_fullStr	Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
title_full_unstemmed	Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
title_sort	investigation and simulation of transfer reinforcement learning-based for robotic manipulation
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/155421
_version_	1772826534260768768

Investigation and simulation of transfer reinforcement learning-based for robotic manipulation

Similar Items