Investigation and simulation of transfer reinforcement learning-based for robotic manipulation

Reinforcement learning is a process of investigating the interaction between agents and the environment, making sequential decisions, optimizing policies and maximizing cumulative returns. Reinforcement learning has great research value and application potential, which is a key step to realize gener...

Bibliographic Details
Main Author: Zhang, Mengxia
Other Authors: Soong Boon Hee
Format: Thesis-Master by Coursework
Language: English
Published: Nanyang Technological University 2022
Subjects: Engineering::Electrical and electronic engineering
Online Access: https://hdl.handle.net/10356/155421
Institution: Nanyang Technological University
id sg-ntu-dr.10356-155421
record_format dspace
spelling sg-ntu-dr.10356-155421 2023-07-04T17:43:13Z
Zhang, Mengxia
Soong Boon Hee
School of Electrical and Electronic Engineering
EBHSOONG@ntu.edu.sg
Master of Science (Communications Engineering)
2022-02-23T04:54:37Z 2022-02-23T04:54:37Z 2021
Zhang, M. (2021). Investigation and simulation of transfer reinforcement learning-based for robotic manipulation. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/155421
en application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
description Reinforcement learning studies how an agent interacts with its environment, makes sequential decisions, optimizes its policy and maximizes cumulative return. It has great research value and application potential, and is a key step toward realizing general artificial intelligence. This project introduces the principles and methods of reinforcement learning. DRL algorithms based on the Actor-Critic framework and an HRL algorithm based on the Option-Critic framework are verified and compared in the MuJoCo and RLBench robot simulation environments on complex robot tasks. The robot tasks that use MuJoCo as the back-end physics engine of the robot simulator are mainly low-dimensional tasks with discrete inputs, including Humanoid, Hopper, HalfCheetah and Ant. In the RLBench robot simulation environment, the robot tasks are mainly high-dimensional tasks whose inputs are images, including Open Box, Close Box and Pick Up Cup. In the low-dimensional robotic tasks, the on-policy algorithm uses data far less efficiently than the off-policy algorithms, which learn from experience replay. Among the three off-policy algorithms, DDPG is far less effective than TD3 and SAC. Because a deterministic policy lacks exploration ability, the training variance of TD3 is large compared with that of the stochastic-policy algorithm SAC. In terms of reward convergence speed, SAC performs best. For the high-dimensional robotic tasks, only the Option-Critic algorithm can solve the Open Box and Close Box tasks. Because retaining images in experience replay demands a large amount of memory, the off-policy algorithms cannot be implemented well, so the agent cannot learn well from experience replay. Because the agent cannot obtain sparse reward signals through random exploration, no algorithm can solve more complex manipulation tasks such as Pick Up Cup.
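The memory argument in the description can be illustrated with a rough calculation. The sketch below is not taken from the thesis: the buffer capacity, the 128x128 RGB image size and the 17-element state vector are hypothetical values chosen only to show the order of magnitude, and the helper name replay_buffer_bytes is an assumption of this example.

```python
def replay_buffer_bytes(capacity, obs_shape, bytes_per_element=1, store_next_obs=True):
    """Rough memory needed to hold only the observations of a replay buffer."""
    elements = 1
    for dim in obs_shape:
        elements *= dim
    # Each transition usually keeps the observation and the next observation.
    copies = 2 if store_next_obs else 1
    return capacity * copies * elements * bytes_per_element


GIB = 1024 ** 3

# Hypothetical image task: one million transitions of 128x128 RGB frames (uint8).
image_bytes = replay_buffer_bytes(1_000_000, (128, 128, 3), bytes_per_element=1)

# Hypothetical low-dimensional task: one million transitions of a 17-element float32 state.
vector_bytes = replay_buffer_bytes(1_000_000, (17,), bytes_per_element=4)

print(f"image buffer:  {image_bytes / GIB:.1f} GiB")
print(f"vector buffer: {vector_bytes / GIB:.3f} GiB")
```

Under these assumed sizes the image buffer is several hundred times larger than the vector buffer, which is consistent with the description's remark that keeping images in experience replay is hard to do within a memory budget.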
author2 Soong Boon Hee
format Thesis-Master by Coursework
author Zhang, Mengxia
title Investigation and simulation of transfer reinforcement learning-based for robotic manipulation
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/155421
_version_ 1772826534260768768