SEAPoT-RL: Selective exploration algorithm for policy transfer in RL

We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or...

Full description

Saved in:
Bibliographic Details
Main Authors: NARAYAN, Akshay, LI, Zhuoru, LEONG, Tze-Yun
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3762
https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4764
record_format dspace
spelling sg-smu-ink.sis_research-47642020-03-25T03:20:46Z SEAPoT-RL: Selective exploration algorithm for policy transfer in RL NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or obstacles, such as furniture. We address problems that require fast responses adapted from incomplete, prior knowledge of the agent in new scenarios. We present an efficient selective exploration strategy that maximally reuses the source task policy. Reuse efficiency is effected through identifying sub-spaces that are different in the target environment, thus limiting the exploration needed in the target task. We empirically show that SEAPoT performs better in terms of jump starts and cumulative average rewards, as compared to existing state-of-the-art policy reuse methods. 2017-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Transfer learning
policy transfer
reinforcement learning
Numerical Analysis and Scientific Computing
Theory and Algorithms
spellingShingle Transfer learning
policy transfer
reinforcement learning
Numerical Analysis and Scientific Computing
Theory and Algorithms
NARAYAN, Akshay
LI, Zhuoru
LEONG, Tze-Yun
SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
description We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or obstacles, such as furniture. We address problems that require fast responses adapted from incomplete, prior knowledge of the agent in new scenarios. We present an efficient selective exploration strategy that maximally reuses the source task policy. Reuse efficiency is effected through identifying sub-spaces that are different in the target environment, thus limiting the exploration needed in the target task. We empirically show that SEAPoT performs better in terms of jump starts and cumulative average rewards, as compared to existing state-of-the-art policy reuse methods.
format text
author NARAYAN, Akshay
LI, Zhuoru
LEONG, Tze-Yun
author_facet NARAYAN, Akshay
LI, Zhuoru
LEONG, Tze-Yun
author_sort NARAYAN, Akshay
title SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_short SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_full SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_fullStr SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_full_unstemmed SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_sort seapot-rl: selective exploration algorithm for policy transfer in rl
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/3762
https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf
_version_ 1770573714733137920