SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2017
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-4764 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-47642020-03-25T03:20:46Z SEAPoT-RL: Selective exploration algorithm for policy transfer in RL NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or obstacles, such as furniture. We address problems that require fast responses adapted from incomplete, prior knowledge of the agent in new scenarios. We present an efficient selective exploration strategy that maximally reuses the source task policy. Reuse efficiency is effected through identifying sub-spaces that are different in the target environment, thus limiting the exploration needed in the target task. We empirically show that SEAPoT performs better in terms of jump starts and cumulative average rewards, as compared to existing state-of-the-art policy reuse methods. 2017-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms |
spellingShingle |
Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun SEAPoT-RL: Selective exploration algorithm for policy transfer in RL |
description |
We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or obstacles, such as furniture. We address problems that require fast responses adapted from incomplete, prior knowledge of the agent in new scenarios. We present an efficient selective exploration strategy that maximally reuses the source task policy. Reuse efficiency is effected through identifying sub-spaces that are different in the target environment, thus limiting the exploration needed in the target task. We empirically show that SEAPoT performs better in terms of jump starts and cumulative average rewards, as compared to existing state-of-the-art policy reuse methods. |
format |
text |
author |
NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun |
author_facet |
NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun |
author_sort |
NARAYAN, Akshay |
title |
SEAPoT-RL: Selective exploration algorithm for policy transfer in RL |
title_short |
SEAPoT-RL: Selective exploration algorithm for policy transfer in RL |
title_full |
SEAPoT-RL: Selective exploration algorithm for policy transfer in RL |
title_fullStr |
SEAPoT-RL: Selective exploration algorithm for policy transfer in RL |
title_full_unstemmed |
SEAPoT-RL: Selective exploration algorithm for policy transfer in RL |
title_sort |
seapot-rl: selective exploration algorithm for policy transfer in rl |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2017 |
url |
https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf |
_version_ |
1770573714733137920 |