SEAPoT-RL: Selective exploration algorithm for policy transfer in RL

We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or...

Full description

Saved in:

Bibliographic Details
Main Authors:	NARAYAN, Akshay, LI, Zhuoru, LEONG, Tze-Yun
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2017
Subjects:	Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-4764
record_format	dspace
spelling	sg-smu-ink.sis_research-47642020-03-25T03:20:46Z SEAPoT-RL: Selective exploration algorithm for policy transfer in RL NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or obstacles, such as furniture. We address problems that require fast responses adapted from incomplete, prior knowledge of the agent in new scenarios. We present an efficient selective exploration strategy that maximally reuses the source task policy. Reuse efficiency is effected through identifying sub-spaces that are different in the target environment, thus limiting the exploration needed in the target task. We empirically show that SEAPoT performs better in terms of jump starts and cumulative average rewards, as compared to existing state-of-the-art policy reuse methods. 2017-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms
spellingShingle	Transfer learning policy transfer reinforcement learning Numerical Analysis and Scientific Computing Theory and Algorithms NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
description	We propose a new method for transferring a policy from a source task to a target task in model-based reinforcement learning. Our work is motivated by scenarios where a robotic agent operates in similar but challenging environments, such as hospital wards, differentiated by structural arrangements or obstacles, such as furniture. We address problems that require fast responses adapted from incomplete, prior knowledge of the agent in new scenarios. We present an efficient selective exploration strategy that maximally reuses the source task policy. Reuse efficiency is effected through identifying sub-spaces that are different in the target environment, thus limiting the exploration needed in the target task. We empirically show that SEAPoT performs better in terms of jump starts and cumulative average rewards, as compared to existing state-of-the-art policy reuse methods.
format	text
author	NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun
author_facet	NARAYAN, Akshay LI, Zhuoru LEONG, Tze-Yun
author_sort	NARAYAN, Akshay
title	SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_short	SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_full	SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_fullStr	SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_full_unstemmed	SEAPoT-RL: Selective exploration algorithm for policy transfer in RL
title_sort	seapot-rl: selective exploration algorithm for policy transfer in rl
publisher	Institutional Knowledge at Singapore Management University
publishDate	2017
url	https://ink.library.smu.edu.sg/sis_research/3762 https://ink.library.smu.edu.sg/context/sis_research/article/4764/viewcontent/14729_66712_1_PB.pdf
_version_	1770573714733137920

SEAPoT-RL: Selective exploration algorithm for policy transfer in RL

Similar Items