Receding horizon cache and extreme learning machine based reinforcement learning
Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out a...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/97140 http://hdl.handle.net/10220/11704 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-97140 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-971402020-03-07T13:24:47Z Receding horizon cache and extreme learning machine based reinforcement learning Shao, Zhifei Er, Meng Joo Huang, Guang-Bin School of Electrical and Electronic Engineering International Conference on Control Automation Robotics & Vision (12th : 2012 : Guangzhou, China) DRNTU::Engineering::Electrical and electronic engineering Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out and a solution is proposed. Specifically, a Receding Horizon Cache (RHC) structure is designed to collect training data for NN by dynamically archiving state-action pairs and actively updating their Q-values, which makes batch learning NN much easier to implement. Together with Extreme Learning Machine (ELM), a new RL with function approximation algorithm termed as RHC and ELM based RL (RHC-ELM-RL) is proposed. A mountain car task was carried out to test RHC-ELM-RL and compare its performance with other algorithms. 2013-07-17T04:32:56Z 2019-12-06T19:39:20Z 2013-07-17T04:32:56Z 2019-12-06T19:39:20Z 2012 2012 Conference Paper Shao, Z., Er, M. J., & Huang, G.-B. (2012). Receding Horizon Cache and Extreme Learning Machine Based Reinforcement Learning. 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV), 1591-1596. https://hdl.handle.net/10356/97140 http://hdl.handle.net/10220/11704 10.1109/ICARCV.2012.6485384 en © 2012 IEEE. |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Shao, Zhifei Er, Meng Joo Huang, Guang-Bin Receding horizon cache and extreme learning machine based reinforcement learning |
description |
Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out and a solution is proposed. Specifically, a Receding Horizon Cache (RHC) structure is designed to collect training data for NN by dynamically archiving state-action pairs and actively updating their Q-values, which makes batch learning NN much easier to implement. Together with Extreme Learning Machine (ELM), a new RL with function approximation algorithm termed as RHC and ELM based RL (RHC-ELM-RL) is proposed. A mountain car task was carried out to test RHC-ELM-RL and compare its performance with other algorithms. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Shao, Zhifei Er, Meng Joo Huang, Guang-Bin |
format |
Conference or Workshop Item |
author |
Shao, Zhifei Er, Meng Joo Huang, Guang-Bin |
author_sort |
Shao, Zhifei |
title |
Receding horizon cache and extreme learning machine based reinforcement learning |
title_short |
Receding horizon cache and extreme learning machine based reinforcement learning |
title_full |
Receding horizon cache and extreme learning machine based reinforcement learning |
title_fullStr |
Receding horizon cache and extreme learning machine based reinforcement learning |
title_full_unstemmed |
Receding horizon cache and extreme learning machine based reinforcement learning |
title_sort |
receding horizon cache and extreme learning machine based reinforcement learning |
publishDate |
2013 |
url |
https://hdl.handle.net/10356/97140 http://hdl.handle.net/10220/11704 |
_version_ |
1681043221397897216 |