Receding horizon cache and extreme learning machine based reinforcement learning

Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out a...

Full description

Saved in:
Bibliographic Details
Main Authors: Shao, Zhifei, Er, Meng Joo, Huang, Guang-Bin
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/97140
http://hdl.handle.net/10220/11704
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-97140
record_format dspace
spelling sg-ntu-dr.10356-971402020-03-07T13:24:47Z Receding horizon cache and extreme learning machine based reinforcement learning Shao, Zhifei Er, Meng Joo Huang, Guang-Bin School of Electrical and Electronic Engineering International Conference on Control Automation Robotics & Vision (12th : 2012 : Guangzhou, China) DRNTU::Engineering::Electrical and electronic engineering Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out and a solution is proposed. Specifically, a Receding Horizon Cache (RHC) structure is designed to collect training data for NN by dynamically archiving state-action pairs and actively updating their Q-values, which makes batch learning NN much easier to implement. Together with Extreme Learning Machine (ELM), a new RL with function approximation algorithm termed as RHC and ELM based RL (RHC-ELM-RL) is proposed. A mountain car task was carried out to test RHC-ELM-RL and compare its performance with other algorithms. 2013-07-17T04:32:56Z 2019-12-06T19:39:20Z 2013-07-17T04:32:56Z 2019-12-06T19:39:20Z 2012 2012 Conference Paper Shao, Z., Er, M. J., & Huang, G.-B. (2012). Receding Horizon Cache and Extreme Learning Machine Based Reinforcement Learning. 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV), 1591-1596. https://hdl.handle.net/10356/97140 http://hdl.handle.net/10220/11704 10.1109/ICARCV.2012.6485384 en © 2012 IEEE.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Shao, Zhifei
Er, Meng Joo
Huang, Guang-Bin
Receding horizon cache and extreme learning machine based reinforcement learning
description Function approximators have been extensively used in Reinforcement Learning (RL) to deal with large or continuous space problems. However, batch learning Neural Networks (NN), one of the most common approximators, has been rarely applied to RL. In this paper, possible reasons for this are laid out and a solution is proposed. Specifically, a Receding Horizon Cache (RHC) structure is designed to collect training data for NN by dynamically archiving state-action pairs and actively updating their Q-values, which makes batch learning NN much easier to implement. Together with Extreme Learning Machine (ELM), a new RL with function approximation algorithm termed as RHC and ELM based RL (RHC-ELM-RL) is proposed. A mountain car task was carried out to test RHC-ELM-RL and compare its performance with other algorithms.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Shao, Zhifei
Er, Meng Joo
Huang, Guang-Bin
format Conference or Workshop Item
author Shao, Zhifei
Er, Meng Joo
Huang, Guang-Bin
author_sort Shao, Zhifei
title Receding horizon cache and extreme learning machine based reinforcement learning
title_short Receding horizon cache and extreme learning machine based reinforcement learning
title_full Receding horizon cache and extreme learning machine based reinforcement learning
title_fullStr Receding horizon cache and extreme learning machine based reinforcement learning
title_full_unstemmed Receding horizon cache and extreme learning machine based reinforcement learning
title_sort receding horizon cache and extreme learning machine based reinforcement learning
publishDate 2013
url https://hdl.handle.net/10356/97140
http://hdl.handle.net/10220/11704
_version_ 1681043221397897216