Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
© 2018 IEEE. Inverted pendulum is one of the classic control problem that could be solved by reinforcement learning approach. Most of the previous work consider the problem in discrete state space with only few exceptions assume continuous state domain. In this paper, we consider the problem of cart...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Conference Proceeding |
Published: |
2018
|
Subjects: | |
Online Access: | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054821848&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/62647 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Chiang Mai University |
id |
th-cmuir.6653943832-62647 |
---|---|
record_format |
dspace |
spelling |
th-cmuir.6653943832-626472018-11-29T07:38:07Z Least Square Reinforcement Learning for Solving Inverted Pendulum Problem Sa Ngapong Panyakaew Papangkorn Inkeaw Jakramate Bootkrajang Jeerayut Chaijaruwanich Computer Science © 2018 IEEE. Inverted pendulum is one of the classic control problem that could be solved by reinforcement learning approach. Most of the previous work consider the problem in discrete state space with only few exceptions assume continuous state domain. In this paper, we consider the problem of cart-pole balancing in the continuous state space setup with constrained track length. We adopted a least square temporal difference reinforcement learning algorithm for learning the controller. A new reward function is then proposed to better reflect the nature of the task. In addition, we also studied various factors which play important roles in the success of the learning. The empirical studies validate the effectiveness of our method. 2018-11-29T07:38:07Z 2018-11-29T07:38:07Z 2018-09-11 Conference Proceeding 2-s2.0-85054821848 10.1109/CCOMS.2018.8463234 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054821848&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/62647 |
institution |
Chiang Mai University |
building |
Chiang Mai University Library |
country |
Thailand |
collection |
CMU Intellectual Repository |
topic |
Computer Science |
spellingShingle |
Computer Science Sa Ngapong Panyakaew Papangkorn Inkeaw Jakramate Bootkrajang Jeerayut Chaijaruwanich Least Square Reinforcement Learning for Solving Inverted Pendulum Problem |
description |
© 2018 IEEE. Inverted pendulum is one of the classic control problem that could be solved by reinforcement learning approach. Most of the previous work consider the problem in discrete state space with only few exceptions assume continuous state domain. In this paper, we consider the problem of cart-pole balancing in the continuous state space setup with constrained track length. We adopted a least square temporal difference reinforcement learning algorithm for learning the controller. A new reward function is then proposed to better reflect the nature of the task. In addition, we also studied various factors which play important roles in the success of the learning. The empirical studies validate the effectiveness of our method. |
format |
Conference Proceeding |
author |
Sa Ngapong Panyakaew Papangkorn Inkeaw Jakramate Bootkrajang Jeerayut Chaijaruwanich |
author_facet |
Sa Ngapong Panyakaew Papangkorn Inkeaw Jakramate Bootkrajang Jeerayut Chaijaruwanich |
author_sort |
Sa Ngapong Panyakaew |
title |
Least Square Reinforcement Learning for Solving Inverted Pendulum Problem |
title_short |
Least Square Reinforcement Learning for Solving Inverted Pendulum Problem |
title_full |
Least Square Reinforcement Learning for Solving Inverted Pendulum Problem |
title_fullStr |
Least Square Reinforcement Learning for Solving Inverted Pendulum Problem |
title_full_unstemmed |
Least Square Reinforcement Learning for Solving Inverted Pendulum Problem |
title_sort |
least square reinforcement learning for solving inverted pendulum problem |
publishDate |
2018 |
url |
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054821848&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/62647 |
_version_ |
1681425846122840064 |