Least Square Reinforcement Learning for Solving Inverted Pendulum Problem

© 2018 IEEE. Inverted pendulum is one of the classic control problem that could be solved by reinforcement learning approach. Most of the previous work consider the problem in discrete state space with only few exceptions assume continuous state domain. In this paper, we consider the problem of cart...

Full description

Saved in:
Bibliographic Details
Main Authors: Sa Ngapong Panyakaew, Papangkorn Inkeaw, Jakramate Bootkrajang, Jeerayut Chaijaruwanich
Format: Conference Proceeding
Published: 2018
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054821848&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/62647
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-62647
record_format dspace
spelling th-cmuir.6653943832-626472018-11-29T07:38:07Z Least Square Reinforcement Learning for Solving Inverted Pendulum Problem Sa Ngapong Panyakaew Papangkorn Inkeaw Jakramate Bootkrajang Jeerayut Chaijaruwanich Computer Science © 2018 IEEE. Inverted pendulum is one of the classic control problem that could be solved by reinforcement learning approach. Most of the previous work consider the problem in discrete state space with only few exceptions assume continuous state domain. In this paper, we consider the problem of cart-pole balancing in the continuous state space setup with constrained track length. We adopted a least square temporal difference reinforcement learning algorithm for learning the controller. A new reward function is then proposed to better reflect the nature of the task. In addition, we also studied various factors which play important roles in the success of the learning. The empirical studies validate the effectiveness of our method. 2018-11-29T07:38:07Z 2018-11-29T07:38:07Z 2018-09-11 Conference Proceeding 2-s2.0-85054821848 10.1109/CCOMS.2018.8463234 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054821848&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/62647
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Computer Science
spellingShingle Computer Science
Sa Ngapong Panyakaew
Papangkorn Inkeaw
Jakramate Bootkrajang
Jeerayut Chaijaruwanich
Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
description © 2018 IEEE. Inverted pendulum is one of the classic control problem that could be solved by reinforcement learning approach. Most of the previous work consider the problem in discrete state space with only few exceptions assume continuous state domain. In this paper, we consider the problem of cart-pole balancing in the continuous state space setup with constrained track length. We adopted a least square temporal difference reinforcement learning algorithm for learning the controller. A new reward function is then proposed to better reflect the nature of the task. In addition, we also studied various factors which play important roles in the success of the learning. The empirical studies validate the effectiveness of our method.
format Conference Proceeding
author Sa Ngapong Panyakaew
Papangkorn Inkeaw
Jakramate Bootkrajang
Jeerayut Chaijaruwanich
author_facet Sa Ngapong Panyakaew
Papangkorn Inkeaw
Jakramate Bootkrajang
Jeerayut Chaijaruwanich
author_sort Sa Ngapong Panyakaew
title Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
title_short Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
title_full Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
title_fullStr Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
title_full_unstemmed Least Square Reinforcement Learning for Solving Inverted Pendulum Problem
title_sort least square reinforcement learning for solving inverted pendulum problem
publishDate 2018
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054821848&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/62647
_version_ 1681425846122840064