Hindsight-Combined and Hindsight-Prioritized Experience Replay
Reinforcement learning has proved to be of great utility; execution, however, may be costly due to sampling inefficiency. An efficient method for training is experience replay, which recalls past experiences. Several experience replay techniques, namely, combined experience replay, hindsight experie...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2020
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/mathematics-faculty-pubs/146 https://link.springer.com/chapter/10.1007%2F978-3-030-63833-7_36 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.mathematics-faculty-pubs-1145 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.mathematics-faculty-pubs-11452022-02-23T07:40:18Z Hindsight-Combined and Hindsight-Prioritized Experience Replay Tan, Renzo Roel P Ikeda, Kazushi Vergara, John Paul Reinforcement learning has proved to be of great utility; execution, however, may be costly due to sampling inefficiency. An efficient method for training is experience replay, which recalls past experiences. Several experience replay techniques, namely, combined experience replay, hindsight experience replay, and prioritized experience replay, have been crafted while their relative merits are unclear. In the study, one proposes hybrid algorithms – hindsight-combined and hindsight-prioritized experience replay – and evaluates their performance against published baselines. Experimental results demonstrate the superior performance of hindsight-combined experience replay on an OpenAI Gym benchmark. Further, insight into the nonconvergence of hindsightprioritized experience replay is presented towards the improvement of the approach. 2020-11-01T07:00:00Z text https://archium.ateneo.edu/mathematics-faculty-pubs/146 https://link.springer.com/chapter/10.1007%2F978-3-030-63833-7_36 Mathematics Faculty Publications Archīum Ateneo Experience replay Deep Q-Network reinforcement learning sample efficiency hybrid algorithm Logic and Foundations Mathematics |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
Experience replay Deep Q-Network reinforcement learning sample efficiency hybrid algorithm Logic and Foundations Mathematics |
spellingShingle |
Experience replay Deep Q-Network reinforcement learning sample efficiency hybrid algorithm Logic and Foundations Mathematics Tan, Renzo Roel P Ikeda, Kazushi Vergara, John Paul Hindsight-Combined and Hindsight-Prioritized Experience Replay |
description |
Reinforcement learning has proved to be of great utility; execution, however, may be costly due to sampling inefficiency. An efficient method for training is experience replay, which recalls past experiences. Several experience replay techniques, namely, combined experience replay, hindsight experience replay, and prioritized experience replay, have been crafted while their relative merits are unclear. In the study, one proposes hybrid algorithms – hindsight-combined and hindsight-prioritized experience replay – and evaluates their performance against published baselines. Experimental results demonstrate the superior performance of hindsight-combined experience replay on an OpenAI Gym benchmark. Further, insight into the nonconvergence of hindsightprioritized experience replay is presented towards the improvement of the approach. |
format |
text |
author |
Tan, Renzo Roel P Ikeda, Kazushi Vergara, John Paul |
author_facet |
Tan, Renzo Roel P Ikeda, Kazushi Vergara, John Paul |
author_sort |
Tan, Renzo Roel P |
title |
Hindsight-Combined and Hindsight-Prioritized Experience Replay |
title_short |
Hindsight-Combined and Hindsight-Prioritized Experience Replay |
title_full |
Hindsight-Combined and Hindsight-Prioritized Experience Replay |
title_fullStr |
Hindsight-Combined and Hindsight-Prioritized Experience Replay |
title_full_unstemmed |
Hindsight-Combined and Hindsight-Prioritized Experience Replay |
title_sort |
hindsight-combined and hindsight-prioritized experience replay |
publisher |
Archīum Ateneo |
publishDate |
2020 |
url |
https://archium.ateneo.edu/mathematics-faculty-pubs/146 https://link.springer.com/chapter/10.1007%2F978-3-030-63833-7_36 |
_version_ |
1726158613889679360 |