Hindsight-Combined and Hindsight-Prioritized Experience Replay

Reinforcement learning has proved to be of great utility; execution, however, may be costly due to sampling inefficiency. An efficient method for training is experience replay, which recalls past experiences. Several experience replay techniques, namely, combined experience replay, hindsight experie...

Full description

Saved in:
Bibliographic Details
Main Authors: Tan, Renzo Roel P, Ikeda, Kazushi, Vergara, John Paul
Format: text
Published: Archīum Ateneo 2020
Subjects:
Online Access:https://archium.ateneo.edu/mathematics-faculty-pubs/146
https://link.springer.com/chapter/10.1007%2F978-3-030-63833-7_36
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.mathematics-faculty-pubs-1145
record_format eprints
spelling ph-ateneo-arc.mathematics-faculty-pubs-11452022-02-23T07:40:18Z Hindsight-Combined and Hindsight-Prioritized Experience Replay Tan, Renzo Roel P Ikeda, Kazushi Vergara, John Paul Reinforcement learning has proved to be of great utility; execution, however, may be costly due to sampling inefficiency. An efficient method for training is experience replay, which recalls past experiences. Several experience replay techniques, namely, combined experience replay, hindsight experience replay, and prioritized experience replay, have been crafted while their relative merits are unclear. In the study, one proposes hybrid algorithms – hindsight-combined and hindsight-prioritized experience replay – and evaluates their performance against published baselines. Experimental results demonstrate the superior performance of hindsight-combined experience replay on an OpenAI Gym benchmark. Further, insight into the nonconvergence of hindsightprioritized experience replay is presented towards the improvement of the approach. 2020-11-01T07:00:00Z text https://archium.ateneo.edu/mathematics-faculty-pubs/146 https://link.springer.com/chapter/10.1007%2F978-3-030-63833-7_36 Mathematics Faculty Publications Archīum Ateneo Experience replay Deep Q-Network reinforcement learning sample efficiency hybrid algorithm Logic and Foundations Mathematics
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic Experience replay
Deep Q-Network
reinforcement learning
sample efficiency
hybrid algorithm
Logic and Foundations
Mathematics
spellingShingle Experience replay
Deep Q-Network
reinforcement learning
sample efficiency
hybrid algorithm
Logic and Foundations
Mathematics
Tan, Renzo Roel P
Ikeda, Kazushi
Vergara, John Paul
Hindsight-Combined and Hindsight-Prioritized Experience Replay
description Reinforcement learning has proved to be of great utility; execution, however, may be costly due to sampling inefficiency. An efficient method for training is experience replay, which recalls past experiences. Several experience replay techniques, namely, combined experience replay, hindsight experience replay, and prioritized experience replay, have been crafted while their relative merits are unclear. In the study, one proposes hybrid algorithms – hindsight-combined and hindsight-prioritized experience replay – and evaluates their performance against published baselines. Experimental results demonstrate the superior performance of hindsight-combined experience replay on an OpenAI Gym benchmark. Further, insight into the nonconvergence of hindsightprioritized experience replay is presented towards the improvement of the approach.
format text
author Tan, Renzo Roel P
Ikeda, Kazushi
Vergara, John Paul
author_facet Tan, Renzo Roel P
Ikeda, Kazushi
Vergara, John Paul
author_sort Tan, Renzo Roel P
title Hindsight-Combined and Hindsight-Prioritized Experience Replay
title_short Hindsight-Combined and Hindsight-Prioritized Experience Replay
title_full Hindsight-Combined and Hindsight-Prioritized Experience Replay
title_fullStr Hindsight-Combined and Hindsight-Prioritized Experience Replay
title_full_unstemmed Hindsight-Combined and Hindsight-Prioritized Experience Replay
title_sort hindsight-combined and hindsight-prioritized experience replay
publisher Archīum Ateneo
publishDate 2020
url https://archium.ateneo.edu/mathematics-faculty-pubs/146
https://link.springer.com/chapter/10.1007%2F978-3-030-63833-7_36
_version_ 1726158613889679360