Programmatic policies for interpretable reinforcement learning using pre-trained models

Decision Trees (DTs) are widely used in machine learning due to their critical interpretability. However, training DTs in a Reinforcement Learning (RL) setting is challenging. In the project, we present a framework to improve the interpretability of reinforcement learning (RL) by generating prog...

全面介紹

Saved in:

書目詳細資料
主要作者:	Tu, Xia Yang
其他作者:	Arvind Easwaran
格式:	Final Year Project
語言:	English
出版:	Nanyang Technological University 2024
主題:	Computer and Information Science LLM Reinforcement learning
在線閱讀:	https://hdl.handle.net/10356/181169
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Nanyang Technological University
語言:	English

id	sg-ntu-dr.10356-181169
record_format	dspace
spelling	sg-ntu-dr.10356-1811692024-11-18T01:24:17Z Programmatic policies for interpretable reinforcement learning using pre-trained models Tu, Xia Yang Arvind Easwaran College of Computing and Data Science arvinde@ntu.edu.sg Computer and Information Science LLM Reinforcement learning Decision Trees (DTs) are widely used in machine learning due to their critical interpretability. However, training DTs in a Reinforcement Learning (RL) setting is challenging. In the project, we present a framework to improve the interpretability of reinforcement learning (RL) by generating programmatic policies through large language models (LLMs). Initially looking at decision trees within latent space representation, we proceed to develop and apply a “code reflection” framework in the Karel environment, which provides a controlled setting for evaluating task performance. The framework leverages prompt engineering and code reflection to optimize program synthesis, producing interpretable policies. The report discusses the findings and concludes with recommendations for future work at the end. Bachelor's degree 2024-11-18T01:24:16Z 2024-11-18T01:24:16Z 2024 Final Year Project (FYP) Tu, X. Y. (2024). Programmatic policies for interpretable reinforcement learning using pre-trained models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181169 https://hdl.handle.net/10356/181169 en SCSE23-0622 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Computer and Information Science LLM Reinforcement learning
spellingShingle	Computer and Information Science LLM Reinforcement learning Tu, Xia Yang Programmatic policies for interpretable reinforcement learning using pre-trained models
description	Decision Trees (DTs) are widely used in machine learning due to their critical interpretability. However, training DTs in a Reinforcement Learning (RL) setting is challenging. In the project, we present a framework to improve the interpretability of reinforcement learning (RL) by generating programmatic policies through large language models (LLMs). Initially looking at decision trees within latent space representation, we proceed to develop and apply a “code reflection” framework in the Karel environment, which provides a controlled setting for evaluating task performance. The framework leverages prompt engineering and code reflection to optimize program synthesis, producing interpretable policies. The report discusses the findings and concludes with recommendations for future work at the end.
author2	Arvind Easwaran
author_facet	Arvind Easwaran Tu, Xia Yang
format	Final Year Project
author	Tu, Xia Yang
author_sort	Tu, Xia Yang
title	Programmatic policies for interpretable reinforcement learning using pre-trained models
title_short	Programmatic policies for interpretable reinforcement learning using pre-trained models
title_full	Programmatic policies for interpretable reinforcement learning using pre-trained models
title_fullStr	Programmatic policies for interpretable reinforcement learning using pre-trained models
title_full_unstemmed	Programmatic policies for interpretable reinforcement learning using pre-trained models
title_sort	programmatic policies for interpretable reinforcement learning using pre-trained models
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/181169
_version_	1816859022167375872

Programmatic policies for interpretable reinforcement learning using pre-trained models

相似書籍