Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues

Target-driven recommendation dialogues present unique challenges in dialogue management due to the necessity of anticipating user interactions for successful conversations. Current methods face significant limitations: (I) inadequate capabilities for conversation anticipation, (II) computational ine...

Full description

Saved in:

Bibliographic Details
Main Authors:	DAO, Quang Huy, DENG, Yang, BUI, Khanh-Huyen, LE, Dung D., LIAO, Lizi
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Recommendation dialogues Experiential policy learning EPL Large Language Models LLMs Artificial Intelligence and Robotics Computer Sciences
Online Access:	https://ink.library.smu.edu.sg/sis_research/9617 https://ink.library.smu.edu.sg/context/sis_research/article/10617/viewcontent/2743_Experience_as_Source_for_.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10617
record_format	dspace
spelling	sg-smu-ink.sis_research-106172024-11-23T15:43:47Z Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues DAO, Quang Huy DENG, Yang BUI, Khanh-Huyen LE, Dung D. LIAO, Lizi Target-driven recommendation dialogues present unique challenges in dialogue management due to the necessity of anticipating user interactions for successful conversations. Current methods face significant limitations: (I) inadequate capabilities for conversation anticipation, (II) computational inefficiencies due to costly simulations, and (III) neglect of valuable past dialogue experiences. To address these limitations, we propose a new framework, Experiential Policy Learning (EPL), for enhancing such dialogues. EPL embodies the principle of Learning From Experience, facilitating anticipation with an experiential scoring function that estimates dialogue state potential using similar past interactions stored in long-term memory. To demonstrate its flexibility, we introduce Tree-structured EPL (T-EPL) as one possible training-free realization with Large Language Models (LLMs) and Monte-Carlo Tree Search (MCTS). T-EPL assesses past dialogue states with LLMs while utilizing MCTS to achieve hierarchical and multi-level reasoning. Extensive experiments on two published datasets demonstrate the superiority and efficacy of T-EPL. 2024-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9617 https://ink.library.smu.edu.sg/context/sis_research/article/10617/viewcontent/2743_Experience_as_Source_for_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Recommendation dialogues Experiential policy learning EPL Large Language Models LLMs Artificial Intelligence and Robotics Computer Sciences
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Recommendation dialogues Experiential policy learning EPL Large Language Models LLMs Artificial Intelligence and Robotics Computer Sciences
spellingShingle	Recommendation dialogues Experiential policy learning EPL Large Language Models LLMs Artificial Intelligence and Robotics Computer Sciences DAO, Quang Huy DENG, Yang BUI, Khanh-Huyen LE, Dung D. LIAO, Lizi Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
description	Target-driven recommendation dialogues present unique challenges in dialogue management due to the necessity of anticipating user interactions for successful conversations. Current methods face significant limitations: (I) inadequate capabilities for conversation anticipation, (II) computational inefficiencies due to costly simulations, and (III) neglect of valuable past dialogue experiences. To address these limitations, we propose a new framework, Experiential Policy Learning (EPL), for enhancing such dialogues. EPL embodies the principle of Learning From Experience, facilitating anticipation with an experiential scoring function that estimates dialogue state potential using similar past interactions stored in long-term memory. To demonstrate its flexibility, we introduce Tree-structured EPL (T-EPL) as one possible training-free realization with Large Language Models (LLMs) and Monte-Carlo Tree Search (MCTS). T-EPL assesses past dialogue states with LLMs while utilizing MCTS to achieve hierarchical and multi-level reasoning. Extensive experiments on two published datasets demonstrate the superiority and efficacy of T-EPL.
format	text
author	DAO, Quang Huy DENG, Yang BUI, Khanh-Huyen LE, Dung D. LIAO, Lizi
author_facet	DAO, Quang Huy DENG, Yang BUI, Khanh-Huyen LE, Dung D. LIAO, Lizi
author_sort	DAO, Quang Huy
title	Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_short	Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_full	Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_fullStr	Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_full_unstemmed	Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_sort	experience as source for anticipation and planning : experiential policy learning for target-driven recommendation dialogues
publisher	Institutional Knowledge at Singapore Management University
publishDate	2024
url	https://ink.library.smu.edu.sg/sis_research/9617 https://ink.library.smu.edu.sg/context/sis_research/article/10617/viewcontent/2743_Experience_as_Source_for_.pdf
_version_	1816859162178486272

Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues

Similar Items