Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues

Target-driven recommendation dialogues present unique challenges in dialogue management due to the necessity of anticipating user interactions for successful conversations. Current methods face significant limitations: (I) inadequate capabilities for conversation anticipation, (II) computational ine...

Full description

Saved in:
Bibliographic Details
Main Authors: DAO, Quang Huy, DENG, Yang, BUI, Khanh-Huyen, LE, Dung D., LIAO, Lizi
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
EPL
Online Access:https://ink.library.smu.edu.sg/sis_research/9617
https://ink.library.smu.edu.sg/context/sis_research/article/10617/viewcontent/2743_Experience_as_Source_for_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10617
record_format dspace
spelling sg-smu-ink.sis_research-106172024-11-23T15:43:47Z Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues DAO, Quang Huy DENG, Yang BUI, Khanh-Huyen LE, Dung D. LIAO, Lizi Target-driven recommendation dialogues present unique challenges in dialogue management due to the necessity of anticipating user interactions for successful conversations. Current methods face significant limitations: (I) inadequate capabilities for conversation anticipation, (II) computational inefficiencies due to costly simulations, and (III) neglect of valuable past dialogue experiences. To address these limitations, we propose a new framework, Experiential Policy Learning (EPL), for enhancing such dialogues. EPL embodies the principle of Learning From Experience, facilitating anticipation with an experiential scoring function that estimates dialogue state potential using similar past interactions stored in long-term memory. To demonstrate its flexibility, we introduce Tree-structured EPL (T-EPL) as one possible training-free realization with Large Language Models (LLMs) and Monte-Carlo Tree Search (MCTS). T-EPL assesses past dialogue states with LLMs while utilizing MCTS to achieve hierarchical and multi-level reasoning. Extensive experiments on two published datasets demonstrate the superiority and efficacy of T-EPL. 2024-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9617 https://ink.library.smu.edu.sg/context/sis_research/article/10617/viewcontent/2743_Experience_as_Source_for_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Recommendation dialogues Experiential policy learning EPL Large Language Models LLMs Artificial Intelligence and Robotics Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Recommendation dialogues
Experiential policy learning
EPL
Large Language Models
LLMs
Artificial Intelligence and Robotics
Computer Sciences
spellingShingle Recommendation dialogues
Experiential policy learning
EPL
Large Language Models
LLMs
Artificial Intelligence and Robotics
Computer Sciences
DAO, Quang Huy
DENG, Yang
BUI, Khanh-Huyen
LE, Dung D.
LIAO, Lizi
Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
description Target-driven recommendation dialogues present unique challenges in dialogue management due to the necessity of anticipating user interactions for successful conversations. Current methods face significant limitations: (I) inadequate capabilities for conversation anticipation, (II) computational inefficiencies due to costly simulations, and (III) neglect of valuable past dialogue experiences. To address these limitations, we propose a new framework, Experiential Policy Learning (EPL), for enhancing such dialogues. EPL embodies the principle of Learning From Experience, facilitating anticipation with an experiential scoring function that estimates dialogue state potential using similar past interactions stored in long-term memory. To demonstrate its flexibility, we introduce Tree-structured EPL (T-EPL) as one possible training-free realization with Large Language Models (LLMs) and Monte-Carlo Tree Search (MCTS). T-EPL assesses past dialogue states with LLMs while utilizing MCTS to achieve hierarchical and multi-level reasoning. Extensive experiments on two published datasets demonstrate the superiority and efficacy of T-EPL.
format text
author DAO, Quang Huy
DENG, Yang
BUI, Khanh-Huyen
LE, Dung D.
LIAO, Lizi
author_facet DAO, Quang Huy
DENG, Yang
BUI, Khanh-Huyen
LE, Dung D.
LIAO, Lizi
author_sort DAO, Quang Huy
title Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_short Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_full Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_fullStr Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_full_unstemmed Experience as source for anticipation and planning : Experiential policy learning for target-driven recommendation dialogues
title_sort experience as source for anticipation and planning : experiential policy learning for target-driven recommendation dialogues
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9617
https://ink.library.smu.edu.sg/context/sis_research/article/10617/viewcontent/2743_Experience_as_Source_for_.pdf
_version_ 1816859162178486272