Reinforced target-driven conversational promotion
The ability to proactively engage with users towards pitching products is highly desired for conversational assistants. However, existing conversational recommendation methods overemphasize on acquiring user preferences while ignore the strategic planning for nudging users towards accepting a design...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8581 https://ink.library.smu.edu.sg/context/sis_research/article/9584/viewcontent/Reinforced_target_driven_conversational_promotion.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9584 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-95842024-04-18T02:35:43Z Reinforced target-driven conversational promotion DAO, Huy Quang LIAO, Lizi LE, Dung D. NIE, Yuxiang The ability to proactively engage with users towards pitching products is highly desired for conversational assistants. However, existing conversational recommendation methods overemphasize on acquiring user preferences while ignore the strategic planning for nudging users towards accepting a designated item. Hence, these methods fail to promote specified items with engaging responses. In this work, we propose a Reinforced Target-driven Conversational Promotion (RTCP) framework for conversational promotion. RTCP integrates short-term and long-term planning via a balanced gating mechanism. Inside which, the dialogue actions are predicted via a knowledge-integrated multi-head attention and guided via reinforcement learning rewards. RTCP then employs action-guided prefix tuning to generate relevant responses. Experimental results demonstrate that our model outperforms state-of-the-art models on both automatic metrics and human evaluation. Moreover, RTCP has a strong capability in quickly adapting to unseen scenarios just by updating prefix parameters without re-training the whole model. 2023-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8581 info:doi/10.18653/v1/2023.emnlp-main.775 https://ink.library.smu.edu.sg/context/sis_research/article/9584/viewcontent/Reinforced_target_driven_conversational_promotion.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Conversational recommendations Dialogue strategy Gating mechanisms Knowledge integrated Long term planning Recommendation methods Reinforcement learnings Target driven Tuning method Artificial Intelligence and Robotics Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Conversational recommendations Dialogue strategy Gating mechanisms Knowledge integrated Long term planning Recommendation methods Reinforcement learnings Target driven Tuning method Artificial Intelligence and Robotics Databases and Information Systems |
spellingShingle |
Conversational recommendations Dialogue strategy Gating mechanisms Knowledge integrated Long term planning Recommendation methods Reinforcement learnings Target driven Tuning method Artificial Intelligence and Robotics Databases and Information Systems DAO, Huy Quang LIAO, Lizi LE, Dung D. NIE, Yuxiang Reinforced target-driven conversational promotion |
description |
The ability to proactively engage with users towards pitching products is highly desired for conversational assistants. However, existing conversational recommendation methods overemphasize on acquiring user preferences while ignore the strategic planning for nudging users towards accepting a designated item. Hence, these methods fail to promote specified items with engaging responses. In this work, we propose a Reinforced Target-driven Conversational Promotion (RTCP) framework for conversational promotion. RTCP integrates short-term and long-term planning via a balanced gating mechanism. Inside which, the dialogue actions are predicted via a knowledge-integrated multi-head attention and guided via reinforcement learning rewards. RTCP then employs action-guided prefix tuning to generate relevant responses. Experimental results demonstrate that our model outperforms state-of-the-art models on both automatic metrics and human evaluation. Moreover, RTCP has a strong capability in quickly adapting to unseen scenarios just by updating prefix parameters without re-training the whole model. |
format |
text |
author |
DAO, Huy Quang LIAO, Lizi LE, Dung D. NIE, Yuxiang |
author_facet |
DAO, Huy Quang LIAO, Lizi LE, Dung D. NIE, Yuxiang |
author_sort |
DAO, Huy Quang |
title |
Reinforced target-driven conversational promotion |
title_short |
Reinforced target-driven conversational promotion |
title_full |
Reinforced target-driven conversational promotion |
title_fullStr |
Reinforced target-driven conversational promotion |
title_full_unstemmed |
Reinforced target-driven conversational promotion |
title_sort |
reinforced target-driven conversational promotion |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8581 https://ink.library.smu.edu.sg/context/sis_research/article/9584/viewcontent/Reinforced_target_driven_conversational_promotion.pdf |
_version_ |
1814047492870766592 |