Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning

Conversational recommender systems (CRS) endow traditional recommender systems with the capability of dynamically obtaining users’ short-term preferences for items and attributes through interactive dialogues. There are three core challenges for CRS, including the intelligent decisions for what attr...

Full description

Saved in:
Bibliographic Details
Main Authors: DENG, Yang, LI, Yaliang, DING, Bolin, LAM, Wai
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9088
https://ink.library.smu.edu.sg/context/sis_research/article/10091/viewcontent/09964317.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10091
record_format dspace
spelling sg-smu-ink.sis_research-100912024-08-01T15:11:59Z Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning DENG, Yang LI, Yaliang DING, Bolin LAM, Wai Conversational recommender systems (CRS) endow traditional recommender systems with the capability of dynamically obtaining users’ short-term preferences for items and attributes through interactive dialogues. There are three core challenges for CRS, including the intelligent decisions for what attributes to ask, which items to recommend, and when to askor recommend, at each conversation turn. Previous methods mainly leverage reinforcement learning (RL) to learn conversational recommendation policies for solving one or two of these three decision-making problems in CRS with separated conversation and recommendation components. These approaches restrict the scalability and generality of CRS and fall short of preserving a stable training procedure. In the light of these challenges, we tackle these three decision-making problems in CRS as a unified policy learning task. In order to leverage different features that are important to each sub-problem and facilitate better unified policy learning in CRS, we propose two novel multi-agent RL-based frameworks, namely Independent and Hierarchical Multi-Agent UNIfied COnversational RecommeNders (IMAUNICORNandHMA-UNICORN),respectively. In specific, two low-level agents enrich the state representations for attribute prediction and item recommendation, by combining the long-term user preference information from the historical interaction data and the shortterm user preference information from the conversation history. A high-level meta agent is responsible for coordinating the low-level agents to adaptively make the final decision. Experimental results on four benchmark CRS datasets and a real-world E-Commerce application show that the proposed frameworks significantly outperform state-of-the-art methods. Extensive analyses further demonstrate the superior scalability of the MARL frameworks on the multi-round conversational recommendation. 2023-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9088 info:doi/10.1109/TKDE.2022.3225109 https://ink.library.smu.edu.sg/context/sis_research/article/10091/viewcontent/09964317.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Conversational recommender system multi-agent reinforcement learning graph representation learning Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Conversational recommender system
multi-agent reinforcement learning
graph representation learning
Databases and Information Systems
spellingShingle Conversational recommender system
multi-agent reinforcement learning
graph representation learning
Databases and Information Systems
DENG, Yang
LI, Yaliang
DING, Bolin
LAM, Wai
Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
description Conversational recommender systems (CRS) endow traditional recommender systems with the capability of dynamically obtaining users’ short-term preferences for items and attributes through interactive dialogues. There are three core challenges for CRS, including the intelligent decisions for what attributes to ask, which items to recommend, and when to askor recommend, at each conversation turn. Previous methods mainly leverage reinforcement learning (RL) to learn conversational recommendation policies for solving one or two of these three decision-making problems in CRS with separated conversation and recommendation components. These approaches restrict the scalability and generality of CRS and fall short of preserving a stable training procedure. In the light of these challenges, we tackle these three decision-making problems in CRS as a unified policy learning task. In order to leverage different features that are important to each sub-problem and facilitate better unified policy learning in CRS, we propose two novel multi-agent RL-based frameworks, namely Independent and Hierarchical Multi-Agent UNIfied COnversational RecommeNders (IMAUNICORNandHMA-UNICORN),respectively. In specific, two low-level agents enrich the state representations for attribute prediction and item recommendation, by combining the long-term user preference information from the historical interaction data and the shortterm user preference information from the conversation history. A high-level meta agent is responsible for coordinating the low-level agents to adaptively make the final decision. Experimental results on four benchmark CRS datasets and a real-world E-Commerce application show that the proposed frameworks significantly outperform state-of-the-art methods. Extensive analyses further demonstrate the superior scalability of the MARL frameworks on the multi-round conversational recommendation.
format text
author DENG, Yang
LI, Yaliang
DING, Bolin
LAM, Wai
author_facet DENG, Yang
LI, Yaliang
DING, Bolin
LAM, Wai
author_sort DENG, Yang
title Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
title_short Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
title_full Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
title_fullStr Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
title_full_unstemmed Leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
title_sort leveraging long short-term user preference in conversational recommendation via multi-agent reinforcement learning
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/9088
https://ink.library.smu.edu.sg/context/sis_research/article/10091/viewcontent/09964317.pdf
_version_ 1814047728078946304