Controllable neural text generation in persona-based dialogue systems

Persona-based dialogue system, a specialized area of open-domain dialogue system, focuses on creating more engaging and personalized conversational agents. Distinguished from the open-domain dialogue system, it incorporates a consistent and distinct persona that directs its interaction style, which...

Full description

Saved in:
Bibliographic Details
Main Author: Zhu, Luyao
Other Authors: Erik Cambria
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2025
Subjects:
Online Access:https://hdl.handle.net/10356/182528
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-182528
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Controllable text generation
Persona attribute extraction
Relation triplet extraction
Dialogue generation
Personalized sentiment analysis
spellingShingle Computer and Information Science
Controllable text generation
Persona attribute extraction
Relation triplet extraction
Dialogue generation
Personalized sentiment analysis
Zhu, Luyao
Controllable neural text generation in persona-based dialogue systems
description Persona-based dialogue system, a specialized area of open-domain dialogue system, focuses on creating more engaging and personalized conversational agents. Distinguished from the open-domain dialogue system, it incorporates a consistent and distinct persona that directs its interaction style, which is either explicitly defined or learned from large datasets of human interactions. The recent advance of pre-trained language models boosts the dialogue system, significantly enhancing the user experience by delivering fluent, diversified, and human-like conversations. However, the challenge of maintaining persona consistency and contextual coherence still persists in persona-driven dialogue generation, due to the overlook of user intent and persona identification. Therefore, it is proposed to create a pipeline dialogue system consisting of natural language understanding module for precise user intent detection and persona extraction, and dialogue management and generation module for multiple source inputs integration. The research goals of this thesis include controllable sentence generation for persona-consistent and contextual coherent conversation generation, controllable structural extraction by exploiting persona attributes in conversation histories to facilitate natural language understanding in a dialogue system, and controllable data synthesis for low-resource extraction scenario. Natural language understanding and dialogue management and generation, which elevated the performance of a task-oriented dialogue system, are introduced to the persona-based dialogue system. In the first work, a lightweight hierarchical intent-inferring pointer network is proposed for multi-source persona-driven dialogue generation. For natural language understanding, the proposed method involves detecting interlocutor intents, retrieving the most relevant persona attributes in chitchat, and utilizing pseudo-labeling and natural language inference techniques to generate intent labels. For dialogue management and generation, a multi-source pointer-generator is designed to leverage the useful information from multiple input sources and filter out irrelevant textual noises. However, matching the relevant persona attribute is a retrieval-based method and requires predefined persona information, nontransferable to realistic conversations. This motivates the second framework, which formulates and investigates persona attribute extraction from dialogues utilizing pre-trained language models in generalized zero-shot setting. Moreover, a Meta-Variational-AutoEncoder sampler with contrastive structured constraint is presented to tackle the hard negative samples in this task. By leveraging more reliable text-label matching criteria, a task-specific dataset with more detailed relation types and consistent entity relation annotation is created for persona attribute extraction. The aforementioned method is applicable to cases when seen data and unseen relation types are available. A more stringent condition has not been explored where no information about the unseen data is accessible, which is genuine zero-shot persona attribute extraction. In the third work, thus, a Chain-of-Proposal prompting framework is developed for unseen relation generation, controllable support sentence synthesis conditioned on hypothesized relations, synthetic data denoising, and zero-shot persona triplet extraction. Extensive experiments on benchmark and built datasets demonstrate the superior performance of the proposed frameworks compared to strong baseline models. A further discussion and empirical study is conducted on the potential constructive effects of persona attribute understanding on personalized conversational sentiment analysis.
author2 Erik Cambria
author_facet Erik Cambria
Zhu, Luyao
format Thesis-Doctor of Philosophy
author Zhu, Luyao
author_sort Zhu, Luyao
title Controllable neural text generation in persona-based dialogue systems
title_short Controllable neural text generation in persona-based dialogue systems
title_full Controllable neural text generation in persona-based dialogue systems
title_fullStr Controllable neural text generation in persona-based dialogue systems
title_full_unstemmed Controllable neural text generation in persona-based dialogue systems
title_sort controllable neural text generation in persona-based dialogue systems
publisher Nanyang Technological University
publishDate 2025
url https://hdl.handle.net/10356/182528
_version_ 1826362256363880448
spelling sg-ntu-dr.10356-1825282025-03-04T02:57:32Z Controllable neural text generation in persona-based dialogue systems Zhu, Luyao Erik Cambria College of Computing and Data Science Computational Intelligence Lab cambria@ntu.edu.sg Computer and Information Science Controllable text generation Persona attribute extraction Relation triplet extraction Dialogue generation Personalized sentiment analysis Persona-based dialogue system, a specialized area of open-domain dialogue system, focuses on creating more engaging and personalized conversational agents. Distinguished from the open-domain dialogue system, it incorporates a consistent and distinct persona that directs its interaction style, which is either explicitly defined or learned from large datasets of human interactions. The recent advance of pre-trained language models boosts the dialogue system, significantly enhancing the user experience by delivering fluent, diversified, and human-like conversations. However, the challenge of maintaining persona consistency and contextual coherence still persists in persona-driven dialogue generation, due to the overlook of user intent and persona identification. Therefore, it is proposed to create a pipeline dialogue system consisting of natural language understanding module for precise user intent detection and persona extraction, and dialogue management and generation module for multiple source inputs integration. The research goals of this thesis include controllable sentence generation for persona-consistent and contextual coherent conversation generation, controllable structural extraction by exploiting persona attributes in conversation histories to facilitate natural language understanding in a dialogue system, and controllable data synthesis for low-resource extraction scenario. Natural language understanding and dialogue management and generation, which elevated the performance of a task-oriented dialogue system, are introduced to the persona-based dialogue system. In the first work, a lightweight hierarchical intent-inferring pointer network is proposed for multi-source persona-driven dialogue generation. For natural language understanding, the proposed method involves detecting interlocutor intents, retrieving the most relevant persona attributes in chitchat, and utilizing pseudo-labeling and natural language inference techniques to generate intent labels. For dialogue management and generation, a multi-source pointer-generator is designed to leverage the useful information from multiple input sources and filter out irrelevant textual noises. However, matching the relevant persona attribute is a retrieval-based method and requires predefined persona information, nontransferable to realistic conversations. This motivates the second framework, which formulates and investigates persona attribute extraction from dialogues utilizing pre-trained language models in generalized zero-shot setting. Moreover, a Meta-Variational-AutoEncoder sampler with contrastive structured constraint is presented to tackle the hard negative samples in this task. By leveraging more reliable text-label matching criteria, a task-specific dataset with more detailed relation types and consistent entity relation annotation is created for persona attribute extraction. The aforementioned method is applicable to cases when seen data and unseen relation types are available. A more stringent condition has not been explored where no information about the unseen data is accessible, which is genuine zero-shot persona attribute extraction. In the third work, thus, a Chain-of-Proposal prompting framework is developed for unseen relation generation, controllable support sentence synthesis conditioned on hypothesized relations, synthetic data denoising, and zero-shot persona triplet extraction. Extensive experiments on benchmark and built datasets demonstrate the superior performance of the proposed frameworks compared to strong baseline models. A further discussion and empirical study is conducted on the potential constructive effects of persona attribute understanding on personalized conversational sentiment analysis. Doctor of Philosophy 2025-02-06T04:17:23Z 2025-02-06T04:17:23Z 2025 Thesis-Doctor of Philosophy Zhu, L. (2025). Controllable neural text generation in persona-based dialogue systems. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/182528 https://hdl.handle.net/10356/182528 10.32657/10356/182528 en A18A2b0046 This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University