SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation

Knowledge base question generation (KBQG) aims to generate natural language questions from a set of triplet facts extracted from KB. Existing methods have significantly boosted the performance of KBQG via pre-trained language models (PLMs) thanks to the richly endowed semantic knowledge. With the ad...

Full description

Saved in:
Bibliographic Details
Main Authors: GUO, Shasha, LIAO, Lizi, ZHANG, Jing, WANG, Yanling, LI, Cuiping, CHEN, Hong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9702
https://ink.library.smu.edu.sg/context/sis_research/article/10702/viewcontent/2024.findings_naacl.287.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10702
record_format dspace
spelling sg-smu-ink.sis_research-107022024-11-28T08:58:29Z SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation GUO, Shasha LIAO, Lizi ZHANG, Jing WANG, Yanling LI, Cuiping CHEN, Hong Knowledge base question generation (KBQG) aims to generate natural language questions from a set of triplet facts extracted from KB. Existing methods have significantly boosted the performance of KBQG via pre-trained language models (PLMs) thanks to the richly endowed semantic knowledge. With the advance of pre-training techniques, large language models (LLMs) (e.g., GPT-3.5) undoubtedly possess much more semantic knowledge. Therefore, how to effectively organize and exploit the abundant knowledge for KBQG becomes the focus of our study. In this work, we propose SGSH — a simple and effective framework to Stimulate GPT-3.5 with Skeleton Heuristics to enhance KBQG. The framework incorporates “skeleton heuristics”, which provides more finegrained guidance associated with each input to stimulate LLMs to generate optimal questions, encompassing essential elements like the question phrase and the auxiliary verb. More specifically, we devise an automatic data construction strategy leveraging ChatGPT to construct a skeleton training dataset, based on which we employ a soft prompting approach to train a BART model dedicated to generating the skeleton associated with each input. Subsequently, skeleton heuristics are encoded into the prompt to incentivize GPT-3.5 to generate desired questions. Extensive experiments demonstrate that SGSH derives the new state-of-the-art performance on the KBQG tasks. The code is now available on Github. 2024-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9702 info:doi/10.18653/v1/2024.findings-naacl.287 https://ink.library.smu.edu.sg/context/sis_research/article/10702/viewcontent/2024.findings_naacl.287.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Knowledge base question generation KBQR Natural language processing Skeleton Heuristics Large language models LLMs Artificial Intelligence and Robotics Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Knowledge base question generation
KBQR
Natural language processing
Skeleton Heuristics
Large language models
LLMs
Artificial Intelligence and Robotics
Computer Sciences
spellingShingle Knowledge base question generation
KBQR
Natural language processing
Skeleton Heuristics
Large language models
LLMs
Artificial Intelligence and Robotics
Computer Sciences
GUO, Shasha
LIAO, Lizi
ZHANG, Jing
WANG, Yanling
LI, Cuiping
CHEN, Hong
SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation
description Knowledge base question generation (KBQG) aims to generate natural language questions from a set of triplet facts extracted from KB. Existing methods have significantly boosted the performance of KBQG via pre-trained language models (PLMs) thanks to the richly endowed semantic knowledge. With the advance of pre-training techniques, large language models (LLMs) (e.g., GPT-3.5) undoubtedly possess much more semantic knowledge. Therefore, how to effectively organize and exploit the abundant knowledge for KBQG becomes the focus of our study. In this work, we propose SGSH — a simple and effective framework to Stimulate GPT-3.5 with Skeleton Heuristics to enhance KBQG. The framework incorporates “skeleton heuristics”, which provides more finegrained guidance associated with each input to stimulate LLMs to generate optimal questions, encompassing essential elements like the question phrase and the auxiliary verb. More specifically, we devise an automatic data construction strategy leveraging ChatGPT to construct a skeleton training dataset, based on which we employ a soft prompting approach to train a BART model dedicated to generating the skeleton associated with each input. Subsequently, skeleton heuristics are encoded into the prompt to incentivize GPT-3.5 to generate desired questions. Extensive experiments demonstrate that SGSH derives the new state-of-the-art performance on the KBQG tasks. The code is now available on Github.
format text
author GUO, Shasha
LIAO, Lizi
ZHANG, Jing
WANG, Yanling
LI, Cuiping
CHEN, Hong
author_facet GUO, Shasha
LIAO, Lizi
ZHANG, Jing
WANG, Yanling
LI, Cuiping
CHEN, Hong
author_sort GUO, Shasha
title SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation
title_short SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation
title_full SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation
title_fullStr SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation
title_full_unstemmed SGSH : Stimulate Large Language Models with skeleton heuristics for knowledge base question generation
title_sort sgsh : stimulate large language models with skeleton heuristics for knowledge base question generation
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9702
https://ink.library.smu.edu.sg/context/sis_research/article/10702/viewcontent/2024.findings_naacl.287.pdf
_version_ 1819113107413270528