Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations

Despite the remarkable abilities of Large Language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer. To avoid providing hallucinated answers to these unknown questions, existing studies typically inv...

Full description

Saved in:
Bibliographic Details
Main Authors: DENG, Yang, ZHAO, Yong, LI, Moxin, NG, See-Kiong, CHUA, Tat-Seng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9614
https://ink.library.smu.edu.sg/context/sis_research/article/10614/viewcontent/2402.15062v2.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10614
record_format dspace
spelling sg-smu-ink.sis_research-106142024-11-23T15:47:34Z Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations DENG, Yang ZHAO, Yong LI, Moxin NG, See-Kiong CHUA, Tat-Seng Despite the remarkable abilities of Large Language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer. To avoid providing hallucinated answers to these unknown questions, existing studies typically investigate approaches to refusing to answer these questions. In this work, we propose a novel and scalable self-alignment method to utilize the LLM itself to enhance its response-ability to different types of unknown questions, being capable of not only refusing to answer but also providing explanation to the unanswerability of unknown questions. Specifically, the Self-Align method first employ a two-stage class-aware self-augmentation approach to generate a large amount of unknown question-response data. Then we conduct disparity-driven self-curation to select qualified data for fine-tuning the LLM itself for aligning the responses to unknown questions as desired. Experimental results on two datasets across four types of unknown questions validate the superiority of the Self-Align method over existing baselines in terms of three types of task formulation. 2024-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9614 https://ink.library.smu.edu.sg/context/sis_research/article/10614/viewcontent/2402.15062v2.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Large Language Models LLMs Unknown question response Self-Align method Artificial Intelligence and Robotics Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Large Language Models
LLMs
Unknown question response
Self-Align method
Artificial Intelligence and Robotics
Computer Sciences
spellingShingle Large Language Models
LLMs
Unknown question response
Self-Align method
Artificial Intelligence and Robotics
Computer Sciences
DENG, Yang
ZHAO, Yong
LI, Moxin
NG, See-Kiong
CHUA, Tat-Seng
Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations
description Despite the remarkable abilities of Large Language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer. To avoid providing hallucinated answers to these unknown questions, existing studies typically investigate approaches to refusing to answer these questions. In this work, we propose a novel and scalable self-alignment method to utilize the LLM itself to enhance its response-ability to different types of unknown questions, being capable of not only refusing to answer but also providing explanation to the unanswerability of unknown questions. Specifically, the Self-Align method first employ a two-stage class-aware self-augmentation approach to generate a large amount of unknown question-response data. Then we conduct disparity-driven self-curation to select qualified data for fine-tuning the LLM itself for aligning the responses to unknown questions as desired. Experimental results on two datasets across four types of unknown questions validate the superiority of the Self-Align method over existing baselines in terms of three types of task formulation.
format text
author DENG, Yang
ZHAO, Yong
LI, Moxin
NG, See-Kiong
CHUA, Tat-Seng
author_facet DENG, Yang
ZHAO, Yong
LI, Moxin
NG, See-Kiong
CHUA, Tat-Seng
author_sort DENG, Yang
title Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations
title_short Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations
title_full Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations
title_fullStr Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations
title_full_unstemmed Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations
title_sort don’t just say “i don’t know”! self-aligning large language models for responding to unknown questions with explanations
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9614
https://ink.library.smu.edu.sg/context/sis_research/article/10614/viewcontent/2402.15062v2.pdf
_version_ 1816859161284050944