Actively learn from LLMs with uncertainty propagation for generalized category discovery

Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches...

Full description

Saved in:
Bibliographic Details
Main Authors: LIANG, Jinggui, LIAO, Lizi, FEI, Hao, LI, Bobo, JIANG, Jing
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9700
https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10700
record_format dspace
spelling sg-smu-ink.sis_research-107002024-11-28T08:59:18Z Actively learn from LLMs with uncertainty propagation for generalized category discovery LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches tend to become overly tailored to known categories, failing to fully resolve the core issue. Hence, we propose to integrate the feedback from LLMs into an active learning paradigm. Specifically, our method innovatively employs uncertainty propagation to select data samples from high-uncertainty regions, which are then labeled using LLMs through a comparison-based prompting scheme. This not only eases the labeling task but also enhances accuracy in identifying new categories. Additionally, a soft feedback propagation mechanism is introduced to minimize the spread of inaccurate feedback. Experiments on various datasets demonstrate our framework’s efficacy and generalizability, significantly improving baseline models at a nominal average cost. 2024-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9700 info:doi/10.18653/v1/2024.naacl-long.434 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Large language models
LLMs
Category discovery
Natural language processing
Uncertainty propagation
Artificial Intelligence and Robotics
Computer Sciences
spellingShingle Large language models
LLMs
Category discovery
Natural language processing
Uncertainty propagation
Artificial Intelligence and Robotics
Computer Sciences
LIANG, Jinggui
LIAO, Lizi
FEI, Hao
LI, Bobo
JIANG, Jing
Actively learn from LLMs with uncertainty propagation for generalized category discovery
description Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches tend to become overly tailored to known categories, failing to fully resolve the core issue. Hence, we propose to integrate the feedback from LLMs into an active learning paradigm. Specifically, our method innovatively employs uncertainty propagation to select data samples from high-uncertainty regions, which are then labeled using LLMs through a comparison-based prompting scheme. This not only eases the labeling task but also enhances accuracy in identifying new categories. Additionally, a soft feedback propagation mechanism is introduced to minimize the spread of inaccurate feedback. Experiments on various datasets demonstrate our framework’s efficacy and generalizability, significantly improving baseline models at a nominal average cost.
format text
author LIANG, Jinggui
LIAO, Lizi
FEI, Hao
LI, Bobo
JIANG, Jing
author_facet LIANG, Jinggui
LIAO, Lizi
FEI, Hao
LI, Bobo
JIANG, Jing
author_sort LIANG, Jinggui
title Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_short Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_full Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_fullStr Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_full_unstemmed Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_sort actively learn from llms with uncertainty propagation for generalized category discovery
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9700
https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf
_version_ 1819113106795659264