Actively learn from LLMs with uncertainty propagation for generalized category discovery

Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches...

Full description

Saved in:

Bibliographic Details
Main Authors:	LIANG, Jinggui, LIAO, Lizi, FEI, Hao, LI, Bobo, JIANG, Jing
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences
Online Access:	https://ink.library.smu.edu.sg/sis_research/9700 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10700
record_format	dspace
spelling	sg-smu-ink.sis_research-107002024-11-28T08:59:18Z Actively learn from LLMs with uncertainty propagation for generalized category discovery LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches tend to become overly tailored to known categories, failing to fully resolve the core issue. Hence, we propose to integrate the feedback from LLMs into an active learning paradigm. Specifically, our method innovatively employs uncertainty propagation to select data samples from high-uncertainty regions, which are then labeled using LLMs through a comparison-based prompting scheme. This not only eases the labeling task but also enhances accuracy in identifying new categories. Additionally, a soft feedback propagation mechanism is introduced to minimize the spread of inaccurate feedback. Experiments on various datasets demonstrate our framework’s efficacy and generalizability, significantly improving baseline models at a nominal average cost. 2024-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9700 info:doi/10.18653/v1/2024.naacl-long.434 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences
spellingShingle	Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing Actively learn from LLMs with uncertainty propagation for generalized category discovery
description	Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches tend to become overly tailored to known categories, failing to fully resolve the core issue. Hence, we propose to integrate the feedback from LLMs into an active learning paradigm. Specifically, our method innovatively employs uncertainty propagation to select data samples from high-uncertainty regions, which are then labeled using LLMs through a comparison-based prompting scheme. This not only eases the labeling task but also enhances accuracy in identifying new categories. Additionally, a soft feedback propagation mechanism is introduced to minimize the spread of inaccurate feedback. Experiments on various datasets demonstrate our framework’s efficacy and generalizability, significantly improving baseline models at a nominal average cost.
format	text
author	LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing
author_facet	LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing
author_sort	LIANG, Jinggui
title	Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_short	Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_full	Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_fullStr	Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_full_unstemmed	Actively learn from LLMs with uncertainty propagation for generalized category discovery
title_sort	actively learn from llms with uncertainty propagation for generalized category discovery
publisher	Institutional Knowledge at Singapore Management University
publishDate	2024
url	https://ink.library.smu.edu.sg/sis_research/9700 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf
_version_	1819113106795659264

Actively learn from LLMs with uncertainty propagation for generalized category discovery

Similar Items