Actively learn from LLMs with uncertainty propagation for generalized category discovery
Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9700 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10700 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-107002024-11-28T08:59:18Z Actively learn from LLMs with uncertainty propagation for generalized category discovery LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches tend to become overly tailored to known categories, failing to fully resolve the core issue. Hence, we propose to integrate the feedback from LLMs into an active learning paradigm. Specifically, our method innovatively employs uncertainty propagation to select data samples from high-uncertainty regions, which are then labeled using LLMs through a comparison-based prompting scheme. This not only eases the labeling task but also enhances accuracy in identifying new categories. Additionally, a soft feedback propagation mechanism is introduced to minimize the spread of inaccurate feedback. Experiments on various datasets demonstrate our framework’s efficacy and generalizability, significantly improving baseline models at a nominal average cost. 2024-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9700 info:doi/10.18653/v1/2024.naacl-long.434 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences |
spellingShingle |
Large language models LLMs Category discovery Natural language processing Uncertainty propagation Artificial Intelligence and Robotics Computer Sciences LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing Actively learn from LLMs with uncertainty propagation for generalized category discovery |
description |
Generalized category discovery faces a key issue: the lack of supervision for new and unseen data categories. Traditional methods typically combine supervised pretraining with self-supervised learning to create models, and then employ clustering for category identification. However, these approaches tend to become overly tailored to known categories, failing to fully resolve the core issue. Hence, we propose to integrate the feedback from LLMs into an active learning paradigm. Specifically, our method innovatively employs uncertainty propagation to select data samples from high-uncertainty regions, which are then labeled using LLMs through a comparison-based prompting scheme. This not only eases the labeling task but also enhances accuracy in identifying new categories. Additionally, a soft feedback propagation mechanism is introduced to minimize the spread of inaccurate feedback. Experiments on various datasets demonstrate our framework’s efficacy and generalizability, significantly improving baseline models at a nominal average cost. |
format |
text |
author |
LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing |
author_facet |
LIANG, Jinggui LIAO, Lizi FEI, Hao LI, Bobo JIANG, Jing |
author_sort |
LIANG, Jinggui |
title |
Actively learn from LLMs with uncertainty propagation for generalized category discovery |
title_short |
Actively learn from LLMs with uncertainty propagation for generalized category discovery |
title_full |
Actively learn from LLMs with uncertainty propagation for generalized category discovery |
title_fullStr |
Actively learn from LLMs with uncertainty propagation for generalized category discovery |
title_full_unstemmed |
Actively learn from LLMs with uncertainty propagation for generalized category discovery |
title_sort |
actively learn from llms with uncertainty propagation for generalized category discovery |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/9700 https://ink.library.smu.edu.sg/context/sis_research/article/10700/viewcontent/2024.naacl_long.434.pdf |
_version_ |
1819113106795659264 |