LPT: Long-tailed prompt tuning for image classification

For long-tailed classification tasks, most works pretrain a large model on a large-scale (unlabeled) dataset and then fine-tune the whole pretrained model to adapt it to long-tailed data. Though promising, fine-tuning the whole pretrained model tends to incur high costs in computation and...

Full description

Bibliographic Details
Main Authors: DONG, Bowen, ZHOU, Pan, YAN, Shuicheng, ZUO, Wangmeng
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2023
Subjects: Graphics and Human Computer Interfaces
Online Access:https://ink.library.smu.edu.sg/sis_research/8982
https://ink.library.smu.edu.sg/context/sis_research/article/9985/viewcontent/2023_ICLR_LPT__1_.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-9985
record_format dspace
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Graphics and Human Computer Interfaces
description For long-tailed classification tasks, most works pretrain a large model on a large-scale (unlabeled) dataset and then fine-tune the whole pretrained model to adapt it to long-tailed data. Though promising, fine-tuning the whole pretrained model tends to incur high costs in computation and in deploying a separate model for each task, and to weaken generalization by overfitting to particular features of the long-tailed data. To alleviate these issues, we propose an effective Long-tailed Prompt Tuning (LPT) method for long-tailed classification tasks. LPT introduces several trainable prompts into a frozen pretrained model to adapt it to long-tailed data. For better effectiveness, we divide the prompts into two groups: 1) a shared prompt for the whole long-tailed dataset, which learns general features and adapts the pretrained model to the target long-tailed domain; and 2) group-specific prompts, which gather group-specific features for samples with similar features and endow the pretrained model with fine-grained discrimination ability. We then design a two-phase training paradigm to learn these prompts. In the first phase, we train the shared prompt via conventional supervised prompt tuning to adapt the pretrained model to the desired long-tailed domain. In the second phase, we use the learnt shared prompt as a query to select, from the group-specific prompt set, a small best-matched set of prompts for a group of similar samples, so as to mine the common features of these samples, and then optimize these prompts with a dual sampling strategy and an asymmetric Gaussian Clouded Logit loss. By fine-tuning only a few prompts while keeping the pretrained model fixed, LPT reduces both training and deployment cost, since only a few prompts need to be stored, and retains the strong generalization ability of the pretrained model. Experiments show that, on various long-tailed benchmarks, with only ~1.1% extra trainable parameters, LPT achieves performance comparable to or higher than previous whole-model fine-tuning methods and is more robust to domain shift. (An illustrative sketch of this prompt design follows the record below.)
format text
author DONG, Bowen
ZHOU, Pan
YAN, Shuicheng
ZUO, Wangmeng
title LPT: Long-tailed prompt tuning for image classification
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8982
https://ink.library.smu.edu.sg/context/sis_research/article/9985/viewcontent/2023_ICLR_LPT__1_.pdf
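
The description above outlines LPT's core design: a shared prompt tuned in a first phase on the whole long-tailed dataset, and a pool of group-specific prompts selected in a second phase by using the learnt shared-prompt features as a query. The PyTorch sketch below only illustrates that design as stated in the abstract; it is not the authors' released implementation, and the module name LPTPrompts, the prompt lengths, pool size, top-k value, and the cosine-similarity matching rule are all assumptions made for this example.

# Illustrative sketch, not the authors' code: it mirrors the shared / group-specific
# prompt split described in the abstract; sizes, names, and the matching rule are assumed.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LPTPrompts(nn.Module):
    def __init__(self, embed_dim=768, shared_len=10, pool_size=20, group_len=10, top_k=2):
        super().__init__()
        # Phase 1: one shared prompt tuned on the whole long-tailed dataset.
        self.shared_prompt = nn.Parameter(torch.empty(shared_len, embed_dim))
        # Phase 2: a pool of group-specific prompts, each paired with a learnable key.
        self.group_prompts = nn.Parameter(torch.empty(pool_size, group_len, embed_dim))
        self.prompt_keys = nn.Parameter(torch.empty(pool_size, embed_dim))
        for p in (self.shared_prompt, self.group_prompts, self.prompt_keys):
            nn.init.uniform_(p, -0.5, 0.5)
        self.top_k = top_k

    def select_group_prompts(self, query):
        # query: (B, D) features computed with the frozen backbone plus the learnt
        # shared prompt; cosine similarity against the keys is an assumed matching rule.
        sim = F.cosine_similarity(query.unsqueeze(1), self.prompt_keys.unsqueeze(0), dim=-1)  # (B, P)
        topk = sim.topk(self.top_k, dim=-1)
        picked = self.group_prompts[topk.indices]    # (B, k, L, D)
        return picked.flatten(1, 2), topk.values     # prompts to prepend, match scores


def prepend_prompts(token_embeds, prompts):
    # Prepend prompt tokens to the patch tokens before the frozen transformer blocks.
    return torch.cat([prompts, token_embeds], dim=1)


if __name__ == "__main__":
    lpt = LPTPrompts()
    tokens = torch.randn(4, 196, 768)   # toy patch embeddings from a frozen ViT-B/16
    query = torch.randn(4, 768)         # toy shared-prompt features used as the query
    group, _ = lpt.select_group_prompts(query)
    shared = lpt.shared_prompt.unsqueeze(0).expand(4, -1, -1)
    x = prepend_prompts(prepend_prompts(tokens, group), shared)
    print(x.shape)  # torch.Size([4, 226, 768]): 10 shared + 2*10 group + 196 patch tokens

In this reading, only shared_prompt, group_prompts, and prompt_keys would receive gradients while the backbone stays frozen, which is consistent with the roughly 1.1% extra trainable parameters reported in the description; the dual sampling strategy and the asymmetric Gaussian Clouded Logit loss are not sketched here.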