DML-PL: deep metric learning based pseudo-labeling framework for class imbalanced semi-supervised learning

Traditional class imbalanced learning algorithms require training data to be labeled, whereas semi-supervised learning algorithms assume that the class distribution is balanced. However, class imbalance and insufficient labeled data problems often coexist in practical real-world applications. Curren...

Bibliographic Details
Main Authors: Yan, Mi, Hui, Siu Cheung, Li, Ning
Other Authors: School of Computer Science and Engineering
Format: Article
Language: English
Published: 2023
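Subjects: Engineering::Computer science and engineering; Class Imbalanced Classification; Semi-Supervised Learning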
Online Access: https://hdl.handle.net/10356/170840
Institution: Nanyang Technological University
id sg-ntu-dr.10356-170840
record_format dspace
funding This study is supported by the National Natural Science Foundation of China (62273230) and the China Scholarship Council (No. 202006230225).
date 2023-10-03T07:32:52Z
type Journal Article
citation Yan, M., Hui, S. C. & Li, N. (2023). DML-PL: deep metric learning based pseudo-labeling framework for class imbalanced semi-supervised learning. Information Sciences, 626, 641-657. https://dx.doi.org/10.1016/j.ins.2023.01.074
issn 0020-0255
handle https://hdl.handle.net/10356/170840
doi 10.1016/j.ins.2023.01.074
scopus_id 2-s2.0-85149786935
rights © 2023 Elsevier Inc. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Class Imbalanced Classification
Semi-Supervised Learning
description Traditional class-imbalanced learning algorithms require labeled training data, whereas semi-supervised learning algorithms assume a balanced class distribution. In real-world applications, however, class imbalance and a shortage of labeled data often coexist. Most existing class-imbalanced semi-supervised learning methods tackle these two problems separately, which biases the trained model towards majority classes that have more samples. In this study, we propose a deep metric learning based pseudo-labeling (DML-PL) framework that tackles both problems simultaneously. The framework comprises three modules: Deep Metric Learning, Pseudo-Labeling and Network Fine-tuning, applied in an iterative self-training strategy that trains the model over multiple rounds. In each round, Deep Metric Learning trains a deep metric network to learn compact feature representations of the labeled and unlabeled data. Pseudo-Labeling then generates reliable pseudo-labels for unlabeled data by clustering the labeled data and selecting nearest neighbors. Finally, Network Fine-tuning fine-tunes the deep metric network so that it generates better pseudo-labels in the next round. Training ends when all unlabeled data have been pseudo-labeled. Compared with baseline models, the proposed framework achieved state-of-the-art performance on the long-tailed CIFAR-10, CIFAR-100, and ImageNet127 benchmark datasets.
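The iterative loop outlined in the description can be illustrated with a small sketch. This is not the authors' implementation: the function names, the per-class budget `k`, and the `embed` placeholder are hypothetical, class centroids stand in for the clustering step, and the identity function stands in for the deep metric network (no training or fine-tuning is performed). It assumes every class has at least one labeled sample.

```python
import numpy as np


def class_centroids(features, labels, num_classes):
    """Mean feature vector of each class (a simple stand-in for clustering)."""
    return np.stack([features[labels == c].mean(axis=0) for c in range(num_classes)])


def pseudo_label_round(feat_l, y_l, feat_u, num_classes, k):
    """Pick, per class, up to k unlabeled points closest to that class centroid.

    Returns indices into feat_u and the pseudo-labels assigned to them.
    """
    centroids = class_centroids(feat_l, y_l, num_classes)
    # Pairwise Euclidean distances, shape (n_unlabeled, num_classes).
    dists = np.linalg.norm(feat_u[:, None, :] - centroids[None, :, :], axis=2)
    nearest = dists.argmin(axis=1)
    picked_idx, picked_lab = [], []
    for c in range(num_classes):
        candidates = np.flatnonzero(nearest == c)
        # Same budget k for every class, so minority classes are not crowded out.
        best = candidates[np.argsort(dists[candidates, c])[:k]]
        picked_idx.extend(best.tolist())
        picked_lab.extend([c] * len(best))
    return np.array(picked_idx, dtype=int), np.array(picked_lab, dtype=int)


def self_train(x_l, y_l, x_u, num_classes, k=5, embed=lambda x: x):
    """Repeat pseudo-labeling rounds until every unlabeled point has a label.

    `embed` is a hypothetical placeholder for the deep metric network; a real
    pipeline would retrain or fine-tune it at the start of every round.
    """
    remaining = np.arange(len(x_u))      # indices of points still unlabeled
    pseudo = np.full(len(x_u), -1)       # -1 marks "no pseudo-label yet"
    while len(remaining) > 0:
        idx, lab = pseudo_label_round(
            embed(x_l), y_l, embed(x_u[remaining]), num_classes, k
        )
        chosen = remaining[idx]
        pseudo[chosen] = lab
        # Newly labeled points join the labeled pool for the next round.
        x_l = np.vstack([x_l, x_u[chosen]])
        y_l = np.concatenate([y_l, lab])
        remaining = np.delete(remaining, idx)
    return pseudo
```

Because each unlabeled point has exactly one nearest centroid, every round labels at least one point, so the loop stops once the unlabeled pool is empty, mirroring the stopping rule stated in the description.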
author2 School of Computer Science and Engineering
format Article
author Yan, Mi
Hui, Siu Cheung
Li, Ning
author_sort Yan, Mi
title DML-PL: deep metric learning based pseudo-labeling framework for class imbalanced semi-supervised learning
publishDate 2023
url https://hdl.handle.net/10356/170840
_version_ 1779171089611489280