DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers

In this paper, we address the few-shot classification task from a new perspective of optimal matching between image regions. We adopt the Earth Mover’s Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Chi, Cai, Yujun, Lin, Guosheng, Shen, Chunhua
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/144270
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-144270
record_format dspace
spelling sg-ntu-dr.10356-1442702020-10-26T06:04:12Z DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers Zhang, Chi Cai, Yujun Lin, Guosheng Shen, Chunhua School of Computer Science and Engineering IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020 Engineering::Computer science and engineering Deep Neural Networks Earth Mover’s Distance (EMD) In this paper, we address the few-shot classification task from a new perspective of optimal matching between image regions. We adopt the Earth Mover’s Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the optimal matching flows between structural elements that have the minimum matching cost, which is used to represent the image distance for classification. To generate the important weights of elements in the EMD formulation, we design a cross-reference mechanism, which can effectively minimize the impact caused by the cluttered background and large intra-class appearance variations. To handle k-shot classification, we propose to learn a structured fully connected layer that can directly classify dense image representations with the EMD. Based on the implicit function theorem, the EMD can be inserted as a layer into the network for end-to-end training. We conduct comprehensive experiments to validate our algorithm and we set new state-of-the-art performance on four popular few-shot classification benchmarks, namely miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and Caltech-UCSD Birds-200-2011 (CUB). AI Singapore Ministry of Education (MOE) National Research Foundation (NRF) Accepted version This research is supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2018-003) and the MOE Tier-1 research grants: RG126/17 (S) and RG28/18 (S). Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore. This research is supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2018-003) and the MOE Tier-1 research grants: RG126/17 (S) and RG28/18 (S). Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore. 2020-10-26T06:04:12Z 2020-10-26T06:04:12Z 2020 Conference Paper Zhang, C., Cai, Y., Lin, G., & Shen, C. (2020). DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 12203-12213. https://hdl.handle.net/10356/144270 12203 12213 en AISG-RP-2018-003 RG126/17 (S) RG28/18 (S) © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Deep Neural Networks
Earth Mover’s Distance (EMD)
spellingShingle Engineering::Computer science and engineering
Deep Neural Networks
Earth Mover’s Distance (EMD)
Zhang, Chi
Cai, Yujun
Lin, Guosheng
Shen, Chunhua
DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
description In this paper, we address the few-shot classification task from a new perspective of optimal matching between image regions. We adopt the Earth Mover’s Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the optimal matching flows between structural elements that have the minimum matching cost, which is used to represent the image distance for classification. To generate the important weights of elements in the EMD formulation, we design a cross-reference mechanism, which can effectively minimize the impact caused by the cluttered background and large intra-class appearance variations. To handle k-shot classification, we propose to learn a structured fully connected layer that can directly classify dense image representations with the EMD. Based on the implicit function theorem, the EMD can be inserted as a layer into the network for end-to-end training. We conduct comprehensive experiments to validate our algorithm and we set new state-of-the-art performance on four popular few-shot classification benchmarks, namely miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and Caltech-UCSD Birds-200-2011 (CUB).
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Zhang, Chi
Cai, Yujun
Lin, Guosheng
Shen, Chunhua
format Conference or Workshop Item
author Zhang, Chi
Cai, Yujun
Lin, Guosheng
Shen, Chunhua
author_sort Zhang, Chi
title DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
title_short DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
title_full DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
title_fullStr DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
title_full_unstemmed DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
title_sort deepemd : few-shot image classification with differentiable earth mover’s distance and structured classifiers
publishDate 2020
url https://hdl.handle.net/10356/144270
_version_ 1683494311144980480