DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers
In this paper, we address the few-shot classification task from a new perspective of optimal matching between image regions. We adopt the Earth Mover’s Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/144270 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-144270 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1442702020-10-26T06:04:12Z DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers Zhang, Chi Cai, Yujun Lin, Guosheng Shen, Chunhua School of Computer Science and Engineering IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020 Engineering::Computer science and engineering Deep Neural Networks Earth Mover’s Distance (EMD) In this paper, we address the few-shot classification task from a new perspective of optimal matching between image regions. We adopt the Earth Mover’s Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the optimal matching flows between structural elements that have the minimum matching cost, which is used to represent the image distance for classification. To generate the important weights of elements in the EMD formulation, we design a cross-reference mechanism, which can effectively minimize the impact caused by the cluttered background and large intra-class appearance variations. To handle k-shot classification, we propose to learn a structured fully connected layer that can directly classify dense image representations with the EMD. Based on the implicit function theorem, the EMD can be inserted as a layer into the network for end-to-end training. We conduct comprehensive experiments to validate our algorithm and we set new state-of-the-art performance on four popular few-shot classification benchmarks, namely miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and Caltech-UCSD Birds-200-2011 (CUB). AI Singapore Ministry of Education (MOE) National Research Foundation (NRF) Accepted version This research is supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2018-003) and the MOE Tier-1 research grants: RG126/17 (S) and RG28/18 (S). Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore. This research is supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2018-003) and the MOE Tier-1 research grants: RG126/17 (S) and RG28/18 (S). Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore. 2020-10-26T06:04:12Z 2020-10-26T06:04:12Z 2020 Conference Paper Zhang, C., Cai, Y., Lin, G., & Shen, C. (2020). DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 12203-12213. https://hdl.handle.net/10356/144270 12203 12213 en AISG-RP-2018-003 RG126/17 (S) RG28/18 (S) © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Deep Neural Networks Earth Mover’s Distance (EMD) |
spellingShingle |
Engineering::Computer science and engineering Deep Neural Networks Earth Mover’s Distance (EMD) Zhang, Chi Cai, Yujun Lin, Guosheng Shen, Chunhua DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers |
description |
In this paper, we address the few-shot classification task from a new perspective of optimal matching between image regions. We adopt the Earth Mover’s Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the optimal matching flows between structural elements that have the minimum matching cost, which is used to represent the image distance for classification. To generate the important weights of elements in the EMD formulation, we design a cross-reference mechanism, which can effectively minimize the impact caused by the cluttered
background and large intra-class appearance variations. To handle k-shot classification, we propose to learn a structured fully connected layer that can directly classify dense image representations with the EMD. Based on the implicit
function theorem, the EMD can be inserted as a layer into the network for end-to-end training. We conduct comprehensive experiments to validate our algorithm and we set new state-of-the-art performance on four popular few-shot classification benchmarks, namely miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and Caltech-UCSD Birds-200-2011 (CUB). |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Zhang, Chi Cai, Yujun Lin, Guosheng Shen, Chunhua |
format |
Conference or Workshop Item |
author |
Zhang, Chi Cai, Yujun Lin, Guosheng Shen, Chunhua |
author_sort |
Zhang, Chi |
title |
DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers |
title_short |
DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers |
title_full |
DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers |
title_fullStr |
DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers |
title_full_unstemmed |
DeepEMD : few-shot image classification with differentiable Earth Mover’s Distance and structured classifiers |
title_sort |
deepemd : few-shot image classification with differentiable earth mover’s distance and structured classifiers |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/144270 |
_version_ |
1683494311144980480 |