Finding meta winning ticket to train your MAML
The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning....
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7260 https://ink.library.smu.edu.sg/context/sis_research/article/8263/viewcontent/kdd22_qu.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8263 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-82632022-09-12T10:08:14Z Finding meta winning ticket to train your MAML GAO, Dawei XIE, Yuexiang ZHOU, Zimu WANG, Zhen LI, Yaliang DING, Bolin. The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning. First, we experimentally show that there are sparse sub-networks, known as meta winning tickets, which can be meta-trained to few-shot classification accuracy to the original backbone. The application of LTH in meta learning enables the adaptation of meta-trained networks on various IoT devices with fewer computation. However, the status quo to identify winning tickets requires iterative training and pruning, which is particularly expensive for finding meta winning tickets. To this end, then we investigate the inter- and intra-layer patterns among different meta winning tickets, and propose a scheme for early detection of a meta winning ticket. The proposed scheme enables efficient training in resource-limited devices. Besides, it also designs a lightweight solution to search the meta winning ticket. Evaluations on standard few-shot classification benchmarks show that we can find competitive meta winning tickets with 20% weights of the original backbone, while incurring only 8%-14% (Conv-4) and 19%-29% (ResNet-12) computation overhead (measured by FLOPs) of the standard winning ticket finding scheme. 2022-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7260 info:doi/10.1145/3534678.3539467 https://ink.library.smu.edu.sg/context/sis_research/article/8263/viewcontent/kdd22_qu.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Meta Learning Network Pruning Lottery Ticket Hypothesis Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Meta Learning Network Pruning Lottery Ticket Hypothesis Software Engineering |
spellingShingle |
Meta Learning Network Pruning Lottery Ticket Hypothesis Software Engineering GAO, Dawei XIE, Yuexiang ZHOU, Zimu WANG, Zhen LI, Yaliang DING, Bolin. Finding meta winning ticket to train your MAML |
description |
The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning. First, we experimentally show that there are sparse sub-networks, known as meta winning tickets, which can be meta-trained to few-shot classification accuracy to the original backbone. The application of LTH in meta learning enables the adaptation of meta-trained networks on various IoT devices with fewer computation. However, the status quo to identify winning tickets requires iterative training and pruning, which is particularly expensive for finding meta winning tickets. To this end, then we investigate the inter- and intra-layer patterns among different meta winning tickets, and propose a scheme for early detection of a meta winning ticket. The proposed scheme enables efficient training in resource-limited devices. Besides, it also designs a lightweight solution to search the meta winning ticket. Evaluations on standard few-shot classification benchmarks show that we can find competitive meta winning tickets with 20% weights of the original backbone, while incurring only 8%-14% (Conv-4) and 19%-29% (ResNet-12) computation overhead (measured by FLOPs) of the standard winning ticket finding scheme. |
format |
text |
author |
GAO, Dawei XIE, Yuexiang ZHOU, Zimu WANG, Zhen LI, Yaliang DING, Bolin. |
author_facet |
GAO, Dawei XIE, Yuexiang ZHOU, Zimu WANG, Zhen LI, Yaliang DING, Bolin. |
author_sort |
GAO, Dawei |
title |
Finding meta winning ticket to train your MAML |
title_short |
Finding meta winning ticket to train your MAML |
title_full |
Finding meta winning ticket to train your MAML |
title_fullStr |
Finding meta winning ticket to train your MAML |
title_full_unstemmed |
Finding meta winning ticket to train your MAML |
title_sort |
finding meta winning ticket to train your maml |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7260 https://ink.library.smu.edu.sg/context/sis_research/article/8263/viewcontent/kdd22_qu.pdf |
_version_ |
1770576293283233792 |