Finding meta winning ticket to train your MAML

The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning....

Full description

Saved in:
Bibliographic Details
Main Authors: GAO, Dawei, XIE, Yuexiang, ZHOU, Zimu, WANG, Zhen, LI, Yaliang, DING, Bolin.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7260
https://ink.library.smu.edu.sg/context/sis_research/article/8263/viewcontent/kdd22_qu.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8263
record_format dspace
spelling sg-smu-ink.sis_research-82632022-09-12T10:08:14Z Finding meta winning ticket to train your MAML GAO, Dawei XIE, Yuexiang ZHOU, Zimu WANG, Zhen LI, Yaliang DING, Bolin. The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning. First, we experimentally show that there are sparse sub-networks, known as meta winning tickets, which can be meta-trained to few-shot classification accuracy to the original backbone. The application of LTH in meta learning enables the adaptation of meta-trained networks on various IoT devices with fewer computation. However, the status quo to identify winning tickets requires iterative training and pruning, which is particularly expensive for finding meta winning tickets. To this end, then we investigate the inter- and intra-layer patterns among different meta winning tickets, and propose a scheme for early detection of a meta winning ticket. The proposed scheme enables efficient training in resource-limited devices. Besides, it also designs a lightweight solution to search the meta winning ticket. Evaluations on standard few-shot classification benchmarks show that we can find competitive meta winning tickets with 20% weights of the original backbone, while incurring only 8%-14% (Conv-4) and 19%-29% (ResNet-12) computation overhead (measured by FLOPs) of the standard winning ticket finding scheme. 2022-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7260 info:doi/10.1145/3534678.3539467 https://ink.library.smu.edu.sg/context/sis_research/article/8263/viewcontent/kdd22_qu.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Meta Learning Network Pruning Lottery Ticket Hypothesis Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Meta Learning
Network Pruning
Lottery Ticket Hypothesis
Software Engineering
spellingShingle Meta Learning
Network Pruning
Lottery Ticket Hypothesis
Software Engineering
GAO, Dawei
XIE, Yuexiang
ZHOU, Zimu
WANG, Zhen
LI, Yaliang
DING, Bolin.
Finding meta winning ticket to train your MAML
description The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning. First, we experimentally show that there are sparse sub-networks, known as meta winning tickets, which can be meta-trained to few-shot classification accuracy to the original backbone. The application of LTH in meta learning enables the adaptation of meta-trained networks on various IoT devices with fewer computation. However, the status quo to identify winning tickets requires iterative training and pruning, which is particularly expensive for finding meta winning tickets. To this end, then we investigate the inter- and intra-layer patterns among different meta winning tickets, and propose a scheme for early detection of a meta winning ticket. The proposed scheme enables efficient training in resource-limited devices. Besides, it also designs a lightweight solution to search the meta winning ticket. Evaluations on standard few-shot classification benchmarks show that we can find competitive meta winning tickets with 20% weights of the original backbone, while incurring only 8%-14% (Conv-4) and 19%-29% (ResNet-12) computation overhead (measured by FLOPs) of the standard winning ticket finding scheme.
format text
author GAO, Dawei
XIE, Yuexiang
ZHOU, Zimu
WANG, Zhen
LI, Yaliang
DING, Bolin.
author_facet GAO, Dawei
XIE, Yuexiang
ZHOU, Zimu
WANG, Zhen
LI, Yaliang
DING, Bolin.
author_sort GAO, Dawei
title Finding meta winning ticket to train your MAML
title_short Finding meta winning ticket to train your MAML
title_full Finding meta winning ticket to train your MAML
title_fullStr Finding meta winning ticket to train your MAML
title_full_unstemmed Finding meta winning ticket to train your MAML
title_sort finding meta winning ticket to train your maml
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7260
https://ink.library.smu.edu.sg/context/sis_research/article/8263/viewcontent/kdd22_qu.pdf
_version_ 1770576293283233792