Improved generalisation bounds for deep learning through L∞ covering numbers

Using proof techniques involving L∞ covering numbers, we show generalisation error bounds for deep learning with two main improvements over the state of the art. First, our bounds have no explicit dependence on the number of classes except for logarithmic factors. This holds even when formulating th...

Full description

Saved in:
Bibliographic Details
Main Authors: LEDENT, Antoine, LEI, Yunwen, KLOFT, Marius
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7211
https://ink.library.smu.edu.sg/context/sis_research/article/8214/viewcontent/85_wrshpnew.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8214
record_format dspace
spelling sg-smu-ink.sis_research-82142022-08-04T08:44:16Z Improved generalisation bounds for deep learning through L∞ covering numbers LEDENT, Antoine LEI, Yunwen KLOFT, Marius Using proof techniques involving L∞ covering numbers, we show generalisation error bounds for deep learning with two main improvements over the state of the art. First, our bounds have no explicit dependence on the number of classes except for logarithmic factors. This holds even when formulating the bounds in terms of the L 2 norm of the weight matrices, while previous bounds exhibit at least a square-root dependence on the number of classes in this case. Second, we adapt the Rademacher analysis of DNNs to incorporate weight sharing—a task of fundamental theoretical importance which was previously attempted only under very restrictive assumptions. In our results, each convolutional filter contributes only once to the bound, regardless of how many times it is applied. Finally we provide a few further technical improvements, including improving the width dependence from before to after pooling. We also examine our bound’s behaviour on artificial data. 2019-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7211 https://ink.library.smu.edu.sg/context/sis_research/article/8214/viewcontent/85_wrshpnew.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep Learning convolutions multi-class multi-label Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Deep Learning
convolutions
multi-class
multi-label
Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Deep Learning
convolutions
multi-class
multi-label
Databases and Information Systems
Graphics and Human Computer Interfaces
LEDENT, Antoine
LEI, Yunwen
KLOFT, Marius
Improved generalisation bounds for deep learning through L∞ covering numbers
description Using proof techniques involving L∞ covering numbers, we show generalisation error bounds for deep learning with two main improvements over the state of the art. First, our bounds have no explicit dependence on the number of classes except for logarithmic factors. This holds even when formulating the bounds in terms of the L 2 norm of the weight matrices, while previous bounds exhibit at least a square-root dependence on the number of classes in this case. Second, we adapt the Rademacher analysis of DNNs to incorporate weight sharing—a task of fundamental theoretical importance which was previously attempted only under very restrictive assumptions. In our results, each convolutional filter contributes only once to the bound, regardless of how many times it is applied. Finally we provide a few further technical improvements, including improving the width dependence from before to after pooling. We also examine our bound’s behaviour on artificial data.
format text
author LEDENT, Antoine
LEI, Yunwen
KLOFT, Marius
author_facet LEDENT, Antoine
LEI, Yunwen
KLOFT, Marius
author_sort LEDENT, Antoine
title Improved generalisation bounds for deep learning through L∞ covering numbers
title_short Improved generalisation bounds for deep learning through L∞ covering numbers
title_full Improved generalisation bounds for deep learning through L∞ covering numbers
title_fullStr Improved generalisation bounds for deep learning through L∞ covering numbers
title_full_unstemmed Improved generalisation bounds for deep learning through L∞ covering numbers
title_sort improved generalisation bounds for deep learning through l∞ covering numbers
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/7211
https://ink.library.smu.edu.sg/context/sis_research/article/8214/viewcontent/85_wrshpnew.pdf
_version_ 1770576271011479552