An empirical study of memorization in NLP
A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7705 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8708 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-87082023-01-10T03:05:39Z An empirical study of memorization in NLP ZHENG, Xiaosen JIANG, Jing A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label. 2022-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7705 info:doi/10.18653/v1/2022.acl-long.434 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Natural language processing Computer Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Natural language processing Computer Engineering |
spellingShingle |
Natural language processing Computer Engineering ZHENG, Xiaosen JIANG, Jing An empirical study of memorization in NLP |
description |
A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label. |
format |
text |
author |
ZHENG, Xiaosen JIANG, Jing |
author_facet |
ZHENG, Xiaosen JIANG, Jing |
author_sort |
ZHENG, Xiaosen |
title |
An empirical study of memorization in NLP |
title_short |
An empirical study of memorization in NLP |
title_full |
An empirical study of memorization in NLP |
title_fullStr |
An empirical study of memorization in NLP |
title_full_unstemmed |
An empirical study of memorization in NLP |
title_sort |
empirical study of memorization in nlp |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7705 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf |
_version_ |
1770576417924317184 |