An empirical study of memorization in NLP

A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHENG, Xiaosen, JIANG, Jing
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Natural language processing Computer Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/7705 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8708
record_format	dspace
spelling	sg-smu-ink.sis_research-87082023-01-10T03:05:39Z An empirical study of memorization in NLP ZHENG, Xiaosen JIANG, Jing A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label. 2022-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7705 info:doi/10.18653/v1/2022.acl-long.434 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Natural language processing Computer Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Natural language processing Computer Engineering
spellingShingle	Natural language processing Computer Engineering ZHENG, Xiaosen JIANG, Jing An empirical study of memorization in NLP
description	A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label.
format	text
author	ZHENG, Xiaosen JIANG, Jing
author_facet	ZHENG, Xiaosen JIANG, Jing
author_sort	ZHENG, Xiaosen
title	An empirical study of memorization in NLP
title_short	An empirical study of memorization in NLP
title_full	An empirical study of memorization in NLP
title_fullStr	An empirical study of memorization in NLP
title_full_unstemmed	An empirical study of memorization in NLP
title_sort	empirical study of memorization in nlp
publisher	Institutional Knowledge at Singapore Management University
publishDate	2022
url	https://ink.library.smu.edu.sg/sis_research/7705 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf
_version_	1770576417924317184

An empirical study of memorization in NLP

Similar Items