An empirical study of memorization in NLP

A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHENG, Xiaosen, JIANG, Jing
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7705
https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8708
record_format dspace
spelling sg-smu-ink.sis_research-87082023-01-10T03:05:39Z An empirical study of memorization in NLP ZHENG, Xiaosen JIANG, Jing A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label. 2022-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7705 info:doi/10.18653/v1/2022.acl-long.434 https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Natural language processing Computer Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Natural language processing
Computer Engineering
spellingShingle Natural language processing
Computer Engineering
ZHENG, Xiaosen
JIANG, Jing
An empirical study of memorization in NLP
description A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label.
format text
author ZHENG, Xiaosen
JIANG, Jing
author_facet ZHENG, Xiaosen
JIANG, Jing
author_sort ZHENG, Xiaosen
title An empirical study of memorization in NLP
title_short An empirical study of memorization in NLP
title_full An empirical study of memorization in NLP
title_fullStr An empirical study of memorization in NLP
title_full_unstemmed An empirical study of memorization in NLP
title_sort empirical study of memorization in nlp
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7705
https://ink.library.smu.edu.sg/context/sis_research/article/8708/viewcontent/2022.acl_long.434.pdf
_version_ 1770576417924317184