RNNRepair: Automatic RNN Repair via model-based analysis
Deep neural networks are vulnerable to adversarial attacks. Due to their black-box nature, it is rather challenging to interpret and properly repair these incorrect behaviors. This paper focuses on interpreting and repairing the incorrect behaviors of Recurrent Neural Networks (RNNs). We propose a l...
Saved in:
Main Authors: | , , , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6938 https://ink.library.smu.edu.sg/context/sis_research/article/7941/viewcontent/xie21b_pvoa.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7941 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-79412022-03-04T09:14:15Z RNNRepair: Automatic RNN Repair via model-based analysis XIE, Xiaofei GUO, Wenbo MA, Lei LE, Wei WANG, Jian ZHOU, Lingjun LIU, Yang XING, Xinyu Deep neural networks are vulnerable to adversarial attacks. Due to their black-box nature, it is rather challenging to interpret and properly repair these incorrect behaviors. This paper focuses on interpreting and repairing the incorrect behaviors of Recurrent Neural Networks (RNNs). We propose a lightweight model-based approach (RNNRepair) to help understand and repair incorrect behaviors of an RNN. Specifically, we build an influence model to characterize the stateful and statistical behaviors of an RNN over all the training data and to perform the influence analysis for the errors. Compared with the existing techniques on influence function, our method can efficiently estimate the influence of existing or newly added training samples for a given prediction at both sample level and segmentation level. Our empirical evaluation shows that the proposed influence model is able to extract accurate and understandable features. Based on the influence model, our proposed technique could effectively infer the influential instances from not only an entire testing sequence but also a segment within that sequence. Moreover, with the sample-level and segment-level influence relations, RNNRepair could further remediate two types of incorrect predictions at the sample level and segment level. 2021-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6938 https://ink.library.smu.edu.sg/context/sis_research/article/7941/viewcontent/xie21b_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Information Security Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Information Security Software Engineering |
spellingShingle |
Information Security Software Engineering XIE, Xiaofei GUO, Wenbo MA, Lei LE, Wei WANG, Jian ZHOU, Lingjun LIU, Yang XING, Xinyu RNNRepair: Automatic RNN Repair via model-based analysis |
description |
Deep neural networks are vulnerable to adversarial attacks. Due to their black-box nature, it is rather challenging to interpret and properly repair these incorrect behaviors. This paper focuses on interpreting and repairing the incorrect behaviors of Recurrent Neural Networks (RNNs). We propose a lightweight model-based approach (RNNRepair) to help understand and repair incorrect behaviors of an RNN. Specifically, we build an influence model to characterize the stateful and statistical behaviors of an RNN over all the training data and to perform the influence analysis for the errors. Compared with the existing techniques on influence function, our method can efficiently estimate the influence of existing or newly added training samples for a given prediction at both sample level and segmentation level. Our empirical evaluation shows that the proposed influence model is able to extract accurate and understandable features. Based on the influence model, our proposed technique could effectively infer the influential instances from not only an entire testing sequence but also a segment within that sequence. Moreover, with the sample-level and segment-level influence relations, RNNRepair could further remediate two types of incorrect predictions at the sample level and segment level. |
format |
text |
author |
XIE, Xiaofei GUO, Wenbo MA, Lei LE, Wei WANG, Jian ZHOU, Lingjun LIU, Yang XING, Xinyu |
author_facet |
XIE, Xiaofei GUO, Wenbo MA, Lei LE, Wei WANG, Jian ZHOU, Lingjun LIU, Yang XING, Xinyu |
author_sort |
XIE, Xiaofei |
title |
RNNRepair: Automatic RNN Repair via model-based analysis |
title_short |
RNNRepair: Automatic RNN Repair via model-based analysis |
title_full |
RNNRepair: Automatic RNN Repair via model-based analysis |
title_fullStr |
RNNRepair: Automatic RNN Repair via model-based analysis |
title_full_unstemmed |
RNNRepair: Automatic RNN Repair via model-based analysis |
title_sort |
rnnrepair: automatic rnn repair via model-based analysis |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6938 https://ink.library.smu.edu.sg/context/sis_research/article/7941/viewcontent/xie21b_pvoa.pdf |
_version_ |
1770576163348938752 |