RNNRepair: Automatic RNN Repair via model-based analysis

Deep neural networks are vulnerable to adversarial attacks. Due to their black-box nature, it is rather challenging to interpret and properly repair these incorrect behaviors. This paper focuses on interpreting and repairing the incorrect behaviors of Recurrent Neural Networks (RNNs). We propose a l...

Full description

Saved in:
Bibliographic Details
Main Authors: XIE, Xiaofei, GUO, Wenbo, MA, Lei, LE, Wei, WANG, Jian, ZHOU, Lingjun, LIU, Yang, XING, Xinyu
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6938
https://ink.library.smu.edu.sg/context/sis_research/article/7941/viewcontent/xie21b_pvoa.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7941
record_format dspace
spelling sg-smu-ink.sis_research-79412022-03-04T09:14:15Z RNNRepair: Automatic RNN Repair via model-based analysis XIE, Xiaofei GUO, Wenbo MA, Lei LE, Wei WANG, Jian ZHOU, Lingjun LIU, Yang XING, Xinyu Deep neural networks are vulnerable to adversarial attacks. Due to their black-box nature, it is rather challenging to interpret and properly repair these incorrect behaviors. This paper focuses on interpreting and repairing the incorrect behaviors of Recurrent Neural Networks (RNNs). We propose a lightweight model-based approach (RNNRepair) to help understand and repair incorrect behaviors of an RNN. Specifically, we build an influence model to characterize the stateful and statistical behaviors of an RNN over all the training data and to perform the influence analysis for the errors. Compared with the existing techniques on influence function, our method can efficiently estimate the influence of existing or newly added training samples for a given prediction at both sample level and segmentation level. Our empirical evaluation shows that the proposed influence model is able to extract accurate and understandable features. Based on the influence model, our proposed technique could effectively infer the influential instances from not only an entire testing sequence but also a segment within that sequence. Moreover, with the sample-level and segment-level influence relations, RNNRepair could further remediate two types of incorrect predictions at the sample level and segment level. 2021-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6938 https://ink.library.smu.edu.sg/context/sis_research/article/7941/viewcontent/xie21b_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Information Security Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Information Security
Software Engineering
spellingShingle Information Security
Software Engineering
XIE, Xiaofei
GUO, Wenbo
MA, Lei
LE, Wei
WANG, Jian
ZHOU, Lingjun
LIU, Yang
XING, Xinyu
RNNRepair: Automatic RNN Repair via model-based analysis
description Deep neural networks are vulnerable to adversarial attacks. Due to their black-box nature, it is rather challenging to interpret and properly repair these incorrect behaviors. This paper focuses on interpreting and repairing the incorrect behaviors of Recurrent Neural Networks (RNNs). We propose a lightweight model-based approach (RNNRepair) to help understand and repair incorrect behaviors of an RNN. Specifically, we build an influence model to characterize the stateful and statistical behaviors of an RNN over all the training data and to perform the influence analysis for the errors. Compared with the existing techniques on influence function, our method can efficiently estimate the influence of existing or newly added training samples for a given prediction at both sample level and segmentation level. Our empirical evaluation shows that the proposed influence model is able to extract accurate and understandable features. Based on the influence model, our proposed technique could effectively infer the influential instances from not only an entire testing sequence but also a segment within that sequence. Moreover, with the sample-level and segment-level influence relations, RNNRepair could further remediate two types of incorrect predictions at the sample level and segment level.
format text
author XIE, Xiaofei
GUO, Wenbo
MA, Lei
LE, Wei
WANG, Jian
ZHOU, Lingjun
LIU, Yang
XING, Xinyu
author_facet XIE, Xiaofei
GUO, Wenbo
MA, Lei
LE, Wei
WANG, Jian
ZHOU, Lingjun
LIU, Yang
XING, Xinyu
author_sort XIE, Xiaofei
title RNNRepair: Automatic RNN Repair via model-based analysis
title_short RNNRepair: Automatic RNN Repair via model-based analysis
title_full RNNRepair: Automatic RNN Repair via model-based analysis
title_fullStr RNNRepair: Automatic RNN Repair via model-based analysis
title_full_unstemmed RNNRepair: Automatic RNN Repair via model-based analysis
title_sort rnnrepair: automatic rnn repair via model-based analysis
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6938
https://ink.library.smu.edu.sg/context/sis_research/article/7941/viewcontent/xie21b_pvoa.pdf
_version_ 1770576163348938752