NPC: neuron path coverage via characterizing decision logic of deep neural networks
Deep learning has recently been widely applied to many applications across different domains, e.g., image classification and audio recognition. However, the quality of Deep Neural Networks (DNNs) still raises concerns in the practical operational environment, which calls for systematic testing, espe...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/162501 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-162501 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1625012022-10-25T08:12:26Z NPC: neuron path coverage via characterizing decision logic of deep neural networks Xie, Xiaofei Li, Tianlin Wang, Jian Ma, Lei Guo, Qing Juefei-Xu, Felix Liu, Yang School of Computer Science and Engineering Engineering::Computer science and engineering Deep Learning Testing Testing Coverage Criteria Deep learning has recently been widely applied to many applications across different domains, e.g., image classification and audio recognition. However, the quality of Deep Neural Networks (DNNs) still raises concerns in the practical operational environment, which calls for systematic testing, especially in safety-critical scenarios. Inspired by software testing, a number of structural coverage criteria are designed and proposed to measure the test adequacy of DNNs. However, due to the blackbox nature of DNN, the existing structural coverage criteria are difficult to interpret, making it hard to understand the underlying principles of these criteria. The relationship between the structural coverage and the decision logic of DNNs is unknown. Moreover, recent studies have further revealed the non-existence of correlation between the structural coverage and DNN defect detection, which further posts concerns on what a suitable DNN testing criterion should be.In this article, we propose the interpretable coverage criteria through constructing the decision structure of a DNN. Mirroring the control flow graph of the traditional program, we first extract a decision graph from a DNN based on its interpretation, where a path of the decision graph represents a decision logic of the DNN. Based on the control flow and data flow of the decision graph, we propose two variants of path coverage to measure the adequacy of the test cases in exercising the decision logic. The higher the path coverage, the more diverse decision logic the DNN is expected to be explored. Our large-scale evaluation results demonstrate that: The path in the decision graph is effective in characterizing the decision of the DNN, and the proposed coverage criteria are also sensitive with errors, including natural errors and adversarial examples, and strongly correlate with the output impartiality. Ministry of Education (MOE) National Research Foundation (NRF) This research is partially supported by the National Research Foundation, Singapore under its the AI Singapore Programme (AISG2-RP-2020-019), the National Research Foundation, Prime Minister’s Office, Singapore under its National Cybersecurity R&D Program (Award No. NRF2018NCR-NCR005-0001), NRF Investigatorship NRFI06-2020-0022-0001, the National Research Foundation through its National Satellite of Excellence in Trustworthy Software Systems (NSOE-TSS) project under the National Cybersecurity R&D (NCR) Grant Award No. NRF2018NCR-NSOE003-0001, the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG-PhD/2021-01-022[T]), the Ministry of Education, Singapore under its Academic Research Fund Tier 3 (MOET32020-0004), the JSPS KAKENHI Grant Nos. JP20H04168, JP19K24348, JP19H04086, JP21H04877, and JST-Mirai Program Grant No. JPMJMI20B8, Japan. Lei Ma is also supported by Canada CIFAR AI Program and Natural Sciences and Engineering Research Council of Canada. 2022-10-25T08:12:26Z 2022-10-25T08:12:26Z 2022 Journal Article Xie, X., Li, T., Wang, J., Ma, L., Guo, Q., Juefei-Xu, F. & Liu, Y. (2022). NPC: neuron path coverage via characterizing decision logic of deep neural networks. ACM Transactions On Software Engineering and Methodology, 31(3), 1-27. https://dx.doi.org/10.1145/3490489 1049-331X https://hdl.handle.net/10356/162501 10.1145/3490489 2-s2.0-85130728492 3 31 1 27 en AISG2-RP-2020-019 NRF2018NCR-NCR005-0001 NRFI06-2020-0022-0001 NRF2018NCR-NSOE003-0001 AISG-PhD/2021-01-022[T] MOET32020-0004 ACM Transactions on Software Engineering and Methodology © 2022 Association for Computing Machinery. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Deep Learning Testing Testing Coverage Criteria |
spellingShingle |
Engineering::Computer science and engineering Deep Learning Testing Testing Coverage Criteria Xie, Xiaofei Li, Tianlin Wang, Jian Ma, Lei Guo, Qing Juefei-Xu, Felix Liu, Yang NPC: neuron path coverage via characterizing decision logic of deep neural networks |
description |
Deep learning has recently been widely applied to many applications across different domains, e.g., image classification and audio recognition. However, the quality of Deep Neural Networks (DNNs) still raises concerns in the practical operational environment, which calls for systematic testing, especially in safety-critical scenarios. Inspired by software testing, a number of structural coverage criteria are designed and proposed to measure the test adequacy of DNNs. However, due to the blackbox nature of DNN, the existing structural coverage criteria are difficult to interpret, making it hard to understand the underlying principles of these criteria. The relationship between the structural coverage and the decision logic of DNNs is unknown. Moreover, recent studies have further revealed the non-existence of correlation between the structural coverage and DNN defect detection, which further posts concerns on what a suitable DNN testing criterion should be.In this article, we propose the interpretable coverage criteria through constructing the decision structure of a DNN. Mirroring the control flow graph of the traditional program, we first extract a decision graph from a DNN based on its interpretation, where a path of the decision graph represents a decision logic of the DNN. Based on the control flow and data flow of the decision graph, we propose two variants of path coverage to measure the adequacy of the test cases in exercising the decision logic. The higher the path coverage, the more diverse decision logic the DNN is expected to be explored. Our large-scale evaluation results demonstrate that: The path in the decision graph is effective in characterizing the decision of the DNN, and the proposed coverage criteria are also sensitive with errors, including natural errors and adversarial examples, and strongly correlate with the output impartiality. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Xie, Xiaofei Li, Tianlin Wang, Jian Ma, Lei Guo, Qing Juefei-Xu, Felix Liu, Yang |
format |
Article |
author |
Xie, Xiaofei Li, Tianlin Wang, Jian Ma, Lei Guo, Qing Juefei-Xu, Felix Liu, Yang |
author_sort |
Xie, Xiaofei |
title |
NPC: neuron path coverage via characterizing decision logic of deep neural networks |
title_short |
NPC: neuron path coverage via characterizing decision logic of deep neural networks |
title_full |
NPC: neuron path coverage via characterizing decision logic of deep neural networks |
title_fullStr |
NPC: neuron path coverage via characterizing decision logic of deep neural networks |
title_full_unstemmed |
NPC: neuron path coverage via characterizing decision logic of deep neural networks |
title_sort |
npc: neuron path coverage via characterizing decision logic of deep neural networks |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/162501 |
_version_ |
1749179173910347776 |