NPC: neuron path coverage via characterizing decision logic of deep neural networks

Deep learning has recently been widely applied to many applications across different domains, e.g., image classification and audio recognition. However, the quality of Deep Neural Networks (DNNs) still raises concerns in the practical operational environment, which calls for systematic testing, espe...

Full description

Saved in:
Bibliographic Details
Main Authors: Xie, Xiaofei, Li, Tianlin, Wang, Jian, Ma, Lei, Guo, Qing, Juefei-Xu, Felix, Liu, Yang
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/162501
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-162501
record_format dspace
spelling sg-ntu-dr.10356-1625012022-10-25T08:12:26Z NPC: neuron path coverage via characterizing decision logic of deep neural networks Xie, Xiaofei Li, Tianlin Wang, Jian Ma, Lei Guo, Qing Juefei-Xu, Felix Liu, Yang School of Computer Science and Engineering Engineering::Computer science and engineering Deep Learning Testing Testing Coverage Criteria Deep learning has recently been widely applied to many applications across different domains, e.g., image classification and audio recognition. However, the quality of Deep Neural Networks (DNNs) still raises concerns in the practical operational environment, which calls for systematic testing, especially in safety-critical scenarios. Inspired by software testing, a number of structural coverage criteria are designed and proposed to measure the test adequacy of DNNs. However, due to the blackbox nature of DNN, the existing structural coverage criteria are difficult to interpret, making it hard to understand the underlying principles of these criteria. The relationship between the structural coverage and the decision logic of DNNs is unknown. Moreover, recent studies have further revealed the non-existence of correlation between the structural coverage and DNN defect detection, which further posts concerns on what a suitable DNN testing criterion should be.In this article, we propose the interpretable coverage criteria through constructing the decision structure of a DNN. Mirroring the control flow graph of the traditional program, we first extract a decision graph from a DNN based on its interpretation, where a path of the decision graph represents a decision logic of the DNN. Based on the control flow and data flow of the decision graph, we propose two variants of path coverage to measure the adequacy of the test cases in exercising the decision logic. The higher the path coverage, the more diverse decision logic the DNN is expected to be explored. Our large-scale evaluation results demonstrate that: The path in the decision graph is effective in characterizing the decision of the DNN, and the proposed coverage criteria are also sensitive with errors, including natural errors and adversarial examples, and strongly correlate with the output impartiality. Ministry of Education (MOE) National Research Foundation (NRF) This research is partially supported by the National Research Foundation, Singapore under its the AI Singapore Programme (AISG2-RP-2020-019), the National Research Foundation, Prime Minister’s Office, Singapore under its National Cybersecurity R&D Program (Award No. NRF2018NCR-NCR005-0001), NRF Investigatorship NRFI06-2020-0022-0001, the National Research Foundation through its National Satellite of Excellence in Trustworthy Software Systems (NSOE-TSS) project under the National Cybersecurity R&D (NCR) Grant Award No. NRF2018NCR-NSOE003-0001, the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG-PhD/2021-01-022[T]), the Ministry of Education, Singapore under its Academic Research Fund Tier 3 (MOET32020-0004), the JSPS KAKENHI Grant Nos. JP20H04168, JP19K24348, JP19H04086, JP21H04877, and JST-Mirai Program Grant No. JPMJMI20B8, Japan. Lei Ma is also supported by Canada CIFAR AI Program and Natural Sciences and Engineering Research Council of Canada. 2022-10-25T08:12:26Z 2022-10-25T08:12:26Z 2022 Journal Article Xie, X., Li, T., Wang, J., Ma, L., Guo, Q., Juefei-Xu, F. & Liu, Y. (2022). NPC: neuron path coverage via characterizing decision logic of deep neural networks. ACM Transactions On Software Engineering and Methodology, 31(3), 1-27. https://dx.doi.org/10.1145/3490489 1049-331X https://hdl.handle.net/10356/162501 10.1145/3490489 2-s2.0-85130728492 3 31 1 27 en AISG2-RP-2020-019 NRF2018NCR-NCR005-0001 NRFI06-2020-0022-0001 NRF2018NCR-NSOE003-0001 AISG-PhD/2021-01-022[T] MOET32020-0004 ACM Transactions on Software Engineering and Methodology © 2022 Association for Computing Machinery. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Deep Learning Testing
Testing Coverage Criteria
spellingShingle Engineering::Computer science and engineering
Deep Learning Testing
Testing Coverage Criteria
Xie, Xiaofei
Li, Tianlin
Wang, Jian
Ma, Lei
Guo, Qing
Juefei-Xu, Felix
Liu, Yang
NPC: neuron path coverage via characterizing decision logic of deep neural networks
description Deep learning has recently been widely applied to many applications across different domains, e.g., image classification and audio recognition. However, the quality of Deep Neural Networks (DNNs) still raises concerns in the practical operational environment, which calls for systematic testing, especially in safety-critical scenarios. Inspired by software testing, a number of structural coverage criteria are designed and proposed to measure the test adequacy of DNNs. However, due to the blackbox nature of DNN, the existing structural coverage criteria are difficult to interpret, making it hard to understand the underlying principles of these criteria. The relationship between the structural coverage and the decision logic of DNNs is unknown. Moreover, recent studies have further revealed the non-existence of correlation between the structural coverage and DNN defect detection, which further posts concerns on what a suitable DNN testing criterion should be.In this article, we propose the interpretable coverage criteria through constructing the decision structure of a DNN. Mirroring the control flow graph of the traditional program, we first extract a decision graph from a DNN based on its interpretation, where a path of the decision graph represents a decision logic of the DNN. Based on the control flow and data flow of the decision graph, we propose two variants of path coverage to measure the adequacy of the test cases in exercising the decision logic. The higher the path coverage, the more diverse decision logic the DNN is expected to be explored. Our large-scale evaluation results demonstrate that: The path in the decision graph is effective in characterizing the decision of the DNN, and the proposed coverage criteria are also sensitive with errors, including natural errors and adversarial examples, and strongly correlate with the output impartiality.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Xie, Xiaofei
Li, Tianlin
Wang, Jian
Ma, Lei
Guo, Qing
Juefei-Xu, Felix
Liu, Yang
format Article
author Xie, Xiaofei
Li, Tianlin
Wang, Jian
Ma, Lei
Guo, Qing
Juefei-Xu, Felix
Liu, Yang
author_sort Xie, Xiaofei
title NPC: neuron path coverage via characterizing decision logic of deep neural networks
title_short NPC: neuron path coverage via characterizing decision logic of deep neural networks
title_full NPC: neuron path coverage via characterizing decision logic of deep neural networks
title_fullStr NPC: neuron path coverage via characterizing decision logic of deep neural networks
title_full_unstemmed NPC: neuron path coverage via characterizing decision logic of deep neural networks
title_sort npc: neuron path coverage via characterizing decision logic of deep neural networks
publishDate 2022
url https://hdl.handle.net/10356/162501
_version_ 1749179173910347776