Neuron semantic-guided test generation for deep neural networks fuzzing

In recent years, significant progress has been made in testing methods for deep neural networks (DNNs) to ensure their correctness and robustness. Coverage-guided criteria, such as neuron-wise, layer-wise, and path-/trace-wise, have been proposed for DNN fuzzing. However, existing coverage-based cri...

Full description

Saved in:

Bibliographic Details
Main Authors:	HUANG, Li, SUN, Weifeng, YAN, Meng, LIU, Zhongxin, LEI, Yan, LO, David
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2025
Subjects:	Deep learning testing fuzzing test input generation Software Engineering Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/10121 https://ink.library.smu.edu.sg/context/sis_research/article/11121/viewcontent/Neuron_Semantic_av.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-11121
record_format	dspace
spelling	sg-smu-ink.sis_research-111212025-03-10T04:11:37Z Neuron semantic-guided test generation for deep neural networks fuzzing HUANG, Li SUN, Weifeng YAN, Meng LIU, Zhongxin LEI, Yan LO, David In recent years, significant progress has been made in testing methods for deep neural networks (DNNs) to ensure their correctness and robustness. Coverage-guided criteria, such as neuron-wise, layer-wise, and path-/trace-wise, have been proposed for DNN fuzzing. However, existing coverage-based criteria encounter performance bottlenecks for several reasons: Testing Adequacy: Partial neural coverage criteria have been observed to achieve full coverage using only a small number of test inputs. In this case, increasing the number of test inputs does not consistently improve the quality of models. Interpretability: The current coverage criteria lack interpretability. Consequently, testers are unable to identify and understand which incorrect attributes or patterns of the model are triggered by the test inputs. This lack of interpretability hampers the subsequent debugging and fixing process. Therefore, there is an urgent need for a novel fuzzing criterion that offers improved testing adequacy, better interpretability, and more effective failure detection capabilities for DNNs.To alleviate these limitations, we propose NSGen, an approach for DNN fuzzing that utilizes neuron semantics as guidance during test generation. NSGen identifies critical neurons, translates their high-level semantic features into natural language descriptions, and then assembles them into human-readable DNN decision paths (representing the internal decision of the DNN). With these decision paths, we can generate more fault-revealing test inputs by quantifying the similarity between original test inputs and mutated test inputs for fuzzing. We evaluate NSGen on popular DNN models (VGG16_BN, ResNet50, and MobileNet_v2) using CIFAR10, CIFAR100, Oxford 102 Flower, and ImageNet datasets. Compared to 12 existing coverage-guided fuzzing criteria, NSGen outperforms all baselines, increasing the number of triggered faults by 21.4% to 61.2% compared to the state-of-the-art coverage-guided fuzzing criterion. This demonstrates NSGen's effectiveness in generating fault-revealing test inputs through guided input mutation, highlighting its potential to enhance DNN testing and interpretability. 2025-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/10121 info:doi/10.1145/3688835 https://ink.library.smu.edu.sg/context/sis_research/article/11121/viewcontent/Neuron_Semantic_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep learning testing fuzzing test input generation Software Engineering Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Deep learning testing fuzzing test input generation Software Engineering Theory and Algorithms
spellingShingle	Deep learning testing fuzzing test input generation Software Engineering Theory and Algorithms HUANG, Li SUN, Weifeng YAN, Meng LIU, Zhongxin LEI, Yan LO, David Neuron semantic-guided test generation for deep neural networks fuzzing
description	In recent years, significant progress has been made in testing methods for deep neural networks (DNNs) to ensure their correctness and robustness. Coverage-guided criteria, such as neuron-wise, layer-wise, and path-/trace-wise, have been proposed for DNN fuzzing. However, existing coverage-based criteria encounter performance bottlenecks for several reasons: Testing Adequacy: Partial neural coverage criteria have been observed to achieve full coverage using only a small number of test inputs. In this case, increasing the number of test inputs does not consistently improve the quality of models. Interpretability: The current coverage criteria lack interpretability. Consequently, testers are unable to identify and understand which incorrect attributes or patterns of the model are triggered by the test inputs. This lack of interpretability hampers the subsequent debugging and fixing process. Therefore, there is an urgent need for a novel fuzzing criterion that offers improved testing adequacy, better interpretability, and more effective failure detection capabilities for DNNs.To alleviate these limitations, we propose NSGen, an approach for DNN fuzzing that utilizes neuron semantics as guidance during test generation. NSGen identifies critical neurons, translates their high-level semantic features into natural language descriptions, and then assembles them into human-readable DNN decision paths (representing the internal decision of the DNN). With these decision paths, we can generate more fault-revealing test inputs by quantifying the similarity between original test inputs and mutated test inputs for fuzzing. We evaluate NSGen on popular DNN models (VGG16_BN, ResNet50, and MobileNet_v2) using CIFAR10, CIFAR100, Oxford 102 Flower, and ImageNet datasets. Compared to 12 existing coverage-guided fuzzing criteria, NSGen outperforms all baselines, increasing the number of triggered faults by 21.4% to 61.2% compared to the state-of-the-art coverage-guided fuzzing criterion. This demonstrates NSGen's effectiveness in generating fault-revealing test inputs through guided input mutation, highlighting its potential to enhance DNN testing and interpretability.
format	text
author	HUANG, Li SUN, Weifeng YAN, Meng LIU, Zhongxin LEI, Yan LO, David
author_facet	HUANG, Li SUN, Weifeng YAN, Meng LIU, Zhongxin LEI, Yan LO, David
author_sort	HUANG, Li
title	Neuron semantic-guided test generation for deep neural networks fuzzing
title_short	Neuron semantic-guided test generation for deep neural networks fuzzing
title_full	Neuron semantic-guided test generation for deep neural networks fuzzing
title_fullStr	Neuron semantic-guided test generation for deep neural networks fuzzing
title_full_unstemmed	Neuron semantic-guided test generation for deep neural networks fuzzing
title_sort	neuron semantic-guided test generation for deep neural networks fuzzing
publisher	Institutional Knowledge at Singapore Management University
publishDate	2025
url	https://ink.library.smu.edu.sg/sis_research/10121 https://ink.library.smu.edu.sg/context/sis_research/article/11121/viewcontent/Neuron_Semantic_av.pdf
_version_	1827070822423986176

Neuron semantic-guided test generation for deep neural networks fuzzing

Similar Items