QuoTe: Quality-oriented Testing for deep learning systems

Recently, there has been a significant growth of interest in applying software engineering techniques for the quality assurance of deep learning (DL) systems. One popular direction is deep learning testing, i.e., given a property of test, defects of DL systems are found either by fuzzing or guided s...

Full description

Saved in:
Bibliographic Details
Main Authors: CHEN, Jialuo, WANG, Jingyi, MA, Xingjun, SUN, Youcheng, SUN, Jun, ZHANG, Peixin, CHENG, Peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7785
https://ink.library.smu.edu.sg/context/sis_research/article/8788/viewcontent/3582573_justAccepted.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8788
record_format dspace
spelling sg-smu-ink.sis_research-87882023-09-05T02:14:43Z QuoTe: Quality-oriented Testing for deep learning systems CHEN, Jialuo WANG, Jingyi MA, Xingjun SUN, Youcheng SUN, Jun ZHANG, Peixin CHENG, Peng Recently, there has been a significant growth of interest in applying software engineering techniques for the quality assurance of deep learning (DL) systems. One popular direction is deep learning testing, i.e., given a property of test, defects of DL systems are found either by fuzzing or guided search with the help of certain testing metrics. However, recent studies have revealed that the neuron coverage metrics, commonly used by most existing DL testing approaches, are not necessarily correlated with model quality (e.g., robustness, the most studied model property), and are also not an effective measurement on the confidence of the model quality after testing. In this work, we address this gap by proposing a novel testing framework called QuoTe (i.e., Quality-oriented Testing). A key part of QuoTe is a quantitative measurement on 1) the value of each test case in enhancing the model property of interest (often via retraining), and 2) the convergence quality of the model property improvement. QuoTe utilizes the proposed metric to automatically select or generate valuable test cases for improving model quality. The proposed metric is also a lightweight yet strong indicator of how well the improvement converged. Extensive experiments on both image and tabular datasets with a variety of model architectures confirm the effectiveness and efficiency of QuoTe in improving DL model quality, i.e., robustness and fairness. As a generic quality-oriented testing framework, future adaptions can be made to other domains (e.g., text) as well as other model properties. 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7785 info:doi/10.1145/3582573 https://ink.library.smu.edu.sg/context/sis_research/article/8788/viewcontent/3582573_justAccepted.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep learning testing robustness fairness software debugging Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Deep learning
testing
robustness
fairness
software debugging
Software Engineering
spellingShingle Deep learning
testing
robustness
fairness
software debugging
Software Engineering
CHEN, Jialuo
WANG, Jingyi
MA, Xingjun
SUN, Youcheng
SUN, Jun
ZHANG, Peixin
CHENG, Peng
QuoTe: Quality-oriented Testing for deep learning systems
description Recently, there has been a significant growth of interest in applying software engineering techniques for the quality assurance of deep learning (DL) systems. One popular direction is deep learning testing, i.e., given a property of test, defects of DL systems are found either by fuzzing or guided search with the help of certain testing metrics. However, recent studies have revealed that the neuron coverage metrics, commonly used by most existing DL testing approaches, are not necessarily correlated with model quality (e.g., robustness, the most studied model property), and are also not an effective measurement on the confidence of the model quality after testing. In this work, we address this gap by proposing a novel testing framework called QuoTe (i.e., Quality-oriented Testing). A key part of QuoTe is a quantitative measurement on 1) the value of each test case in enhancing the model property of interest (often via retraining), and 2) the convergence quality of the model property improvement. QuoTe utilizes the proposed metric to automatically select or generate valuable test cases for improving model quality. The proposed metric is also a lightweight yet strong indicator of how well the improvement converged. Extensive experiments on both image and tabular datasets with a variety of model architectures confirm the effectiveness and efficiency of QuoTe in improving DL model quality, i.e., robustness and fairness. As a generic quality-oriented testing framework, future adaptions can be made to other domains (e.g., text) as well as other model properties.
format text
author CHEN, Jialuo
WANG, Jingyi
MA, Xingjun
SUN, Youcheng
SUN, Jun
ZHANG, Peixin
CHENG, Peng
author_facet CHEN, Jialuo
WANG, Jingyi
MA, Xingjun
SUN, Youcheng
SUN, Jun
ZHANG, Peixin
CHENG, Peng
author_sort CHEN, Jialuo
title QuoTe: Quality-oriented Testing for deep learning systems
title_short QuoTe: Quality-oriented Testing for deep learning systems
title_full QuoTe: Quality-oriented Testing for deep learning systems
title_fullStr QuoTe: Quality-oriented Testing for deep learning systems
title_full_unstemmed QuoTe: Quality-oriented Testing for deep learning systems
title_sort quote: quality-oriented testing for deep learning systems
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7785
https://ink.library.smu.edu.sg/context/sis_research/article/8788/viewcontent/3582573_justAccepted.pdf
_version_ 1779157088383008768