Diversity Maximization Speedup for Fault Localization

Fault localization is useful for reducing debugging effort. However, many fault localization techniques require a non-trivial number of test cases with oracles, which can determine whether a program behaves correctly for every test input. Test oracle creation is expensive because it can take much manu...

Bibliographic Details
Main Authors: Gong, Liang; LO, David; JIANG, Lingxiao; Zhang, Hongyu
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2012
Subjects: Algorithms; Experimentation; Reliability; Computer Sciences; Software Engineering
Online Access: https://ink.library.smu.edu.sg/sis_research/1569
http://dl.acm.org/citation.cfm?id=2351682
Institution: Singapore Management University
id sg-smu-ink.sis_research-2568
record_format dspace
spelling sg-smu-ink.sis_research-2568 2012-09-24T03:48:42Z
2012-09-01T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/1569 info:doi/10.1145/2351676.2351682 http://dl.acm.org/citation.cfm?id=2351682 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Algorithms Experimentation Reliability Computer Sciences Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Algorithms
Experimentation
Reliability
Computer Sciences
Software Engineering
description Fault localization is useful for reducing debugging effort. However, many fault localization techniques require a non-trivial number of test cases with oracles, which can determine whether a program behaves correctly for every test input. Test oracle creation is expensive because it can take considerable manual labeling effort. Given a number of test cases to be executed, it is challenging to minimize the number of test cases that require manual labeling while still achieving good fault localization accuracy. To address this challenge, this paper presents a novel test case selection strategy based on Diversity Maximization Speedup (DMS). DMS orders a set of unlabeled test cases in a way that maximizes the effectiveness of a fault localization technique. Developers are expected to label only a much smaller number of test cases along this ordering to achieve good fault localization results. Our experiments with more than 250 bugs from the Software-artifact Infrastructure Repository show (1) that DMS can help existing fault localization techniques achieve comparable accuracy with, on average, 67% fewer labeled test cases than the previously best test case prioritization techniques, and (2) that, given a labeling budget (i.e., a fixed number of labeled test cases), DMS can help existing fault localization techniques reduce their debugging cost (in terms of the amount of code that must be inspected to locate faults). We conduct hypothesis tests and show that the savings in debugging cost we achieve for the real C programs are statistically significant.
format text
author Gong, Liang
LO, David
JIANG, Lingxiao
Zhang, Hongyu
title Diversity Maximization Speedup for Fault Localization
publisher Institutional Knowledge at Singapore Management University
publishDate 2012
url https://ink.library.smu.edu.sg/sis_research/1569
http://dl.acm.org/citation.cfm?id=2351682
_version_ 1770571304033845248