Towards more precise coincidental correctness detection with Deep Semantic Learning

Coincidental correctness (CC) is a situation during the execution of a test case, the buggy entity is executed, but the program behaves correctly as expected. Many automated fault localization (FL) techniques use runtime information to discover the underlying connection between the executed buggy en...

Full description

Saved in:
Bibliographic Details
Main Authors: XIE, Huan, LEI, Yan, YAN, Meng, LI, Shanshan, MAO, Xiaoguang, YU, Yue, LO, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9920
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10920
record_format dspace
spelling sg-smu-ink.sis_research-109202025-01-02T08:03:58Z Towards more precise coincidental correctness detection with Deep Semantic Learning XIE, Huan LEI, Yan YAN, Meng LI, Shanshan MAO, Xiaoguang YU, Yue LO, David Coincidental correctness (CC) is a situation during the execution of a test case, the buggy entity is executed, but the program behaves correctly as expected. Many automated fault localization (FL) techniques use runtime information to discover the underlying connection between the executed buggy entity and the failing test result. The existence of CC will weaken such connection, mislead the FL algorithms to build inaccurate models, and consequently, decrease the localization accuracy. To alleviate the adverse effect of CC on FL, CC detection techniques have been proposed to identify the possible CC tests via heuristic or machine learning algorithms. However, their performance on precision is not satisfactory since they overestimate the possible CC tests and are insufficient in learning the deep semantic features. In this work, we propose a novel Triplet network-based Coincidental Correctness detection technique (i.e., TriCoCo) to overcome the limitations of the prior works. TriCoCo narrows the possible CC tests by designing three features to identify genuine passing tests. Instead of using all tests as inputs by existing techniques, TriCoCo takes the identified genuine passing tests and failing ones to train a triplet model that can evaluate their relative distance. Finally, TriCoCo infers the probability of being a CC test of the test in the rest of the passing tests by using the trained triplet model. We conduct large-scale experiments to evaluate TriCoCo based on the widely-used Defects4J benchmark. The results demonstrate that TriCoCo can improve not only the precision of CC detection but also the effectiveness of FL techniques, e.g., the precision of TriCoCo is 80.33% on average, and TriCoCo boosts the efficacy of DStar by 18% - 74% in terms of MFR metric when compared to seven state-of-the-art CC detection baselines. 2024-12-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/9920 info:doi/10.1109/TSE.2024.3481893 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Coincidental correctness Deep Semantic Learning Fault localization Feature extraction Machine learning algorithms Location awareness Deep learning Clustering algorithms Prediction algorithms Benchmark testing Partitioning algorithms Artificial Intelligence and Robotics
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Coincidental correctness
Deep Semantic Learning
Fault localization
Feature extraction
Machine learning algorithms
Location awareness
Deep learning
Clustering algorithms
Prediction algorithms
Benchmark testing
Partitioning algorithms
Artificial Intelligence and Robotics
spellingShingle Coincidental correctness
Deep Semantic Learning
Fault localization
Feature extraction
Machine learning algorithms
Location awareness
Deep learning
Clustering algorithms
Prediction algorithms
Benchmark testing
Partitioning algorithms
Artificial Intelligence and Robotics
XIE, Huan
LEI, Yan
YAN, Meng
LI, Shanshan
MAO, Xiaoguang
YU, Yue
LO, David
Towards more precise coincidental correctness detection with Deep Semantic Learning
description Coincidental correctness (CC) is a situation during the execution of a test case, the buggy entity is executed, but the program behaves correctly as expected. Many automated fault localization (FL) techniques use runtime information to discover the underlying connection between the executed buggy entity and the failing test result. The existence of CC will weaken such connection, mislead the FL algorithms to build inaccurate models, and consequently, decrease the localization accuracy. To alleviate the adverse effect of CC on FL, CC detection techniques have been proposed to identify the possible CC tests via heuristic or machine learning algorithms. However, their performance on precision is not satisfactory since they overestimate the possible CC tests and are insufficient in learning the deep semantic features. In this work, we propose a novel Triplet network-based Coincidental Correctness detection technique (i.e., TriCoCo) to overcome the limitations of the prior works. TriCoCo narrows the possible CC tests by designing three features to identify genuine passing tests. Instead of using all tests as inputs by existing techniques, TriCoCo takes the identified genuine passing tests and failing ones to train a triplet model that can evaluate their relative distance. Finally, TriCoCo infers the probability of being a CC test of the test in the rest of the passing tests by using the trained triplet model. We conduct large-scale experiments to evaluate TriCoCo based on the widely-used Defects4J benchmark. The results demonstrate that TriCoCo can improve not only the precision of CC detection but also the effectiveness of FL techniques, e.g., the precision of TriCoCo is 80.33% on average, and TriCoCo boosts the efficacy of DStar by 18% - 74% in terms of MFR metric when compared to seven state-of-the-art CC detection baselines.
format text
author XIE, Huan
LEI, Yan
YAN, Meng
LI, Shanshan
MAO, Xiaoguang
YU, Yue
LO, David
author_facet XIE, Huan
LEI, Yan
YAN, Meng
LI, Shanshan
MAO, Xiaoguang
YU, Yue
LO, David
author_sort XIE, Huan
title Towards more precise coincidental correctness detection with Deep Semantic Learning
title_short Towards more precise coincidental correctness detection with Deep Semantic Learning
title_full Towards more precise coincidental correctness detection with Deep Semantic Learning
title_fullStr Towards more precise coincidental correctness detection with Deep Semantic Learning
title_full_unstemmed Towards more precise coincidental correctness detection with Deep Semantic Learning
title_sort towards more precise coincidental correctness detection with deep semantic learning
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9920
_version_ 1821237285781241856