Active refinement of clone anomaly reports

Software clones have been widely studied in the recent literature and shown to be useful for finding bugs, because inconsistent changes among clones in a clone group may indicate potential bugs. However, many inconsistent clone groups are false positives rather than real bugs (true positives). The excessive number of false positi...

Bibliographic Details
Main Authors: LUCIA; LO, David; JIANG, Lingxiao; BUDI, Aditya
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2012
Subjects: Software Engineering
Online Access: https://ink.library.smu.edu.sg/sis_research/1530
https://ink.library.smu.edu.sg/context/sis_research/article/2529/viewcontent/icse12_ActiveRefinementCloneAnomaly.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-2529
record_format dspace
spelling sg-smu-ink.sis_research-2529 2017-02-04T22:38:27Z 2012-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1530 info:doi/10.1109/ICSE.2012.6227175 https://ink.library.smu.edu.sg/context/sis_research/article/2529/viewcontent/icse12_ActiveRefinementCloneAnomaly.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Software Engineering
description Software clones have been widely studied in the recent literature and shown to be useful for finding bugs, because inconsistent changes among clones in a clone group may indicate potential bugs. However, many inconsistent clone groups are false positives rather than real bugs (true positives). The excessive number of false positives could easily impede broad adoption of clone-based bug detection approaches. In this work, we aim to improve the usability of clone-based bug detection tools by increasing the rate of true positives found when a developer analyzes anomaly reports. Our idea is to control the number of anomaly reports a user can see at a time and actively incorporate incremental user feedback to continually refine the anomaly reports. Our system first presents the top few anomaly reports from the list of reports generated by a tool in its default ordering. Users then either accept or reject each of the reports. Based on the feedback, our system automatically and iteratively refines a classification model for anomalies and re-sorts the rest of the reports. Our goal is to present the true positives to the users earlier than the default ordering would. The rationale is our observation that false positives among the inconsistent clone groups could share common features (in terms of code structure, programming patterns, etc.), and these features can be learned from the incremental user feedback. We evaluate our refinement process on three sets of clone-based anomaly reports from three large real programs: the Linux kernel (C), Eclipse, and ArgoUML (Java), extracted by a clone-based anomaly detection tool. The results show that, compared to the original ordering of the anomaly reports, we can improve the rate of true positives found (i.e., true positives are found faster) by 11%, 87%, and 86% for the Linux kernel, Eclipse, and ArgoUML, respectively.
format text
author LUCIA,
LO, David
JIANG, Lingxiao
BUDI, Aditya
title Active refinement of clone anomaly reports
publisher Institutional Knowledge at Singapore Management University
publishDate 2012
url https://ink.library.smu.edu.sg/sis_research/1530
https://ink.library.smu.edu.sg/context/sis_research/article/2529/viewcontent/icse12_ActiveRefinementCloneAnomaly.pdf
_version_ 1770571257552568320
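
The feedback loop summarized in the description (show a few reports, collect accept/reject decisions, refine a model, re-sort the remainder) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which classifier or features the paper uses, so the `Report` type, the feature names, the per-feature voting score, and the `refine` and `ask_user` names are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, List, Tuple


@dataclass(frozen=True)
class Report:
    report_id: str
    features: FrozenSet[str]  # hypothetical feature names, e.g. code patterns


def score(report: Report, labeled: List[Tuple[Report, bool]]) -> float:
    """Stand-in classifier: average smoothed acceptance rate of the report's features."""
    if not report.features:
        return 0.5  # no evidence either way
    total = 0.0
    for feature in report.features:
        accepted = sum(1 for r, ok in labeled if ok and feature in r.features)
        rejected = sum(1 for r, ok in labeled if not ok and feature in r.features)
        total += (accepted + 1) / (accepted + rejected + 2)  # Laplace smoothing
    return total / len(report.features)


def refine(
    reports: List[Report],
    ask_user: Callable[[Report], bool],
    batch_size: int = 2,
) -> List[str]:
    """Return report ids in the order the user ends up seeing them."""
    pending = list(reports)  # starts in the tool's default ordering
    labeled: List[Tuple[Report, bool]] = []
    shown: List[str] = []
    while pending:
        # Show only the top few reports and record accept/reject feedback.
        batch, pending = pending[:batch_size], pending[batch_size:]
        for report in batch:
            labeled.append((report, ask_user(report)))
            shown.append(report.report_id)
        # Re-sort the remaining reports; the stable sort keeps the default order on ties.
        pending.sort(key=lambda r: score(r, labeled), reverse=True)
    return shown


# Toy data (invented): three false positives share one feature, three real bugs another.
REPORTS = [
    Report("r1", frozenset({"logging_difference"})),
    Report("r2", frozenset({"missing_null_check"})),
    Report("r3", frozenset({"logging_difference"})),
    Report("r4", frozenset({"logging_difference"})),
    Report("r5", frozenset({"missing_null_check"})),
    Report("r6", frozenset({"missing_null_check"})),
]
TRUE_BUGS = {"r2", "r5", "r6"}

if __name__ == "__main__":
    print(refine(REPORTS, lambda r: r.report_id in TRUE_BUGS))
```

On this toy data, the default ordering reaches the last real bug at position 6, while the refined ordering (r1, r2, r5, r6, r3, r4) reaches it at position 4, because the first round of feedback already separates the two invented features. A real system would replace `score` with a trained classification model and `ask_user` with an interactive prompt.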