It's not a bug, it's a feature: Does misclassification affect bug localization?
Bug localization refers to the task of automatically processing bug reports to locate source code files that are responsible for the bugs. Many bug localization techniques have been proposed in the literature. These techniques are often evaluated on issue reports that are marked as bugs by their rep...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2014
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2606 https://ink.library.smu.edu.sg/context/sis_research/article/3606/viewcontent/msr14_misclassification.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3606 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-36062020-12-04T02:55:41Z It's not a bug, it's a feature: Does misclassification affect bug localization? KOCCHAR, Pavneet Singh LE, Tien-Duy B. LO, David Bug localization refers to the task of automatically processing bug reports to locate source code files that are responsible for the bugs. Many bug localization techniques have been proposed in the literature. These techniques are often evaluated on issue reports that are marked as bugs by their reporters in issue tracking systems. However, recent findings by Herzig et al. find that a substantial number of issue reports marked as bugs, are not bugs but other kinds of issues like refactorings, request for enhancement, documentation changes, test case creation, and so on. Herzig et al. report that these misclassifications affect bug prediction, namely the task of predicting which files are likely to be buggy in the future. In this work, we investigate whether these misclassifications also affect bug localization. To do so, we analyze issue reports that have been manually categorized by Herzig et al. and apply a bug localization technique to recover a ranked list of candidate buggy files for each issue report. We then evaluate whether the quality of ranked lists of reports reported as bugs is the same as that of real bug reports. Our findings shed light that there is a need for additional cleaning steps to be performed on issue reports before they are used to evaluate bug localization techniques. 2014-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2606 info:doi/10.1145/2597073.2597105 https://ink.library.smu.edu.sg/context/sis_research/article/3606/viewcontent/msr14_misclassification.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Computer Sciences Software Engineering |
spellingShingle |
Computer Sciences Software Engineering KOCCHAR, Pavneet Singh LE, Tien-Duy B. LO, David It's not a bug, it's a feature: Does misclassification affect bug localization? |
description |
Bug localization refers to the task of automatically processing bug reports to locate source code files that are responsible for the bugs. Many bug localization techniques have been proposed in the literature. These techniques are often evaluated on issue reports that are marked as bugs by their reporters in issue tracking systems. However, recent findings by Herzig et al. find that a substantial number of issue reports marked as bugs, are not bugs but other kinds of issues like refactorings, request for enhancement, documentation changes, test case creation, and so on. Herzig et al. report that these misclassifications affect bug prediction, namely the task of predicting which files are likely to be buggy in the future. In this work, we investigate whether these misclassifications also affect bug localization. To do so, we analyze issue reports that have been manually categorized by Herzig et al. and apply a bug localization technique to recover a ranked list of candidate buggy files for each issue report. We then evaluate whether the quality of ranked lists of reports reported as bugs is the same as that of real bug reports. Our findings shed light that there is a need for additional cleaning steps to be performed on issue reports before they are used to evaluate bug localization techniques. |
format |
text |
author |
KOCCHAR, Pavneet Singh LE, Tien-Duy B. LO, David |
author_facet |
KOCCHAR, Pavneet Singh LE, Tien-Duy B. LO, David |
author_sort |
KOCCHAR, Pavneet Singh |
title |
It's not a bug, it's a feature: Does misclassification affect bug localization? |
title_short |
It's not a bug, it's a feature: Does misclassification affect bug localization? |
title_full |
It's not a bug, it's a feature: Does misclassification affect bug localization? |
title_fullStr |
It's not a bug, it's a feature: Does misclassification affect bug localization? |
title_full_unstemmed |
It's not a bug, it's a feature: Does misclassification affect bug localization? |
title_sort |
it's not a bug, it's a feature: does misclassification affect bug localization? |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2014 |
url |
https://ink.library.smu.edu.sg/sis_research/2606 https://ink.library.smu.edu.sg/context/sis_research/article/3606/viewcontent/msr14_misclassification.pdf |
_version_ |
1770572522961502208 |