Automatic, high accuracy prediction of reopened bugs

Bug fixing is one of the most time-consuming and costly activities of the software development life cycle. In general, bugs are reported in a bug tracking system, validated by a triage team, assigned for someone to fix, and finally verified and closed. However, in some cases bugs have to be reopened...

Full description

Saved in:
Bibliographic Details
Main Authors: Xia, Xin, LO, David, Shihab, Emad, Wang, Xinyu, Zhou, Bo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2436
https://ink.library.smu.edu.sg/context/sis_research/article/3436/viewcontent/art_3A10.1007_2Fs10515_014_0162_2.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3436
record_format dspace
spelling sg-smu-ink.sis_research-34362018-07-13T04:05:58Z Automatic, high accuracy prediction of reopened bugs Xia, Xin LO, David Shihab, Emad Wang, Xinyu Zhou, Bo Bug fixing is one of the most time-consuming and costly activities of the software development life cycle. In general, bugs are reported in a bug tracking system, validated by a triage team, assigned for someone to fix, and finally verified and closed. However, in some cases bugs have to be reopened. Reopened bugs increase software maintenance cost, cause rework for already busy developers and in some cases even delay the future delivery of a software release. Therefore, a few recent studies focused on studying reopened bugs. However, these prior studies did not achieve high performance (in terms of precision and recall), required manual intervention, and used very simplistic techniques when dealing with this textual data, which leads us to believe that further improvements are possible. In this paper, we propose ReopenPredictor, which is an automatic, high accuracy predictor of reopened bugs. ReopenPredictor uses a number of features, including textual features, to achieve high accuracy prediction of reopened bugs. As part of ReopenPredictor, we propose two algorithms that are used to automatically estimate various thresholds to maximize the prediction performance. To examine the benefits of ReopenPredictor, we perform experiments on three large open source projects—namely Eclipse, Apache HTTP and OpenOffice. Our results show that ReopenPredictor outperforms prior work, achieving a reopened F-measure of 0.744, 0.770, and 0.860 for Eclipse, Apache HTTP and OpenOffice, respectively. These results correspond to an improvement in the reopened F-measure of the method proposed in the prior work by Shihab et al. by 33.33, 12.57 and 3.12 % for Eclipse, Apache HTTP and OpenOffice, respectively. 2015-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2436 info:doi/10.1007/s10515-014-0162-2 https://ink.library.smu.edu.sg/context/sis_research/article/3436/viewcontent/art_3A10.1007_2Fs10515_014_0162_2.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Reopened bugs Imbalanced feature selection Imbalanced learning Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Reopened bugs
Imbalanced feature selection
Imbalanced learning
Computer Sciences
spellingShingle Reopened bugs
Imbalanced feature selection
Imbalanced learning
Computer Sciences
Xia, Xin
LO, David
Shihab, Emad
Wang, Xinyu
Zhou, Bo
Automatic, high accuracy prediction of reopened bugs
description Bug fixing is one of the most time-consuming and costly activities of the software development life cycle. In general, bugs are reported in a bug tracking system, validated by a triage team, assigned for someone to fix, and finally verified and closed. However, in some cases bugs have to be reopened. Reopened bugs increase software maintenance cost, cause rework for already busy developers and in some cases even delay the future delivery of a software release. Therefore, a few recent studies focused on studying reopened bugs. However, these prior studies did not achieve high performance (in terms of precision and recall), required manual intervention, and used very simplistic techniques when dealing with this textual data, which leads us to believe that further improvements are possible. In this paper, we propose ReopenPredictor, which is an automatic, high accuracy predictor of reopened bugs. ReopenPredictor uses a number of features, including textual features, to achieve high accuracy prediction of reopened bugs. As part of ReopenPredictor, we propose two algorithms that are used to automatically estimate various thresholds to maximize the prediction performance. To examine the benefits of ReopenPredictor, we perform experiments on three large open source projects—namely Eclipse, Apache HTTP and OpenOffice. Our results show that ReopenPredictor outperforms prior work, achieving a reopened F-measure of 0.744, 0.770, and 0.860 for Eclipse, Apache HTTP and OpenOffice, respectively. These results correspond to an improvement in the reopened F-measure of the method proposed in the prior work by Shihab et al. by 33.33, 12.57 and 3.12 % for Eclipse, Apache HTTP and OpenOffice, respectively.
format text
author Xia, Xin
LO, David
Shihab, Emad
Wang, Xinyu
Zhou, Bo
author_facet Xia, Xin
LO, David
Shihab, Emad
Wang, Xinyu
Zhou, Bo
author_sort Xia, Xin
title Automatic, high accuracy prediction of reopened bugs
title_short Automatic, high accuracy prediction of reopened bugs
title_full Automatic, high accuracy prediction of reopened bugs
title_fullStr Automatic, high accuracy prediction of reopened bugs
title_full_unstemmed Automatic, high accuracy prediction of reopened bugs
title_sort automatic, high accuracy prediction of reopened bugs
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/2436
https://ink.library.smu.edu.sg/context/sis_research/article/3436/viewcontent/art_3A10.1007_2Fs10515_014_0162_2.pdf
_version_ 1770572146165153792