Improved Duplicate Bug Report Identification

Bugs are prevalent in software systems. To improve the reliability of software systems, developers often allow end users to provide feedback on bugs that they encounter. Users could perform this by sending a bug report in a bug report management system like Bugzilla. This process however is uncoordi...

Full description

Saved in:

Bibliographic Details
Main Authors:	TIAN, Yuan, SUN, Chengnian, LO, David
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2012
Subjects:	Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/1533 https://ink.library.smu.edu.sg/context/sis_research/article/2532/viewcontent/Improved_Duplicate_Bug_Report_csmr12_av.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-2532
record_format	dspace
spelling	sg-smu-ink.sis_research-25322020-01-08T08:31:31Z Improved Duplicate Bug Report Identification TIAN, Yuan SUN, Chengnian LO, David Bugs are prevalent in software systems. To improve the reliability of software systems, developers often allow end users to provide feedback on bugs that they encounter. Users could perform this by sending a bug report in a bug report management system like Bugzilla. This process however is uncoordinated and distributed, which means that many users could submit bug reports reporting the same problem. These are referred to as duplicate bug reports. The existence of many duplicate bug reports may cause much unnecessary manual efforts as often a triager would need to manually tag bug reports as being duplicates. Recently, there have been a number of studies that investigate duplicate bug report problem which in effect answer the following question: given a new bug report, retrieve k other similar bug reports. This, however, still requires substantive manual effort which could be reduced further. Jalbert and Weimer are the first to introduce the direct detection of duplicate bug reports, it answers the question: given a new bug report, classify if it as a duplicate bug report or not. In this paper, we extend Jalbert and Weimer's work by improving the accuracy of automated duplicate bug report identification. We experiments with bug reports from Mozilla bug tracking system which were reported between February 2005 to October 2005, and find that we could improve the accuracy of the previous approach by about 160%. 2012-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1533 info:doi/10.1109/CSMR.2012.48 https://ink.library.smu.edu.sg/context/sis_research/article/2532/viewcontent/Improved_Duplicate_Bug_Report_csmr12_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Software Engineering
spellingShingle	Software Engineering TIAN, Yuan SUN, Chengnian LO, David Improved Duplicate Bug Report Identification
description	Bugs are prevalent in software systems. To improve the reliability of software systems, developers often allow end users to provide feedback on bugs that they encounter. Users could perform this by sending a bug report in a bug report management system like Bugzilla. This process however is uncoordinated and distributed, which means that many users could submit bug reports reporting the same problem. These are referred to as duplicate bug reports. The existence of many duplicate bug reports may cause much unnecessary manual efforts as often a triager would need to manually tag bug reports as being duplicates. Recently, there have been a number of studies that investigate duplicate bug report problem which in effect answer the following question: given a new bug report, retrieve k other similar bug reports. This, however, still requires substantive manual effort which could be reduced further. Jalbert and Weimer are the first to introduce the direct detection of duplicate bug reports, it answers the question: given a new bug report, classify if it as a duplicate bug report or not. In this paper, we extend Jalbert and Weimer's work by improving the accuracy of automated duplicate bug report identification. We experiments with bug reports from Mozilla bug tracking system which were reported between February 2005 to October 2005, and find that we could improve the accuracy of the previous approach by about 160%.
format	text
author	TIAN, Yuan SUN, Chengnian LO, David
author_facet	TIAN, Yuan SUN, Chengnian LO, David
author_sort	TIAN, Yuan
title	Improved Duplicate Bug Report Identification
title_short	Improved Duplicate Bug Report Identification
title_full	Improved Duplicate Bug Report Identification
title_fullStr	Improved Duplicate Bug Report Identification
title_full_unstemmed	Improved Duplicate Bug Report Identification
title_sort	improved duplicate bug report identification
publisher	Institutional Knowledge at Singapore Management University
publishDate	2012
url	https://ink.library.smu.edu.sg/sis_research/1533 https://ink.library.smu.edu.sg/context/sis_research/article/2532/viewcontent/Improved_Duplicate_Bug_Report_csmr12_av.pdf
_version_	1770571259462025216

Improved Duplicate Bug Report Identification

Similar Items