DupFinder: Integrated tool support for duplicate bug report detection

To track bugs that appear in a software, developers often make use of a bug tracking system. Users can report bugs that they encounter in such a system. Bug reporting is inherently an uncoordinated distributed process though and thus when a user submits a new bug report, there might be cases when an...

Full description

Saved in:
Bibliographic Details
Main Authors: THUNG, Ferdian, KOCHHAR, Pavneet Singh, LO, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2014
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2426
https://ink.library.smu.edu.sg/context/sis_research/article/3426/viewcontent/p871_thung.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3426
record_format dspace
spelling sg-smu-ink.sis_research-34262020-12-07T07:07:04Z DupFinder: Integrated tool support for duplicate bug report detection THUNG, Ferdian KOCHHAR, Pavneet Singh LO, David To track bugs that appear in a software, developers often make use of a bug tracking system. Users can report bugs that they encounter in such a system. Bug reporting is inherently an uncoordinated distributed process though and thus when a user submits a new bug report, there might be cases when another bug report describing exactly the same problem is already present in the system. Such bug reports are duplicate of each other and these duplicate bug reports need to be identified. A number of past studies have proposed a number of automated approaches to detect duplicate bug reports. However, these approaches are not integrated to existing bug tracking systems. In this paper, we propose a tool named DupFinder, which implements the state-of-the-art unsupervised duplicate bug report approach by Runeson et al., as a Bugzilla extension. DupFinder does not require any training data and thus can easily be deployed to any project. DupFinder extracts texts from summary and description fields of a new bug report and recent bug reports present in a bug tracking system, uses vector space model to measure similarity of bug reports, and provides developers with a list of potential duplicate bug reports based on the similarity of these reports with the new bug report. We have released DupFinder as an open source tool in GitHub, which is available at: https://github.com/smagsmu/dupfinder. 2014-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2426 info:doi/10.1145/2642937.2648627 https://ink.library.smu.edu.sg/context/sis_research/article/3426/viewcontent/p871_thung.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Bugzilla Duplicate bug reports Integrated tool support Information Security Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Bugzilla
Duplicate bug reports
Integrated tool support
Information Security
Software Engineering
spellingShingle Bugzilla
Duplicate bug reports
Integrated tool support
Information Security
Software Engineering
THUNG, Ferdian
KOCHHAR, Pavneet Singh
LO, David
DupFinder: Integrated tool support for duplicate bug report detection
description To track bugs that appear in a software, developers often make use of a bug tracking system. Users can report bugs that they encounter in such a system. Bug reporting is inherently an uncoordinated distributed process though and thus when a user submits a new bug report, there might be cases when another bug report describing exactly the same problem is already present in the system. Such bug reports are duplicate of each other and these duplicate bug reports need to be identified. A number of past studies have proposed a number of automated approaches to detect duplicate bug reports. However, these approaches are not integrated to existing bug tracking systems. In this paper, we propose a tool named DupFinder, which implements the state-of-the-art unsupervised duplicate bug report approach by Runeson et al., as a Bugzilla extension. DupFinder does not require any training data and thus can easily be deployed to any project. DupFinder extracts texts from summary and description fields of a new bug report and recent bug reports present in a bug tracking system, uses vector space model to measure similarity of bug reports, and provides developers with a list of potential duplicate bug reports based on the similarity of these reports with the new bug report. We have released DupFinder as an open source tool in GitHub, which is available at: https://github.com/smagsmu/dupfinder.
format text
author THUNG, Ferdian
KOCHHAR, Pavneet Singh
LO, David
author_facet THUNG, Ferdian
KOCHHAR, Pavneet Singh
LO, David
author_sort THUNG, Ferdian
title DupFinder: Integrated tool support for duplicate bug report detection
title_short DupFinder: Integrated tool support for duplicate bug report detection
title_full DupFinder: Integrated tool support for duplicate bug report detection
title_fullStr DupFinder: Integrated tool support for duplicate bug report detection
title_full_unstemmed DupFinder: Integrated tool support for duplicate bug report detection
title_sort dupfinder: integrated tool support for duplicate bug report detection
publisher Institutional Knowledge at Singapore Management University
publishDate 2014
url https://ink.library.smu.edu.sg/sis_research/2426
https://ink.library.smu.edu.sg/context/sis_research/article/3426/viewcontent/p871_thung.pdf
_version_ 1770572142667104256