A discriminative model approach for accurate duplicate bug report retrieval

Bug repositories are usually maintained in software projects. Testers or users submit bug reports to identify various issues with systems. Sometimes two or more bug reports correspond to the same defect. To address the problem with duplicate bug reports, a person called a triager needs to manually l...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	SUN, Chengnian, LO, David, WANG, Xiaoyin, KHOO, Siau-Cheng
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2010
الموضوعات:	Distribution Maintenance Enhancement Management Reliability Software Engineering
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/3721 https://ink.library.smu.edu.sg/context/sis_research/article/4723/viewcontent/A_discriminative_model_approach_for_accurate_dupli.pdf
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

الوصف
الملخص:	Bug repositories are usually maintained in software projects. Testers or users submit bug reports to identify various issues with systems. Sometimes two or more bug reports correspond to the same defect. To address the problem with duplicate bug reports, a person called a triager needs to manually label these bug reports as duplicates, and link them to their "master" reports for subsequent maintenance work. However, in practice there are considerable duplicate bug reports sent daily; requesting triagers to manually label these bugs could be highly time consuming. To address this issue, recently, several techniques have be proposed using various similarity based metrics to detect candidate duplicate bug reports for manual verification. Automating triaging has been proved challenging as two reports of the same bug could be written in various ways. There is still much room for improvement in terms of accuracy of duplicate detection process. In this paper, we leverage recent advances on using discriminative models for information retrieval to detect duplicate bug reports more accurately. We have validated our approach on three large software bug repositories from Firefox, Eclipse, and OpenOffice. We show that our technique could result in 17--31%, 22--26%, and 35--43% relative improvement over state-of-the-art techniques in OpenOffice, Firefox, and Eclipse datasets respectively using commonly available natural language information only.

A discriminative model approach for accurate duplicate bug report retrieval

مواد مشابهة