Data quality matters: A case study on data label correctness for security bug report prediction
In the research of mining software repositories, we need to label a large amount of data to construct a predictive model. The correctness of the labels will affect the performance of a model substantially. However, limited studies have been performed to investigate the impact of mislabeled instances...
Saved in:
Main Authors: | WU, Xiaoxue, ZHENG, Wei, XIA, Xin, LO, David |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7436 https://ink.library.smu.edu.sg/context/sis_research/article/8439/viewcontent/DataQualityMatters_2022_av.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
On the unreliability of bug severity data
by: TIAN, Yuan, et al.
Published: (2015) -
Predicting Effectiveness of IR-Based Bug Localization Techniques
by: LE, Tien-Duy B., et al.
Published: (2014) -
Software bug management from bug reports to bug signatures
by: SUN CHENGNIAN
Published: (2013) -
Where should the bugs be fixed? More accurate information retrieval-based bug localization based on bug reports
by: ZHOU, Jian, et al.
Published: (2012) -
Will this localization tool be effective for this bug? Mitigating the impact of unreliability of information retrieval based bug localization tools
by: LE, Tien-Duy B., et al.
Published: (2017)