Combined classifier for cross-project defect prediction: An extended empirical study
To help developers better allocate testing and debugging efforts, many software defect prediction techniques have been proposed in the literature. These techniques can be used to predict classes that are more likely to be buggy based on past history of buggy classes. These techniques work well as lo...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2018
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4130 https://ink.library.smu.edu.sg/context/sis_research/article/5133/viewcontent/Combined_classifier_for_cross_project_defect_prediction.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5133 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-51332019-06-06T08:53:04Z Combined classifier for cross-project defect prediction: An extended empirical study ZHANG, Yun LO, David XIA, Xin SUN, Jianling To help developers better allocate testing and debugging efforts, many software defect prediction techniques have been proposed in the literature. These techniques can be used to predict classes that are more likely to be buggy based on past history of buggy classes. These techniques work well as long as a sufficient amount of data is available to train a prediction model. However, there is rarely enough training data for new software projects. To deal with this problem, cross-project defect prediction, which transfers a prediction model trained using data from one project to another, has been proposed and is regarded as a new challenge for defect prediction. So far, only a few cross-project defect prediction techniques have been proposed. To advance the state-of-the-art, in this work, we investigate 7 composite algorithms, which integrate multiple machine learning classifiers, to improve cross-project defect prediction. To evaluate the performance of the composite algorithms, we perform experiments on 10 open source software systems from the PROMISE repository which contain a total of 5,305 instances labeled as defective or clean. We compare the composite algorithms with CODEPLogistic, which is the latest cross-project defect prediction algorithm proposed by Panichella et al. [1], in terms of two standard evaluation metrics: cost effectiveness and F-measure. Our experiment results show that several algorithms outperform CODEPLogistic: Max performs the best in terms of F-measure and its average F-measure outperforms that of CODEPLogistic by 36.88%. BaggingJ48 performs the best in terms of cost effectiveness and its average cost effectiveness outperforms that of CODEPLogistic by 15.34%. 2018-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4130 info:doi/10.1007/s11704-017-6015-y https://ink.library.smu.edu.sg/context/sis_research/article/5133/viewcontent/Combined_classifier_for_cross_project_defect_prediction.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Defect Prediction Cross-Project Classifier Combination Programming Languages and Compilers Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Defect Prediction Cross-Project Classifier Combination Programming Languages and Compilers Software Engineering |
spellingShingle |
Defect Prediction Cross-Project Classifier Combination Programming Languages and Compilers Software Engineering ZHANG, Yun LO, David XIA, Xin SUN, Jianling Combined classifier for cross-project defect prediction: An extended empirical study |
description |
To help developers better allocate testing and debugging efforts, many software defect prediction techniques have been proposed in the literature. These techniques can be used to predict classes that are more likely to be buggy based on past history of buggy classes. These techniques work well as long as a sufficient amount of data is available to train a prediction model. However, there is rarely enough training data for new software projects. To deal with this problem, cross-project defect prediction, which transfers a prediction model trained using data from one project to another, has been proposed and is regarded as a new challenge for defect prediction. So far, only a few cross-project defect prediction techniques have been proposed. To advance the state-of-the-art, in this work, we investigate 7 composite algorithms, which integrate multiple machine learning classifiers, to improve cross-project defect prediction. To evaluate the performance of the composite algorithms, we perform experiments on 10 open source software systems from the PROMISE repository which contain a total of 5,305 instances labeled as defective or clean. We compare the composite algorithms with CODEPLogistic, which is the latest cross-project defect prediction algorithm proposed by Panichella et al. [1], in terms of two standard evaluation metrics: cost effectiveness and F-measure. Our experiment results show that several algorithms outperform CODEPLogistic: Max performs the best in terms of F-measure and its average F-measure outperforms that of CODEPLogistic by 36.88%. BaggingJ48 performs the best in terms of cost effectiveness and its average cost effectiveness outperforms that of CODEPLogistic by 15.34%. |
format |
text |
author |
ZHANG, Yun LO, David XIA, Xin SUN, Jianling |
author_facet |
ZHANG, Yun LO, David XIA, Xin SUN, Jianling |
author_sort |
ZHANG, Yun |
title |
Combined classifier for cross-project defect prediction: An extended empirical study |
title_short |
Combined classifier for cross-project defect prediction: An extended empirical study |
title_full |
Combined classifier for cross-project defect prediction: An extended empirical study |
title_fullStr |
Combined classifier for cross-project defect prediction: An extended empirical study |
title_full_unstemmed |
Combined classifier for cross-project defect prediction: An extended empirical study |
title_sort |
combined classifier for cross-project defect prediction: an extended empirical study |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2018 |
url |
https://ink.library.smu.edu.sg/sis_research/4130 https://ink.library.smu.edu.sg/context/sis_research/article/5133/viewcontent/Combined_classifier_for_cross_project_defect_prediction.pdf |
_version_ |
1770574345779806208 |