Software fault prediction based on improved fuzzy clustering

Predicting parts of the software programs that are more defects prone could ease up the software testing process and helps effectively to reduce the cost and time of developments. Although many machine-learning and statistical techniques have been proposed widely for defining fault prone modules in...

Full description

Saved in:
Bibliographic Details
Main Authors: Abaei, Golnoosh, Selamat, Ali
Format: Article
Published: Springer Verlag 2014
Subjects:
Online Access:http://eprints.utm.my/id/eprint/62611/
http://dx.doi.org/10.1007/978-3-319-07593-8_21
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.62611
record_format eprints
spelling my.utm.626112017-06-18T08:57:19Z http://eprints.utm.my/id/eprint/62611/ Software fault prediction based on improved fuzzy clustering Abaei, Golnoosh Selamat, Ali QA75 Electronic computers. Computer science Predicting parts of the software programs that are more defects prone could ease up the software testing process and helps effectively to reduce the cost and time of developments. Although many machine-learning and statistical techniques have been proposed widely for defining fault prone modules in software fault prediction, but this area have yet to be explored with high accuracy and less error. Unfortunately, several earlier methods including artificial neural networks and its variants that have been used, marred by limitations such as inability to adequately handle uncertainties in software measurement data which leads to low accuracy, instability and inconsistency in prediction. In this paper, first the effect of irrelevant and inconsistent modules on fault prediction is decreased by designing a new framework, in which the entire project's modules are clustered. The generated output is then passed to the next model in the hybrid setting, which is a probabilistic neural network (PNN) for training and prediction. We used four NASA data sets to evaluate our results. Performance evaluation in terms of false positive rate, false negative rate, and overall error are calculated and showed 30% to 60% improvement in false negative rate compared to other well-performed training methods such as naïve Bayes and random forest. Springer Verlag 2014 Article PeerReviewed Abaei, Golnoosh and Selamat, Ali (2014) Software fault prediction based on improved fuzzy clustering. Advances in Intelligent Systems and Computing, 290 . pp. 165-172. ISSN 2194-5357 http://dx.doi.org/10.1007/978-3-319-07593-8_21 DOI:10.1007/978-3-319-07593-8_21
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Abaei, Golnoosh
Selamat, Ali
Software fault prediction based on improved fuzzy clustering
description Predicting parts of the software programs that are more defects prone could ease up the software testing process and helps effectively to reduce the cost and time of developments. Although many machine-learning and statistical techniques have been proposed widely for defining fault prone modules in software fault prediction, but this area have yet to be explored with high accuracy and less error. Unfortunately, several earlier methods including artificial neural networks and its variants that have been used, marred by limitations such as inability to adequately handle uncertainties in software measurement data which leads to low accuracy, instability and inconsistency in prediction. In this paper, first the effect of irrelevant and inconsistent modules on fault prediction is decreased by designing a new framework, in which the entire project's modules are clustered. The generated output is then passed to the next model in the hybrid setting, which is a probabilistic neural network (PNN) for training and prediction. We used four NASA data sets to evaluate our results. Performance evaluation in terms of false positive rate, false negative rate, and overall error are calculated and showed 30% to 60% improvement in false negative rate compared to other well-performed training methods such as naïve Bayes and random forest.
format Article
author Abaei, Golnoosh
Selamat, Ali
author_facet Abaei, Golnoosh
Selamat, Ali
author_sort Abaei, Golnoosh
title Software fault prediction based on improved fuzzy clustering
title_short Software fault prediction based on improved fuzzy clustering
title_full Software fault prediction based on improved fuzzy clustering
title_fullStr Software fault prediction based on improved fuzzy clustering
title_full_unstemmed Software fault prediction based on improved fuzzy clustering
title_sort software fault prediction based on improved fuzzy clustering
publisher Springer Verlag
publishDate 2014
url http://eprints.utm.my/id/eprint/62611/
http://dx.doi.org/10.1007/978-3-319-07593-8_21
_version_ 1643655471235596288