Empirical analysis of rank aggregation-based multi-filter feature selection methods in software defect prediction

Selecting the most suitable filter method that will produce a subset of features with the best performance remains an open problem that is known as filter rank selection problem. A viable solution to this problem is to independently apply a mixture of filter methods and evaluate the results. This st...

Full description

Saved in:
Bibliographic Details
Main Authors: Balogun, A.O., Basri, S., Mahamad, S., Abdulkadir, S.J., Capretz, L.F., Imam, A.A., Almomani, M.A., Adeyemo, V.E., Kumar, G.
Format: Article
Published: MDPI AG 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85100099715&doi=10.3390%2felectronics10020179&partnerID=40&md5=2b52e334c21bf34d53bbf53d0cea34b7
http://eprints.utp.edu.my/23911/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Petronas
Description
Summary:Selecting the most suitable filter method that will produce a subset of features with the best performance remains an open problem that is known as filter rank selection problem. A viable solution to this problem is to independently apply a mixture of filter methods and evaluate the results. This study proposes novel rank aggregation-based multi-filter feature selection (FS) methods to address high dimensionality and filter rank selection problem in software defect prediction (SDP). The proposed methods combine rank lists generated by individual filter methods using rank aggregation mechanisms into a single aggregated rank list. The proposed methods aim to resolve the filter selection problem by using multiple filter methods of diverse computational characteristics to produce a dis-joint and complete feature rank list superior to individual filter rank methods. The effectiveness of the proposed method was evaluated with Decision Tree (DT) and Naïve Bayes (NB) models on defect datasets from NASA repository. From the experimental results, the proposed methods had a superior impact (positive) on prediction performances of NB and DT models than other experimented FS methods. This makes the combination of filter rank methods a viable solution to filter rank selection problem and enhancement of prediction models in SDP. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.