Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem

Ensemble learning by combining several single classifiers or another ensemble classifier is one of the procedures to solve the imbalance problem in multiclass data. However, this approach still faces the question of how the ensemble methods obtain their higher performance. In this paper, an investig...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohd Shamrie Sainin, Rayner Alfred, Faudziah Ahmad
Format: Article
Language:English
English
Published: Universiti Utara Malaysia 2021
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/29977/1/Ensemble%20meta%20classifier%20with%20sampling%20and%20feature%20selection%20for%20data%20with%20multiclass%20imbalance%20problem-Abstract.pdf
https://eprints.ums.edu.my/id/eprint/29977/2/Ensemble%20meta%20classifier%20with%20sampling%20and%20feature%20selection%20for%20data%20with%20multiclass%20imbalance%20problem.pdf
https://eprints.ums.edu.my/id/eprint/29977/
http://jict.uum.edu.my/images/Vol20No2April2021/103-133.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sabah
Language: English
English
id my.ums.eprints.29977
record_format eprints
spelling my.ums.eprints.299772021-07-15T06:31:17Z https://eprints.ums.edu.my/id/eprint/29977/ Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem Mohd Shamrie Sainin Rayner Alfred Faudziah Ahmad QA Mathematics TK Electrical engineering. Electronics Nuclear engineering Ensemble learning by combining several single classifiers or another ensemble classifier is one of the procedures to solve the imbalance problem in multiclass data. However, this approach still faces the question of how the ensemble methods obtain their higher performance. In this paper, an investigation was carried out on the design of the meta classifier ensemble with sampling and feature selection for multiclass imbalanced data. The specific objectives were: 1) to improve the ensemble classifier through data-level approach (sampling and feature selection); 2) to perform experiments on sampling, feature selection, and ensemble classifier model; and 3) to evaluate the performance of the ensemble classifier. To fulfil the objectives, a preliminary data collection of Malaysian plants’ leaf images was prepared and experimented, and the results were compared. The ensemble design was also tested with three other high imbalance ratio benchmark data. It was found that the design using sampling, feature selection, and ensemble classifier method via AdaboostM1 with random forest (also an ensemble classifier) provided improved performance throughout the investigation. The result of this study is important to the ongoing problem of multiclass imbalance where specific structure and its performance can be improved in terms of processing time and accuracy Universiti Utara Malaysia 2021-02-23 Article PeerReviewed text en https://eprints.ums.edu.my/id/eprint/29977/1/Ensemble%20meta%20classifier%20with%20sampling%20and%20feature%20selection%20for%20data%20with%20multiclass%20imbalance%20problem-Abstract.pdf text en https://eprints.ums.edu.my/id/eprint/29977/2/Ensemble%20meta%20classifier%20with%20sampling%20and%20feature%20selection%20for%20data%20with%20multiclass%20imbalance%20problem.pdf Mohd Shamrie Sainin and Rayner Alfred and Faudziah Ahmad (2021) Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem. Journal of Information and Communication Technology (JICT), 20 (2). pp. 103-133. ISSN 2180-3862 http://jict.uum.edu.my/images/Vol20No2April2021/103-133.pdf
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
English
topic QA Mathematics
TK Electrical engineering. Electronics Nuclear engineering
spellingShingle QA Mathematics
TK Electrical engineering. Electronics Nuclear engineering
Mohd Shamrie Sainin
Rayner Alfred
Faudziah Ahmad
Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
description Ensemble learning by combining several single classifiers or another ensemble classifier is one of the procedures to solve the imbalance problem in multiclass data. However, this approach still faces the question of how the ensemble methods obtain their higher performance. In this paper, an investigation was carried out on the design of the meta classifier ensemble with sampling and feature selection for multiclass imbalanced data. The specific objectives were: 1) to improve the ensemble classifier through data-level approach (sampling and feature selection); 2) to perform experiments on sampling, feature selection, and ensemble classifier model; and 3) to evaluate the performance of the ensemble classifier. To fulfil the objectives, a preliminary data collection of Malaysian plants’ leaf images was prepared and experimented, and the results were compared. The ensemble design was also tested with three other high imbalance ratio benchmark data. It was found that the design using sampling, feature selection, and ensemble classifier method via AdaboostM1 with random forest (also an ensemble classifier) provided improved performance throughout the investigation. The result of this study is important to the ongoing problem of multiclass imbalance where specific structure and its performance can be improved in terms of processing time and accuracy
format Article
author Mohd Shamrie Sainin
Rayner Alfred
Faudziah Ahmad
author_facet Mohd Shamrie Sainin
Rayner Alfred
Faudziah Ahmad
author_sort Mohd Shamrie Sainin
title Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
title_short Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
title_full Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
title_fullStr Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
title_full_unstemmed Ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
title_sort ensemble meta classifier with sampling and feature selection for data with multiclass imbalance problem
publisher Universiti Utara Malaysia
publishDate 2021
url https://eprints.ums.edu.my/id/eprint/29977/1/Ensemble%20meta%20classifier%20with%20sampling%20and%20feature%20selection%20for%20data%20with%20multiclass%20imbalance%20problem-Abstract.pdf
https://eprints.ums.edu.my/id/eprint/29977/2/Ensemble%20meta%20classifier%20with%20sampling%20and%20feature%20selection%20for%20data%20with%20multiclass%20imbalance%20problem.pdf
https://eprints.ums.edu.my/id/eprint/29977/
http://jict.uum.edu.my/images/Vol20No2April2021/103-133.pdf
_version_ 1760230703339929600