The Effective Redistribution for Imbalance Dataset: Relocating Safe-Level SMOTE with Minority Outcast Handling
Main Authors: | , |
---|---|
Language: | English |
Published: | Science Faculty of Chiang Mai University, 2019 |
Online Access: | http://it.science.cmu.ac.th/ejournal/dl.php?journal_id=6324 http://cmuir.cmu.ac.th/jspui/handle/6653943832/66081 |
Institution: | Chiang Mai University |
Summary: | Redistributing the target class by oversampling synthetic minority instances is one of the effective approaches to the class imbalance problem. Safe-Level SMOTE generates synthetic minority instances around original instances while avoiding nearby majority ones. Despite this intention, some synthetic instances can still be placed too close to nearby majority instances, which may confuse some classifiers. Moreover, Safe-Level SMOTE avoids using minority outcast instances to generate synthetic instances, so the generated dataset may lose valuable information about the minority class. This paper aims to remedy these two drawbacks of Safe-Level SMOTE by combining two processes. The first checks synthetic instances and moves them away from surrounding majority instances. The second handles minority outcasts with a 1-nearest neighbor model. Empirical results on UCI and PROMISE datasets show improvements in F-measure, the performance measure used for the class imbalance problem, for various classifiers such as decision tree, naïve Bayes, multilayer perceptron, support vector machine, and k-nearest neighbor. The significance of the improvements is tested with the Wilcoxon sign test. |
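The first process in the summary, checking a synthetic instance and moving it away from nearby majority instances, might be sketched roughly as follows. This is only an illustration under stated assumptions, not the paper's actual procedure: the record does not give the relocation rule, so the function name `relocate_synthetic`, the `step` parameter, and the rule itself (pull the point toward its nearest minority instance whenever a majority instance is closer) are all hypothetical.

```python
import math


def nearest(point, candidates):
    """Return the candidate closest to `point` by Euclidean distance."""
    return min(candidates, key=lambda c: math.dist(point, c))


def relocate_synthetic(synthetic, minority, majority, step=0.5):
    """Hypothetical relocation rule (an assumption, not the paper's method).

    If the synthetic point lies closer to a majority instance than to any
    minority instance, move it a fraction `step` of the way toward its
    nearest minority instance. Otherwise return it unchanged.
    """
    near_min = nearest(synthetic, minority)
    near_maj = nearest(synthetic, majority)
    if math.dist(synthetic, near_maj) < math.dist(synthetic, near_min):
        return tuple(s + step * (m - s) for s, m in zip(synthetic, near_min))
    return tuple(synthetic)


# Toy data: two minority instances and one majority instance in 2-D.
minority = [(0.0, 0.0), (1.0, 0.0)]
majority = [(1.0, 1.0)]

# This point sits next to the majority instance, so it gets pulled back.
moved = relocate_synthetic((0.9, 0.8), minority, majority)

# This point is already nearer to the minority class, so it stays put.
kept = relocate_synthetic((0.5, 0.0), minority, majority)
```

Moving toward the nearest minority instance is just one simple way to increase the distance from the majority class; the paper may use a different criterion or direction.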