Safe level graph for synthetic minority over-sampling techniques
In the class imbalance problem, most existent classifiers which are designed by the distribution of balance datasets fail to recognize minority classes since a large number of negative instances can dominate a few positive instances. Borderline-SMOTE and Safe-Level-SMOTE are over-sampling techniques...
Saved in:
Main Authors: | , |
---|---|
Format: | Conference Proceeding |
Published: |
2018
|
Subjects: | |
Online Access: | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84891076473&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/52410 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Chiang Mai University |
id |
th-cmuir.6653943832-52410 |
---|---|
record_format |
dspace |
spelling |
th-cmuir.6653943832-524102018-09-04T09:25:02Z Safe level graph for synthetic minority over-sampling techniques Chumphol Bunkhumpornpat Sitthichoke Subpaiboonkit Computer Science In the class imbalance problem, most existent classifiers which are designed by the distribution of balance datasets fail to recognize minority classes since a large number of negative instances can dominate a few positive instances. Borderline-SMOTE and Safe-Level-SMOTE are over-sampling techniques which are applied to handle this situation by generating synthetic instances in different regions. The former operates on the border of a minority class while the latter works inside the class far from the border. Unfortunately, a data miner is unable to conveniently justify a suitable SMOTE for each dataset. In this paper, a safe level graph is proposed as a guideline tool for selecting an appropriate SMOTE and describes the characteristic of a minority class in an imbalance dataset. Relying on advice of a safe level graph, the experimental success rate is shown to reach 73% when an F-measure is used as the performance measure and 78% for satisfactory AUCs. © 2013 IEEE. 2018-09-04T09:25:02Z 2018-09-04T09:25:02Z 2013-12-31 Conference Proceeding 2-s2.0-84891076473 10.1109/ISCIT.2013.6645923 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84891076473&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/52410 |
institution |
Chiang Mai University |
building |
Chiang Mai University Library |
country |
Thailand |
collection |
CMU Intellectual Repository |
topic |
Computer Science |
spellingShingle |
Computer Science Chumphol Bunkhumpornpat Sitthichoke Subpaiboonkit Safe level graph for synthetic minority over-sampling techniques |
description |
In the class imbalance problem, most existent classifiers which are designed by the distribution of balance datasets fail to recognize minority classes since a large number of negative instances can dominate a few positive instances. Borderline-SMOTE and Safe-Level-SMOTE are over-sampling techniques which are applied to handle this situation by generating synthetic instances in different regions. The former operates on the border of a minority class while the latter works inside the class far from the border. Unfortunately, a data miner is unable to conveniently justify a suitable SMOTE for each dataset. In this paper, a safe level graph is proposed as a guideline tool for selecting an appropriate SMOTE and describes the characteristic of a minority class in an imbalance dataset. Relying on advice of a safe level graph, the experimental success rate is shown to reach 73% when an F-measure is used as the performance measure and 78% for satisfactory AUCs. © 2013 IEEE. |
format |
Conference Proceeding |
author |
Chumphol Bunkhumpornpat Sitthichoke Subpaiboonkit |
author_facet |
Chumphol Bunkhumpornpat Sitthichoke Subpaiboonkit |
author_sort |
Chumphol Bunkhumpornpat |
title |
Safe level graph for synthetic minority over-sampling techniques |
title_short |
Safe level graph for synthetic minority over-sampling techniques |
title_full |
Safe level graph for synthetic minority over-sampling techniques |
title_fullStr |
Safe level graph for synthetic minority over-sampling techniques |
title_full_unstemmed |
Safe level graph for synthetic minority over-sampling techniques |
title_sort |
safe level graph for synthetic minority over-sampling techniques |
publishDate |
2018 |
url |
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84891076473&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/52410 |
_version_ |
1681423946049650688 |