Parameter-free imputation for imbalance datasets
© Springer International Publishing Switzerland 2014. Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Published: |
Springer Verlag
2015
|
Subjects: | |
Online Access: | http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84909619298&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/38797 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Chiang Mai University |
id |
th-cmuir.6653943832-38797 |
---|---|
record_format |
dspace |
spelling |
th-cmuir.6653943832-387972015-06-16T07:54:15Z Parameter-free imputation for imbalance datasets Takum,J. Bunkhumpornpat,C. Theoretical Computer Science Computer Science (all) © Springer International Publishing Switzerland 2014. Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by traditional methods are not sufficient for imbalance datasets. In this paper, we provide a new parameter-free imputation to operate on imbalance datasets by estimating a random value between the mean of the missing value attribute and a value in this attribute of the closet record instance from the missing value record. Our proposed algorithm ignores mean of instances to avoid an over-fitting problem. Consequently, experimental results on imbalance datasets reveal that our imputation outperforms other techniques, when class imbalance measures are used. 2015-06-16T07:54:15Z 2015-06-16T07:54:15Z 2014-01-01 Article 03029743 2-s2.0-84909619298 http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84909619298&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/38797 Springer Verlag |
institution |
Chiang Mai University |
building |
Chiang Mai University Library |
country |
Thailand |
collection |
CMU Intellectual Repository |
topic |
Theoretical Computer Science Computer Science (all) |
spellingShingle |
Theoretical Computer Science Computer Science (all) Takum,J. Bunkhumpornpat,C. Parameter-free imputation for imbalance datasets |
description |
© Springer International Publishing Switzerland 2014. Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by traditional methods are not sufficient for imbalance datasets. In this paper, we provide a new parameter-free imputation to operate on imbalance datasets by estimating a random value between the mean of the missing value attribute and a value in this attribute of the closet record instance from the missing value record. Our proposed algorithm ignores mean of instances to avoid an over-fitting problem. Consequently, experimental results on imbalance datasets reveal that our imputation outperforms other techniques, when class imbalance measures are used. |
format |
Article |
author |
Takum,J. Bunkhumpornpat,C. |
author_facet |
Takum,J. Bunkhumpornpat,C. |
author_sort |
Takum,J. |
title |
Parameter-free imputation for imbalance datasets |
title_short |
Parameter-free imputation for imbalance datasets |
title_full |
Parameter-free imputation for imbalance datasets |
title_fullStr |
Parameter-free imputation for imbalance datasets |
title_full_unstemmed |
Parameter-free imputation for imbalance datasets |
title_sort |
parameter-free imputation for imbalance datasets |
publisher |
Springer Verlag |
publishDate |
2015 |
url |
http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84909619298&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/38797 |
_version_ |
1681421538677489664 |