Parameter-free imputation for imbalance datasets

© Springer International Publishing Switzerland 2014. Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-...

Full description

Saved in:
Bibliographic Details
Main Authors: Takum,J., Bunkhumpornpat,C.
Format: Article
Published: Springer Verlag 2015
Subjects:
Online Access:http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84909619298&origin=inward
http://cmuir.cmu.ac.th/handle/6653943832/38797
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-38797
record_format dspace
spelling th-cmuir.6653943832-387972015-06-16T07:54:15Z Parameter-free imputation for imbalance datasets Takum,J. Bunkhumpornpat,C. Theoretical Computer Science Computer Science (all) © Springer International Publishing Switzerland 2014. Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by traditional methods are not sufficient for imbalance datasets. In this paper, we provide a new parameter-free imputation to operate on imbalance datasets by estimating a random value between the mean of the missing value attribute and a value in this attribute of the closet record instance from the missing value record. Our proposed algorithm ignores mean of instances to avoid an over-fitting problem. Consequently, experimental results on imbalance datasets reveal that our imputation outperforms other techniques, when class imbalance measures are used. 2015-06-16T07:54:15Z 2015-06-16T07:54:15Z 2014-01-01 Article 03029743 2-s2.0-84909619298 http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84909619298&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/38797 Springer Verlag
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Theoretical Computer Science
Computer Science (all)
spellingShingle Theoretical Computer Science
Computer Science (all)
Takum,J.
Bunkhumpornpat,C.
Parameter-free imputation for imbalance datasets
description © Springer International Publishing Switzerland 2014. Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by traditional methods are not sufficient for imbalance datasets. In this paper, we provide a new parameter-free imputation to operate on imbalance datasets by estimating a random value between the mean of the missing value attribute and a value in this attribute of the closet record instance from the missing value record. Our proposed algorithm ignores mean of instances to avoid an over-fitting problem. Consequently, experimental results on imbalance datasets reveal that our imputation outperforms other techniques, when class imbalance measures are used.
format Article
author Takum,J.
Bunkhumpornpat,C.
author_facet Takum,J.
Bunkhumpornpat,C.
author_sort Takum,J.
title Parameter-free imputation for imbalance datasets
title_short Parameter-free imputation for imbalance datasets
title_full Parameter-free imputation for imbalance datasets
title_fullStr Parameter-free imputation for imbalance datasets
title_full_unstemmed Parameter-free imputation for imbalance datasets
title_sort parameter-free imputation for imbalance datasets
publisher Springer Verlag
publishDate 2015
url http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84909619298&origin=inward
http://cmuir.cmu.ac.th/handle/6653943832/38797
_version_ 1681421538677489664