การใส่ค่าข้อมูลที่ขาดหายโดยวิธีเพื่อนบ้านใกล้ที่สุดเคตัวเพื่อการจำแนกประเภทในชุดข้อมูลอสมดุล

Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by tradi...

Full description

Saved in:
Bibliographic Details
Main Author: จินตนา ตาคำ
Other Authors: ผู้ช่วยศาสตราจารย์ ดร.ชุมพล บุญคุ้มพรภัทร
Format: Theses and Dissertations
Language:Thai
Published: เชียงใหม่ : บัณฑิตวิทยาลัย มหาวิทยาลัยเชียงใหม่ 2020
Online Access:http://cmuir.cmu.ac.th/jspui/handle/6653943832/69257
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
Language: Thai
Description
Summary:Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by traditional methods are not sufficient for imbalance datasets. In this paper, we provide a new parameter-free imputation to operate on imbalance datasets by estimating a random value between the mean of the missing value attribute and a value in this attribute of the closet record instance from the missing value record. Our proposed algorithm ignores mean of instances to avoid an over-fitting problem. Consequently, experimental results on imbalance datasets reveal that our imputation outperforms other techniques, when class imbalance measures are used.