Finnim Iterative Imputation Of Missing Values In Dissolved Gas Analysis Dataset
Missing values are a common occurrence in a number of real world databases, and statistical methods have been developed to deal with this problem, referred to as missing data imputation. In the detection and prediction of incipient faults in power transformers using Dissolved Gas Analysis (DGA), th...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2014
|
Subjects: | |
Online Access: | http://eprints.utem.edu.my/id/eprint/16925/2/06882199.pdf http://eprints.utem.edu.my/id/eprint/16925/ http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=6882199 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknikal Malaysia Melaka |
Language: | English |
Summary: | Missing values are a common occurrence in a number of real world databases, and statistical methods have been developed to deal with this problem, referred to as missing
data imputation. In the detection and prediction of incipient faults in power transformers using Dissolved Gas Analysis (DGA), the problem of missing values is significant and has resulted in inconclusive decision making. This study proposes an efficient non-parametric iterative imputation method, named FINNIM, which comprises of three components : the imputation ordering, the imputation estimator and the iterative imputation. The relationship between gases and faults and the percentage
of missing values in an instance are used as a basis for the
imputation ordering; whilst the plausible values for the missing values are estimated from k-nearest neighbour instances in the imputation estimator; and the iterative imputation allows complete and incomplete instances in a DGA dataset to be utilized iteratively for imputing all the missing values. Experimental results on both artificially inserted and actual missing values found in a few DGA datasets demonstrate that the proposed method outperforms the existing methods in imputation accuracy, classification performance and convergence criteria at different missing percentages. |
---|