Finnim Iterative Imputation Of Missing Values In Dissolved Gas Analysis Dataset

Missing values are a common occurrence in a number of real world databases, and statistical methods have been developed to deal with this problem, referred to as missing data imputation. In the detection and prediction of incipient faults in power transformers using Dissolved Gas Analysis (DGA), th...

Full description

Saved in:
Bibliographic Details
Main Authors: Zahriah, Sahri, Rubiyah, Yusof, Junzo, Watada
Format: Article
Language:English
Published: IEEE 2014
Subjects:
Online Access:http://eprints.utem.edu.my/id/eprint/16925/2/06882199.pdf
http://eprints.utem.edu.my/id/eprint/16925/
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=6882199
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknikal Malaysia Melaka
Language: English
Description
Summary:Missing values are a common occurrence in a number of real world databases, and statistical methods have been developed to deal with this problem, referred to as missing data imputation. In the detection and prediction of incipient faults in power transformers using Dissolved Gas Analysis (DGA), the problem of missing values is significant and has resulted in inconclusive decision making. This study proposes an efficient non-parametric iterative imputation method, named FINNIM, which comprises of three components : the imputation ordering, the imputation estimator and the iterative imputation. The relationship between gases and faults and the percentage of missing values in an instance are used as a basis for the imputation ordering; whilst the plausible values for the missing values are estimated from k-nearest neighbour instances in the imputation estimator; and the iterative imputation allows complete and incomplete instances in a DGA dataset to be utilized iteratively for imputing all the missing values. Experimental results on both artificially inserted and actual missing values found in a few DGA datasets demonstrate that the proposed method outperforms the existing methods in imputation accuracy, classification performance and convergence criteria at different missing percentages.