A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia

Rainfall data are the most significant values in hydrology and climatology modelling. However, the datasets are prone to missing values due to various issues. This study aspires to impute the rainfall missing values by using various imputation method such as Replace by Mean, Nearest Neighbor, Random...

全面介紹

Saved in:
書目詳細資料
Main Authors: Che Mat Nor, Siti Mariana, Shaharudin, Shazlyn Milleana, Ismail, Shuhaida, Zainuddin, Nurul Hila, Tan, Mou Leong
格式: Article
出版: Universitas Ahmad Dahlan 2020
主題:
在線閱讀:http://eprints.uthm.edu.my/6100/
https://dx.doi.org/10.11591/eei.v9i2.2090
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Rainfall data are the most significant values in hydrology and climatology modelling. However, the datasets are prone to missing values due to various issues. This study aspires to impute the rainfall missing values by using various imputation method such as Replace by Mean, Nearest Neighbor, Random Forest, Non-linear Interactive Partial Least-Square (NIPALS) and Markov Chain Monte Carlo (MCMC). Daily rainfall datasets from 48 rainfall stations across east-coast Peninsular Malaysia were used in this study. The dataset were then fed into Multiple Linear Regression (MLR) model. The performance of abovementioned methods were evaluated using Root Mean Square Method (RMSE), Mean Absolute Error (MAE) and Nash-Sutcliffe Efficiency Coefficient (CE). The experimental results showed that RF coupled with MLR (RF-MLR) approach was attained as more fitting for satisfying the missing data in east-coast Peninsular Malaysia.