Hybrid modeling in the predictive analytics of energy systems and prices

The aim of this paper is to illustrate the nature of the residuals of a forecasting process and to propose a hybrid approach with linear and nonlinear components predicted by corresponding methodologies. It is a common practice that residuals are assumed to be unpredictable or are reiterated into a...

Full description

Saved in:
Bibliographic Details
Main Authors: Gulay, Emrah, Duru, Okan
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/155500
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:The aim of this paper is to illustrate the nature of the residuals of a forecasting process and to propose a hybrid approach with linear and nonlinear components predicted by corresponding methodologies. It is a common practice that residuals are assumed to be unpredictable or are reiterated into a model as lagged variables to capture any information remaining in the residual data. The central argument of this paper is that residuals from energy price forecasting can still carry predictive information in its complex and nonlinear form. Although the linear modeling is initially very accurate, reiterating residuals in linear structures is a mismatch of data type and methodology. In this regard, the proposed algorithm hybridizes or combines linear components captured by the Autoregressive Distributed Lag Model (ARDL) and nonlinear components processed by the Empirical Mode Decomposition (EMD) and an Artificial Neural Network (ANN) to improve post-sample accuracy. The conventional reiterative process can improve in-sample accuracy, which literally has no value for business forecasting practices. Through a fair benchmark comparison, including methodologies of other combinations, the proposed algorithm is cross-validated by predictive accuracy gain in the out-of-sample (holdout) dataset.