Exploration of machine learning forecasting methods in M4 competition / Muhammad Halim Hamdan and Shuzlina Abdul-Rahman

There are so many forecasting algorithms and techniques available. The abilities of Data Mining to obtain and gather data from multiple sources is very useful to researcher, practitioner, business and more. From a long list of forecasting algorithms that have been built throughout the years, it will...

Full description

Saved in:
Bibliographic Details
Main Authors: Hamdan, Muhammad Halim, Abdul-Rahman, Shuzlina
Format: Article
Language:English
Published: Universiti Teknologi MARA, Perak 2021
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/61456/1/61456.pdf
https://ir.uitm.edu.my/id/eprint/61456/
https://mijuitm.com.my/view-articles/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Mara
Language: English
Description
Summary:There are so many forecasting algorithms and techniques available. The abilities of Data Mining to obtain and gather data from multiple sources is very useful to researcher, practitioner, business and more. From a long list of forecasting algorithms that have been built throughout the years, it will be exhaustive for someone to go through the list one by one to choose which algorithm to use. With M competition established, there are many more new techniques being innovated each time it is organized. This research aims to compare and contrast the machine learning forecasting techniques that are used in M4 Competition, to get better understanding on each technique and to identify the best technique. Three machine learning techniques from M4 Competition were chosen to be compared in this research. Each technique was replicated, trained and tested accordingly. M4 competition dataset was used in this research, with 100,000 time series data and multiple data frequency, which is enough to replicate the real-world situation. The results indicate that the three techniques have their strength, with RNN+ES technique on top of it. RNN+ES and CNN-TS performed well in relative to Naive2 benchmark, while k-NS model performed the worst. Further research on the datasets and investigation of each model to further improve its capabilities are needed to improve the performance of the model.