Feature selection for demand forecasting incorporating external covariates

Feature selection is used to select a subset of features from a dataset, when developing a predictive model, and use only these selected features for prediction. This helps in not only reducing the computational cost but also in improving the forecasting performance of a machine learning model. Popu...

Full description

Saved in:
Bibliographic Details
Main Author: Mantri, Raghav
Other Authors: Jagath C Rajapakse
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/153745
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-153745
record_format dspace
spelling sg-ntu-dr.10356-1537452021-12-10T01:40:07Z Feature selection for demand forecasting incorporating external covariates Mantri, Raghav Jagath C Rajapakse School of Computer Science and Engineering ASJagath@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Feature selection is used to select a subset of features from a dataset, when developing a predictive model, and use only these selected features for prediction. This helps in not only reducing the computational cost but also in improving the forecasting performance of a machine learning model. Popular approaches in this domain engineer features from the target variable, and then proceed to independently rank these features using some scoring function. These ranks then decide the features used for forecasting the target variable. However, such scoring functions discard feature inter-dependencies, and these frameworks fail to capture the characteristics of the features that could benefit the model in prediction. To solve this, wrapper approaches are used to test out the actual feature subsets with the target model, but these approaches fail to capture the wider characteristics of a rolling window of features within each time step, when being fit in the model for evaluation. In this paper, we forecast the demand for medical products using historical in-market-sales data for a multinational medical company. We utilise machine learning methods for this purpose and perform feature selection to use only relevant features. To enhance the forecast accuracy, we incorporate external variables like Google search word trends. We also enhance this method by utilising a fixed rolling window on each of the external covariates, and reducing them dimensionally using Principal Component Analysis, to fit in the predictive model. These features are then used and have shown reduction in errors when compared to benchmark techniques. For this project, the dataset provided is confidential, so information regarding the products under study has not been shared in this report. Bachelor of Engineering (Computer Engineering) 2021-12-10T01:40:07Z 2021-12-10T01:40:07Z 2021 Final Year Project (FYP) Mantri, R. (2021). Feature selection for demand forecasting incorporating external covariates. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153745 https://hdl.handle.net/10356/153745 en SCSE20-0867 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Mantri, Raghav
Feature selection for demand forecasting incorporating external covariates
description Feature selection is used to select a subset of features from a dataset, when developing a predictive model, and use only these selected features for prediction. This helps in not only reducing the computational cost but also in improving the forecasting performance of a machine learning model. Popular approaches in this domain engineer features from the target variable, and then proceed to independently rank these features using some scoring function. These ranks then decide the features used for forecasting the target variable. However, such scoring functions discard feature inter-dependencies, and these frameworks fail to capture the characteristics of the features that could benefit the model in prediction. To solve this, wrapper approaches are used to test out the actual feature subsets with the target model, but these approaches fail to capture the wider characteristics of a rolling window of features within each time step, when being fit in the model for evaluation. In this paper, we forecast the demand for medical products using historical in-market-sales data for a multinational medical company. We utilise machine learning methods for this purpose and perform feature selection to use only relevant features. To enhance the forecast accuracy, we incorporate external variables like Google search word trends. We also enhance this method by utilising a fixed rolling window on each of the external covariates, and reducing them dimensionally using Principal Component Analysis, to fit in the predictive model. These features are then used and have shown reduction in errors when compared to benchmark techniques. For this project, the dataset provided is confidential, so information regarding the products under study has not been shared in this report.
author2 Jagath C Rajapakse
author_facet Jagath C Rajapakse
Mantri, Raghav
format Final Year Project
author Mantri, Raghav
author_sort Mantri, Raghav
title Feature selection for demand forecasting incorporating external covariates
title_short Feature selection for demand forecasting incorporating external covariates
title_full Feature selection for demand forecasting incorporating external covariates
title_fullStr Feature selection for demand forecasting incorporating external covariates
title_full_unstemmed Feature selection for demand forecasting incorporating external covariates
title_sort feature selection for demand forecasting incorporating external covariates
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/153745
_version_ 1718928724552843264