Investigating spatial and non-spatial cross-validation techniques

Accurately evaluating predictive models, especially when built for spatial data, remains a challenge due to the limitations of traditional cross-validation techniques. While spatial cross-validation techniques have been developed to address those challenges, there is still a lack of clear guidance o...

Full description

Saved in:
Bibliographic Details
Main Author: Kan, Kai Huai Feng
Other Authors: Michele Nguyen
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181133
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181133
record_format dspace
spelling sg-ntu-dr.10356-1811332024-11-15T12:26:01Z Investigating spatial and non-spatial cross-validation techniques Kan, Kai Huai Feng Michele Nguyen College of Computing and Data Science michele.nguyen@ntu.edu.sg Computer and Information Science Cross-validation techniques Accurately evaluating predictive models, especially when built for spatial data, remains a challenge due to the limitations of traditional cross-validation techniques. While spatial cross-validation techniques have been developed to address those challenges, there is still a lack of clear guidance on when a spatial or non-spatial technique would yield the most precise and dependable assessment of a model's performance. This study investigates the effectiveness of the cross-validation method, spatial or non-spatial, in yielding accurate estimates by comparing three spatial techniques (Spatial K-Fold, Blocked, Buffered) and three non-spatial techniques (Random K-Fold, Bootstrap, Importance-Weighted). Using simulated landscapes, model performance is assessed across a range of metrics including Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Coefficient of Determination (R-Squared), and Bias. The findings emphasize the importance of considering spatial autocorrelation and covariate shifts, offering practical guidance on the selection of cross-validation techniques in spatial modelling contexts. Bachelor's degree 2024-11-15T12:26:01Z 2024-11-15T12:26:01Z 2024 Final Year Project (FYP) Kan, K. H. F. (2024). Investigating spatial and non-spatial cross-validation techniques. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181133 https://hdl.handle.net/10356/181133 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Cross-validation techniques
spellingShingle Computer and Information Science
Cross-validation techniques
Kan, Kai Huai Feng
Investigating spatial and non-spatial cross-validation techniques
description Accurately evaluating predictive models, especially when built for spatial data, remains a challenge due to the limitations of traditional cross-validation techniques. While spatial cross-validation techniques have been developed to address those challenges, there is still a lack of clear guidance on when a spatial or non-spatial technique would yield the most precise and dependable assessment of a model's performance. This study investigates the effectiveness of the cross-validation method, spatial or non-spatial, in yielding accurate estimates by comparing three spatial techniques (Spatial K-Fold, Blocked, Buffered) and three non-spatial techniques (Random K-Fold, Bootstrap, Importance-Weighted). Using simulated landscapes, model performance is assessed across a range of metrics including Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Coefficient of Determination (R-Squared), and Bias. The findings emphasize the importance of considering spatial autocorrelation and covariate shifts, offering practical guidance on the selection of cross-validation techniques in spatial modelling contexts.
author2 Michele Nguyen
author_facet Michele Nguyen
Kan, Kai Huai Feng
format Final Year Project
author Kan, Kai Huai Feng
author_sort Kan, Kai Huai Feng
title Investigating spatial and non-spatial cross-validation techniques
title_short Investigating spatial and non-spatial cross-validation techniques
title_full Investigating spatial and non-spatial cross-validation techniques
title_fullStr Investigating spatial and non-spatial cross-validation techniques
title_full_unstemmed Investigating spatial and non-spatial cross-validation techniques
title_sort investigating spatial and non-spatial cross-validation techniques
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181133
_version_ 1816859020234850304