Time series clustering and characterization

Among the vast volumes of data generated daily in our modern digital world, time series data represents a major category with broad applicability. Time series data refers to a collection of data points indexed by time, and it has major practical uses in the fields of finance, business, environment,...

Full description

Saved in:
Bibliographic Details
Main Author: Lie, Rhys
Other Authors: Michele Nguyen
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175124
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175124
record_format dspace
spelling sg-ntu-dr.10356-1751242024-04-26T15:40:45Z Time series clustering and characterization Lie, Rhys Michele Nguyen School of Computer Science and Engineering michele.nguyen@ntu.edu.sg Computer and Information Science Time series analysis Clustering Regime analysis Financial trading Among the vast volumes of data generated daily in our modern digital world, time series data represents a major category with broad applicability. Time series data refers to a collection of data points indexed by time, and it has major practical uses in the fields of finance, business, environment, healthcare, and more. There are multiple types of time series analysis, the most common of which are clustering and forecasting. While forecasting predicts future trends based on historical data, clustering plays a pivotal role in pre-processing, grouping time series into homogenous clusters based on their temporal trends and underlying characteristics. This unsupervised learning task provides valuable insights for pattern discovery, anomaly detection, and data organization. Without accurate cluster labels in most real-world data, we need to rely on tuning the parameters that affect time series clustering, primarily the clustering algorithm and distance metric, as well as suitable clustering metrics to evaluate our results across methods for different contexts. In this paper, I carefully evaluate the shape-based, feature-based, and model-based clustering algorithms, focusing on the most widely used approaches. The algorithms will be tested on a well-known synthetic time series dataset and the Standard & Poor 500 (SP500) financial time series for real-world data. I have chosen to explore the clustering of financial time series into market regimes, recognizing it as one of the lesser-known yet compelling applications of clustering relevant to our daily lives. This analysis proves invaluable in identifying recurring economic regimes and aiding retail investors in making informed investment decisions. Even in large portfolio and wealth management funds such as StashAway, Syfe, and Endowus, regime analysis plays a crucial role in guiding key investment decisions and portfolio rebalancing as market conditions change. The applicability of regime analysis also extends to diverse industries such as environmental science, engineering, healthcare, politics, and the social sciences. Bachelor's degree 2024-04-22T02:33:05Z 2024-04-22T02:33:05Z 2024 Final Year Project (FYP) Lie, R. (2024). Time series clustering and characterization. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175124 https://hdl.handle.net/10356/175124 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Time series analysis
Clustering
Regime analysis
Financial trading
spellingShingle Computer and Information Science
Time series analysis
Clustering
Regime analysis
Financial trading
Lie, Rhys
Time series clustering and characterization
description Among the vast volumes of data generated daily in our modern digital world, time series data represents a major category with broad applicability. Time series data refers to a collection of data points indexed by time, and it has major practical uses in the fields of finance, business, environment, healthcare, and more. There are multiple types of time series analysis, the most common of which are clustering and forecasting. While forecasting predicts future trends based on historical data, clustering plays a pivotal role in pre-processing, grouping time series into homogenous clusters based on their temporal trends and underlying characteristics. This unsupervised learning task provides valuable insights for pattern discovery, anomaly detection, and data organization. Without accurate cluster labels in most real-world data, we need to rely on tuning the parameters that affect time series clustering, primarily the clustering algorithm and distance metric, as well as suitable clustering metrics to evaluate our results across methods for different contexts. In this paper, I carefully evaluate the shape-based, feature-based, and model-based clustering algorithms, focusing on the most widely used approaches. The algorithms will be tested on a well-known synthetic time series dataset and the Standard & Poor 500 (SP500) financial time series for real-world data. I have chosen to explore the clustering of financial time series into market regimes, recognizing it as one of the lesser-known yet compelling applications of clustering relevant to our daily lives. This analysis proves invaluable in identifying recurring economic regimes and aiding retail investors in making informed investment decisions. Even in large portfolio and wealth management funds such as StashAway, Syfe, and Endowus, regime analysis plays a crucial role in guiding key investment decisions and portfolio rebalancing as market conditions change. The applicability of regime analysis also extends to diverse industries such as environmental science, engineering, healthcare, politics, and the social sciences.
author2 Michele Nguyen
author_facet Michele Nguyen
Lie, Rhys
format Final Year Project
author Lie, Rhys
author_sort Lie, Rhys
title Time series clustering and characterization
title_short Time series clustering and characterization
title_full Time series clustering and characterization
title_fullStr Time series clustering and characterization
title_full_unstemmed Time series clustering and characterization
title_sort time series clustering and characterization
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175124
_version_ 1814047088424517632