Pairs trading strategy with unsupervised clustering methods
Machine learning has been gaining momentum and has been applied in various fields including finance in recent years. Most financial application of machine learning are used for predictive tasks, such as predicting returns or risk, which can be easily converted into supervised learning or reinforceme...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/137833 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-137833 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1378332023-03-04T19:40:12Z Pairs trading strategy with unsupervised clustering methods Toh, Alenson Jun Wei Heng Kok Hui, John Gerard School of Mechanical and Aerospace Engineering Durham University mkhheng@ntu.edu.sg Engineering::Aeronautical engineering Machine learning has been gaining momentum and has been applied in various fields including finance in recent years. Most financial application of machine learning are used for predictive tasks, such as predicting returns or risk, which can be easily converted into supervised learning or reinforcement learning problems. This paper proposes a framework to construct a novel clustering-based pairs trading strategy which is the first attempt of applying unsupervised learning method in finance literature. Three clustering methods namely K-means clustering, Density-based Spatial Clustering of Applications with Noise (DBSCAN) and Agglomerative clustering on pairs trading are explored on the US stock market. Comparing the performance of equally-weighted long-short portfolios from the 3 clustering methods, DBSCAN outperforms the other 2 significantly where it attains an annualized Sharpe ratio of 2.141 and an annualised mean return of 26.5% prior to transaction cost during January 2016 to December 2019. It is then proposed in this paper to define "pairs" using a new perspective, that is to find pairs in terms of the data density in a high-dimensional data structure. An industry breakdown of stocks chosen and traded by DBSCAN is also conducted to unveil the sources of profitability. It is discovered that most of the stocks traded by the clustering strategies are in the financial industry. This shows that financial institutions are very similar to one another in terms of financial performance and should give similar stock returns in an efficient market. DBSCAN strategy has also outperformed existing pairs trading strategy such as cointegration, distance, time series and supervised learning approach significantly. Bachelor of Engineering (Aerospace Engineering) 2020-04-16T01:34:39Z 2020-04-16T01:34:39Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/137833 en C095 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Aeronautical engineering |
spellingShingle |
Engineering::Aeronautical engineering Toh, Alenson Jun Wei Pairs trading strategy with unsupervised clustering methods |
description |
Machine learning has been gaining momentum and has been applied in various fields including finance in recent years. Most financial application of machine learning are used for predictive tasks, such as predicting returns or risk, which can be easily converted into supervised learning or reinforcement learning problems. This paper proposes a framework to construct a novel clustering-based pairs trading strategy which is the first attempt of applying unsupervised learning method in finance literature. Three clustering methods namely K-means clustering, Density-based Spatial Clustering of Applications with Noise (DBSCAN) and Agglomerative clustering on pairs trading are explored on the US stock market. Comparing the performance of equally-weighted long-short portfolios from the 3 clustering methods, DBSCAN outperforms the other 2 significantly where it attains an annualized Sharpe ratio of 2.141 and an annualised mean return of 26.5% prior to transaction cost during January 2016 to December 2019. It is then proposed in this paper to define "pairs" using a new perspective, that is to find pairs in terms of the data density in a high-dimensional data structure. An industry breakdown of stocks chosen and traded by DBSCAN is also conducted to unveil the sources of profitability. It is discovered that most of the stocks traded by the clustering strategies are in the financial industry. This shows that financial institutions are very similar to one another in terms of financial performance and should give similar stock returns in an efficient market. DBSCAN strategy has also outperformed existing pairs trading strategy such as cointegration, distance, time series and supervised learning approach significantly. |
author2 |
Heng Kok Hui, John Gerard |
author_facet |
Heng Kok Hui, John Gerard Toh, Alenson Jun Wei |
format |
Final Year Project |
author |
Toh, Alenson Jun Wei |
author_sort |
Toh, Alenson Jun Wei |
title |
Pairs trading strategy with unsupervised clustering methods |
title_short |
Pairs trading strategy with unsupervised clustering methods |
title_full |
Pairs trading strategy with unsupervised clustering methods |
title_fullStr |
Pairs trading strategy with unsupervised clustering methods |
title_full_unstemmed |
Pairs trading strategy with unsupervised clustering methods |
title_sort |
pairs trading strategy with unsupervised clustering methods |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/137833 |
_version_ |
1759857177495863296 |