Stock selection using machine learning

The final year project involves an empirical investigation of the predictability of stock returns and feasibility of building a stock selection application using machine learning. The purpose of the project is to build a successful application of machine learning on S&P 500 data. The proposed...

Full description

Saved in:
Bibliographic Details
Main Author: Shen, Nan.
Other Authors: Vivekanand Gopalkrishnan
Format: Final Year Project
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10356/17042
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-17042
record_format dspace
spelling sg-ntu-dr.10356-170422023-03-03T20:24:13Z Stock selection using machine learning Shen, Nan. Vivekanand Gopalkrishnan School of Computer Engineering Centre for Advanced Information Systems DRNTU::Engineering::Computer science and engineering::Computer applications::Computers in other systems DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence The final year project involves an empirical investigation of the predictability of stock returns and feasibility of building a stock selection application using machine learning. The purpose of the project is to build a successful application of machine learning on S&P 500 data. The proposed application is capable to predict future stock returns and make wise decisions to select stocks and construct portfolios. Econometric analysis and empirical financial modeling involving machine learning methods are widely studied and adopted both in literature and practice. However, prediction models employing artificial intelligence techniques such as neural networks and decision trees are used arbitrarily or implemented in ad-hoc scenarios, such as feature selection, data preprocessing, portfolio construction methods, and evaluation criteria etc. There has never been a proper and comprehensive survey particularly on stock selection applications. Also, economic indicators, which proved to be an important factor in predicting stock returns by economist, have rarely been used in machine learning applications. In this project, a comprehensive and detailed survey on machine learning applications on stock selection is presented. A framework is proposed to take considerations of all possible choices and parameters in relevant practical issues. Thereafter, a model-tree based stock selection application using technical, fundamental and economic indicators is built with a Greedy Search procedure in parameter selection. We show its validity and superiority by comparing against both S&P500 benchmark and several previous machine learning applications. And finally we apply our model for a case study on the financial crisis periods and demonstrate its capability to provide economically significant profits for investors. Bachelor of Engineering (Computer Science) 2009-05-29T04:23:28Z 2009-05-29T04:23:28Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/17042 en Nanyang Technological University 70 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computer applications::Computers in other systems
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle DRNTU::Engineering::Computer science and engineering::Computer applications::Computers in other systems
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Shen, Nan.
Stock selection using machine learning
description The final year project involves an empirical investigation of the predictability of stock returns and feasibility of building a stock selection application using machine learning. The purpose of the project is to build a successful application of machine learning on S&P 500 data. The proposed application is capable to predict future stock returns and make wise decisions to select stocks and construct portfolios. Econometric analysis and empirical financial modeling involving machine learning methods are widely studied and adopted both in literature and practice. However, prediction models employing artificial intelligence techniques such as neural networks and decision trees are used arbitrarily or implemented in ad-hoc scenarios, such as feature selection, data preprocessing, portfolio construction methods, and evaluation criteria etc. There has never been a proper and comprehensive survey particularly on stock selection applications. Also, economic indicators, which proved to be an important factor in predicting stock returns by economist, have rarely been used in machine learning applications. In this project, a comprehensive and detailed survey on machine learning applications on stock selection is presented. A framework is proposed to take considerations of all possible choices and parameters in relevant practical issues. Thereafter, a model-tree based stock selection application using technical, fundamental and economic indicators is built with a Greedy Search procedure in parameter selection. We show its validity and superiority by comparing against both S&P500 benchmark and several previous machine learning applications. And finally we apply our model for a case study on the financial crisis periods and demonstrate its capability to provide economically significant profits for investors.
author2 Vivekanand Gopalkrishnan
author_facet Vivekanand Gopalkrishnan
Shen, Nan.
format Final Year Project
author Shen, Nan.
author_sort Shen, Nan.
title Stock selection using machine learning
title_short Stock selection using machine learning
title_full Stock selection using machine learning
title_fullStr Stock selection using machine learning
title_full_unstemmed Stock selection using machine learning
title_sort stock selection using machine learning
publishDate 2009
url http://hdl.handle.net/10356/17042
_version_ 1759856102122455040