Incremental Q-learning Partially Observable Markov Decision Process intraday trading system

Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected retu...

Full description

Saved in:
Bibliographic Details
Main Author: Goh, Choon Tat.
Other Authors: Quek Hiok Chai
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/40070
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-40070
record_format dspace
spelling sg-ntu-dr.10356-400702023-03-03T20:36:05Z Incremental Q-learning Partially Observable Markov Decision Process intraday trading system Goh, Choon Tat. Quek Hiok Chai School of Computer Engineering Centre for Computational Intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined. Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals. Bachelor of Engineering (Computer Science) 2010-06-10T02:15:37Z 2010-06-10T02:15:37Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40070 en Nanyang Technological University 107 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Goh, Choon Tat.
Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
description Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined. Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals.
author2 Quek Hiok Chai
author_facet Quek Hiok Chai
Goh, Choon Tat.
format Final Year Project
author Goh, Choon Tat.
author_sort Goh, Choon Tat.
title Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_short Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_full Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_fullStr Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_full_unstemmed Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_sort incremental q-learning partially observable markov decision process intraday trading system
publishDate 2010
url http://hdl.handle.net/10356/40070
_version_ 1759854029667565568