Incremental Q-learning Partially Observable Markov Decision Process intraday trading system

Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected retu...

Full description

Saved in:

Bibliographic Details
Main Author:	Goh, Choon Tat.
Other Authors:	Quek Hiok Chai
Format:	Final Year Project
Language:	English
Published:	2010
Subjects:	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	http://hdl.handle.net/10356/40070
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-40070
record_format	dspace
spelling	sg-ntu-dr.10356-400702023-03-03T20:36:05Z Incremental Q-learning Partially Observable Markov Decision Process intraday trading system Goh, Choon Tat. Quek Hiok Chai School of Computer Engineering Centre for Computational Intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined. Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals. Bachelor of Engineering (Computer Science) 2010-06-10T02:15:37Z 2010-06-10T02:15:37Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40070 en Nanyang Technological University 107 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Goh, Choon Tat. Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
description	Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined. Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals.
author2	Quek Hiok Chai
author_facet	Quek Hiok Chai Goh, Choon Tat.
format	Final Year Project
author	Goh, Choon Tat.
author_sort	Goh, Choon Tat.
title	Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_short	Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_full	Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_fullStr	Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_full_unstemmed	Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
title_sort	incremental q-learning partially observable markov decision process intraday trading system
publishDate	2010
url	http://hdl.handle.net/10356/40070
_version_	1759854029667565568

Incremental Q-learning Partially Observable Markov Decision Process intraday trading system

Similar Items