Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected retu...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/40070 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-40070 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-400702023-03-03T20:36:05Z Incremental Q-learning Partially Observable Markov Decision Process intraday trading system Goh, Choon Tat. Quek Hiok Chai School of Computer Engineering Centre for Computational Intelligence DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined. Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals. Bachelor of Engineering (Computer Science) 2010-06-10T02:15:37Z 2010-06-10T02:15:37Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40070 en Nanyang Technological University 107 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Goh, Choon Tat. Incremental Q-learning Partially Observable Markov Decision Process intraday trading system |
description |
Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined.
Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals. |
author2 |
Quek Hiok Chai |
author_facet |
Quek Hiok Chai Goh, Choon Tat. |
format |
Final Year Project |
author |
Goh, Choon Tat. |
author_sort |
Goh, Choon Tat. |
title |
Incremental Q-learning Partially Observable Markov Decision Process intraday trading system |
title_short |
Incremental Q-learning Partially Observable Markov Decision Process intraday trading system |
title_full |
Incremental Q-learning Partially Observable Markov Decision Process intraday trading system |
title_fullStr |
Incremental Q-learning Partially Observable Markov Decision Process intraday trading system |
title_full_unstemmed |
Incremental Q-learning Partially Observable Markov Decision Process intraday trading system |
title_sort |
incremental q-learning partially observable markov decision process intraday trading system |
publishDate |
2010 |
url |
http://hdl.handle.net/10356/40070 |
_version_ |
1759854029667565568 |