Incremental Q-learning Partially Observable Markov Decision Process intraday trading system

Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected retu...

Full description

Saved in:
Bibliographic Details
Main Author: Goh, Choon Tat.
Other Authors: Quek Hiok Chai
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/40070
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Neural network often performed only technical analysis in financial forecasting. Modern traders perform both fundamental and technical analysis to determine their next move. Volatility is a crucial factor considered by traders in deciding what trading signals to perform and determining expected returns. An autoregressive MLP-ARMA hybrid model combines both Autoregressive Model (AR) and Moving Average Model (MA) predictions. MLP produces a stationary time series prediction for Autoregressive Moving Average Model (ARMA) to work on. AR model predict the general trend of the intraday price swing. MA uses volatility to predict the intraday price fluctuation. An ensemble output is formed to further improve the opening intraday price swing prediction so that an opening trade can be determined. Traders often seek advices from a collective analysis of different risk appetites to arrive at a informed signal. The Partially Observed Markov Decision Process (POMDP) is able to generate a informed signal similar to that of a collective analysis of different risk appetite. Three type of traders with different risk appetites are being modeled with POMDP. The state of stock market identified with Relative Strength Index (RSI) and Exponential Moving Average (EMA) determined the appropriate trading policies in the reinforcement learning policies. The selected trading policies directly reinforces the trading signals generated by the autoregressive model. Such an approach relies on the learnt trading strategies rather than the predictive power of a model to generate profitable trading signals.