Feature selection methods for financial engineering

This paper explores the application of feature selection methods for financial engineering, and in particular the prediction of stock price movements. In the literature of feature selection methods, wrapper methods are found to be more accurate in generating the optimal subset. This is because the i...

Full description

Saved in:
Bibliographic Details
Main Author: Sridhar, Ashwin
Other Authors: Chen Lihui
Format: Final Year Project
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/64713
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:This paper explores the application of feature selection methods for financial engineering, and in particular the prediction of stock price movements. In the literature of feature selection methods, wrapper methods are found to be more accurate in generating the optimal subset. This is because the induction algorithm is used as a black box in the feature selection process. In this paper, a linear forward selection method is proposed as a subset search method to reduce the number of iterations and thus increase the efficiency of the wrapper. Stock data from 7 blue chip multi-national companies that are part of the Dow Jones Industrial Average (DJIA) from different sectors is used. The data used consists of over 5 years of data and 65 features. The class labels are generated based on the percentage change in closing price the following day. A wrapper method with linear feature selection is used to select the optimal feature set. Subsequently, 10-fold cross validation is performed to analyze the performance. We expect the wrapper method to perform traditional filter methods. To add credibility to this hypothesis, consistency based feature selection, correlation feature selection, fast correlation based feature selection are also performed on the data sets. A comparative analysis is drawn between the methods and the wrapper method using linear forward selection is found to generally out-perform the other methods.