FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION

There are many factors to influence stock price movements, and one of those factors is opinions about the company. This paper builds a model to analyze sentiments in news about a company, then uses the sentiment scores as a feature together with social media sentiments and historica...

Full description

Saved in:
Bibliographic Details
Main Author: Prameswari Ekaputri, Anindya
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/66312
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:66312
spelling id-itb.:663122022-06-27T22:39:29ZFINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION Prameswari Ekaputri, Anindya Indonesia Final Project sentiment analysis, stock price prediction, machine learning INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/66312 There are many factors to influence stock price movements, and one of those factors is opinions about the company. This paper builds a model to analyze sentiments in news about a company, then uses the sentiment scores as a feature together with social media sentiments and historical stock price to predict stock prices in the future. Analysis and prediction were carried out on four Indonesian banks: BBNI, BBRI, BMRI, and BBCA. Sentiment analysis model were built by modifying VADER, a rule-based model that uses lexicons, then experiments were conducted by comparing performances between general lexicon, financial lexicon, and combined general-financial lexicon. The best result from the experiment is averaged per day to transform the data into time-series. It is then used together with social media sentiments and historical stock price to build a price prediction model. Experiments for prediction model were conducted in two steps: the first step aims to determine the range of days that needs to be considered toget optimal prediction results; the second step aims to determine the best model architecture and feature combinations. In the first step, several ranges of days were experimented; the range that produced the best results were reused in the second step. In the second step, performances of random forest, LSTM, and CNN models were compared by varying the number of layers and units/filters along with varying features (with or without sentiment). Experiment results shows that sentiment analysis model with financial lexicon performs best with 73% accuracy, while prediction models perform best when trained without sentiment features. In general, CNN models give better results with MAPE values ranging from 1.5% until 2.8% text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description There are many factors to influence stock price movements, and one of those factors is opinions about the company. This paper builds a model to analyze sentiments in news about a company, then uses the sentiment scores as a feature together with social media sentiments and historical stock price to predict stock prices in the future. Analysis and prediction were carried out on four Indonesian banks: BBNI, BBRI, BMRI, and BBCA. Sentiment analysis model were built by modifying VADER, a rule-based model that uses lexicons, then experiments were conducted by comparing performances between general lexicon, financial lexicon, and combined general-financial lexicon. The best result from the experiment is averaged per day to transform the data into time-series. It is then used together with social media sentiments and historical stock price to build a price prediction model. Experiments for prediction model were conducted in two steps: the first step aims to determine the range of days that needs to be considered toget optimal prediction results; the second step aims to determine the best model architecture and feature combinations. In the first step, several ranges of days were experimented; the range that produced the best results were reused in the second step. In the second step, performances of random forest, LSTM, and CNN models were compared by varying the number of layers and units/filters along with varying features (with or without sentiment). Experiment results shows that sentiment analysis model with financial lexicon performs best with 73% accuracy, while prediction models perform best when trained without sentiment features. In general, CNN models give better results with MAPE values ranging from 1.5% until 2.8%
format Final Project
author Prameswari Ekaputri, Anindya
spellingShingle Prameswari Ekaputri, Anindya
FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION
author_facet Prameswari Ekaputri, Anindya
author_sort Prameswari Ekaputri, Anindya
title FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION
title_short FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION
title_full FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION
title_fullStr FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION
title_full_unstemmed FINANCIAL NEWS SENTIMENT ANALYSIS USING MODIFIED VADER FOR STOCK PRICE PREDICTION
title_sort financial news sentiment analysis using modified vader for stock price prediction
url https://digilib.itb.ac.id/gdl/view/66312
_version_ 1822933004727615488