Time series & sentiment analysis of top cryptocurrencies
Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need of a central financial institute to oversee online payments made from one party to another, Bitcoin would use a blockchain framework that would be “powered by users with no central authority or...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/148200 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-148200 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1482002021-04-27T06:56:47Z Time series & sentiment analysis of top cryptocurrencies Yeoh, Chester Fu Soon Anwitaman Datta School of Computer Science and Engineering Anwitaman@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need of a central financial institute to oversee online payments made from one party to another, Bitcoin would use a blockchain framework that would be “powered by users with no central authority or middlemen” .Fast forward close to a decade later, Bitcoin has seen a meteoric rise in its popularity, exceeding 300,000 transactions per day (Appendix A). Overall, it has also blossomed into a booming market worth over USD 1 Tn (as of 9 March 2021) and ushered in a new era of cryptocurrencies. The benefits of cryptocurrencies such as accessibility and user autonomy (i.e., lack of an intermediary authority) (Reiff, 2021) are well documented. However, most cryptocurrencies suffer from a key drawback - high price fluctuations. Bitcoin, for example, increased its value by 1900% in 2017 before plunging below USD 8,000 per coin up from its high of USD 19,000 in the 2018 Jan/Feb crash. Since cryptocurrency price research is still a relatively unexplored area and equilibrium market prices are poorly understood (Conrad, 2018), this project seeks to conduct a sentiment analysis to explore possible drivers such as social media and news reports that leads to these price fluctuations. Additionally, other non-sentiment factors such as Public Interest, Community Data and Developer Data will also be analysed for their respective price impacts. Under the sentiment analysis portion of this FYP, Reddit and Twitter social media content & articles from 30,000 news sources and blogs will be mined and analysed using a Valence Aware Dictionary for Sentiment Reasoning (VADER) model to facilitate a correlation analysis against prices of top cryptocurrencies such as Bitcoin (BTC) and Ethereum (ETH). Non-sentiment factors are also extracted as inputs for building a Linear Regression model as well as a Random Forest Classifier. The FYP results indicate that there are weak correlations (|r|< 0.1) with news and social media sentiments across both cryptocurrencies. On the other hand, non-sentiment factors were strongly correlated with price across the board (|r|>0.7) and specific factors such as Number of Twitter Followers for a cryptocurrency’s Twitter account (i.e. @Bitcoin) and Number of Reddit Accounts following a cryptocurrency’s subreddit (i.e. r/Bitcoin) appeared to be strong price drivers under the high-accuracy regression and classification models. Bachelor of Engineering (Computer Science) 2021-04-27T06:54:15Z 2021-04-27T06:54:15Z 2021 Final Year Project (FYP) Yeoh, C. F. S. (2021). Time series & sentiment analysis of top cryptocurrencies. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148200 https://hdl.handle.net/10356/148200 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Yeoh, Chester Fu Soon Time series & sentiment analysis of top cryptocurrencies |
description |
Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need of a central financial institute to oversee online payments made from one party to another, Bitcoin would use a blockchain framework that would be “powered by users with no central authority or middlemen” .Fast forward close to a decade later, Bitcoin has seen a meteoric rise in its popularity, exceeding 300,000 transactions per day (Appendix A). Overall, it has also blossomed into a booming market worth over USD 1 Tn (as of 9 March 2021) and ushered in a new era of cryptocurrencies.
The benefits of cryptocurrencies such as accessibility and user autonomy (i.e., lack of an intermediary authority) (Reiff, 2021) are well documented. However, most cryptocurrencies suffer from a key drawback - high price fluctuations. Bitcoin, for example, increased its value by 1900% in 2017 before plunging below USD 8,000 per coin up from its high of USD 19,000 in the 2018 Jan/Feb crash. Since cryptocurrency price research is still a relatively unexplored area and equilibrium market prices are poorly understood (Conrad, 2018), this project seeks to conduct a sentiment analysis to explore possible drivers such as social media and news reports that leads to these price fluctuations. Additionally, other non-sentiment factors such as Public Interest, Community Data and Developer Data will also be analysed for their respective price impacts.
Under the sentiment analysis portion of this FYP, Reddit and Twitter social media content & articles from 30,000 news sources and blogs will be mined and analysed using a Valence Aware Dictionary for Sentiment Reasoning (VADER) model to facilitate a correlation analysis against prices of top cryptocurrencies such as Bitcoin (BTC) and Ethereum (ETH). Non-sentiment factors are also extracted as inputs for building a Linear Regression model as well as a Random Forest Classifier.
The FYP results indicate that there are weak correlations (|r|< 0.1) with news and social media sentiments across both cryptocurrencies. On the other hand, non-sentiment factors were strongly correlated with price across the board (|r|>0.7) and specific factors such as Number of Twitter Followers for a cryptocurrency’s Twitter account (i.e. @Bitcoin) and Number of Reddit Accounts following a cryptocurrency’s subreddit (i.e. r/Bitcoin) appeared to be strong price drivers under the high-accuracy regression and classification models. |
author2 |
Anwitaman Datta |
author_facet |
Anwitaman Datta Yeoh, Chester Fu Soon |
format |
Final Year Project |
author |
Yeoh, Chester Fu Soon |
author_sort |
Yeoh, Chester Fu Soon |
title |
Time series & sentiment analysis of top cryptocurrencies |
title_short |
Time series & sentiment analysis of top cryptocurrencies |
title_full |
Time series & sentiment analysis of top cryptocurrencies |
title_fullStr |
Time series & sentiment analysis of top cryptocurrencies |
title_full_unstemmed |
Time series & sentiment analysis of top cryptocurrencies |
title_sort |
time series & sentiment analysis of top cryptocurrencies |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/148200 |
_version_ |
1698713691931803648 |