Time series & sentiment analysis of top cryptocurrencies

Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need of a central financial institute to oversee online payments made from one party to another, Bitcoin would use a blockchain framework that would be “powered by users with no central authority or...

Full description

Saved in:
Bibliographic Details
Main Author: Yeoh, Chester Fu Soon
Other Authors: Anwitaman Datta
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148200
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148200
record_format dspace
spelling sg-ntu-dr.10356-1482002021-04-27T06:56:47Z Time series & sentiment analysis of top cryptocurrencies Yeoh, Chester Fu Soon Anwitaman Datta School of Computer Science and Engineering Anwitaman@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need of a central financial institute to oversee online payments made from one party to another, Bitcoin would use a blockchain framework that would be “powered by users with no central authority or middlemen” .Fast forward close to a decade later, Bitcoin has seen a meteoric rise in its popularity, exceeding 300,000 transactions per day (Appendix A). Overall, it has also blossomed into a booming market worth over USD 1 Tn (as of 9 March 2021) and ushered in a new era of cryptocurrencies. The benefits of cryptocurrencies such as accessibility and user autonomy (i.e., lack of an intermediary authority) (Reiff, 2021) are well documented. However, most cryptocurrencies suffer from a key drawback - high price fluctuations. Bitcoin, for example, increased its value by 1900% in 2017 before plunging below USD 8,000 per coin up from its high of USD 19,000 in the 2018 Jan/Feb crash. Since cryptocurrency price research is still a relatively unexplored area and equilibrium market prices are poorly understood (Conrad, 2018), this project seeks to conduct a sentiment analysis to explore possible drivers such as social media and news reports that leads to these price fluctuations. Additionally, other non-sentiment factors such as Public Interest, Community Data and Developer Data will also be analysed for their respective price impacts. Under the sentiment analysis portion of this FYP, Reddit and Twitter social media content & articles from 30,000 news sources and blogs will be mined and analysed using a Valence Aware Dictionary for Sentiment Reasoning (VADER) model to facilitate a correlation analysis against prices of top cryptocurrencies such as Bitcoin (BTC) and Ethereum (ETH). Non-sentiment factors are also extracted as inputs for building a Linear Regression model as well as a Random Forest Classifier. The FYP results indicate that there are weak correlations (|r|< 0.1) with news and social media sentiments across both cryptocurrencies. On the other hand, non-sentiment factors were strongly correlated with price across the board (|r|>0.7) and specific factors such as Number of Twitter Followers for a cryptocurrency’s Twitter account (i.e. @Bitcoin) and Number of Reddit Accounts following a cryptocurrency’s subreddit (i.e. r/Bitcoin) appeared to be strong price drivers under the high-accuracy regression and classification models. Bachelor of Engineering (Computer Science) 2021-04-27T06:54:15Z 2021-04-27T06:54:15Z 2021 Final Year Project (FYP) Yeoh, C. F. S. (2021). Time series & sentiment analysis of top cryptocurrencies. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148200 https://hdl.handle.net/10356/148200 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Yeoh, Chester Fu Soon
Time series & sentiment analysis of top cryptocurrencies
description Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need of a central financial institute to oversee online payments made from one party to another, Bitcoin would use a blockchain framework that would be “powered by users with no central authority or middlemen” .Fast forward close to a decade later, Bitcoin has seen a meteoric rise in its popularity, exceeding 300,000 transactions per day (Appendix A). Overall, it has also blossomed into a booming market worth over USD 1 Tn (as of 9 March 2021) and ushered in a new era of cryptocurrencies. The benefits of cryptocurrencies such as accessibility and user autonomy (i.e., lack of an intermediary authority) (Reiff, 2021) are well documented. However, most cryptocurrencies suffer from a key drawback - high price fluctuations. Bitcoin, for example, increased its value by 1900% in 2017 before plunging below USD 8,000 per coin up from its high of USD 19,000 in the 2018 Jan/Feb crash. Since cryptocurrency price research is still a relatively unexplored area and equilibrium market prices are poorly understood (Conrad, 2018), this project seeks to conduct a sentiment analysis to explore possible drivers such as social media and news reports that leads to these price fluctuations. Additionally, other non-sentiment factors such as Public Interest, Community Data and Developer Data will also be analysed for their respective price impacts. Under the sentiment analysis portion of this FYP, Reddit and Twitter social media content & articles from 30,000 news sources and blogs will be mined and analysed using a Valence Aware Dictionary for Sentiment Reasoning (VADER) model to facilitate a correlation analysis against prices of top cryptocurrencies such as Bitcoin (BTC) and Ethereum (ETH). Non-sentiment factors are also extracted as inputs for building a Linear Regression model as well as a Random Forest Classifier. The FYP results indicate that there are weak correlations (|r|< 0.1) with news and social media sentiments across both cryptocurrencies. On the other hand, non-sentiment factors were strongly correlated with price across the board (|r|>0.7) and specific factors such as Number of Twitter Followers for a cryptocurrency’s Twitter account (i.e. @Bitcoin) and Number of Reddit Accounts following a cryptocurrency’s subreddit (i.e. r/Bitcoin) appeared to be strong price drivers under the high-accuracy regression and classification models.
author2 Anwitaman Datta
author_facet Anwitaman Datta
Yeoh, Chester Fu Soon
format Final Year Project
author Yeoh, Chester Fu Soon
author_sort Yeoh, Chester Fu Soon
title Time series & sentiment analysis of top cryptocurrencies
title_short Time series & sentiment analysis of top cryptocurrencies
title_full Time series & sentiment analysis of top cryptocurrencies
title_fullStr Time series & sentiment analysis of top cryptocurrencies
title_full_unstemmed Time series & sentiment analysis of top cryptocurrencies
title_sort time series & sentiment analysis of top cryptocurrencies
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/148200
_version_ 1698713691931803648