Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
Human emotion and individual opinion are forms of subjective information that greatly affect how humans behave and interact [1]. Text such as online posts is one way of expressing what a person is thinking and feeling. The coronavirus (COVID-19) pandemic has spread globally s...
Main Author: | Chong, You Min |
---|---|
Other Authors: | Anwitaman Datta |
Format: | Final Year Project |
Language: | English |
Published: | Nanyang Technological University 2021 |
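Subjects: | Engineering::Computer science and engineering::Computing methodologies::Document and text processing |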
Online Access: | https://hdl.handle.net/10356/153247 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-153247 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-153247; 2021-11-17T00:51:36Z; Topic extraction and sentiment analysis of a subreddit (r/coronavirus); Chong, You Min; Anwitaman Datta; School of Computer Science and Engineering; Anwitaman@ntu.edu.sg; Engineering::Computer science and engineering::Computing methodologies::Document and text processing; Bachelor of Engineering (Computer Science); 2021; Final Year Project (FYP); Chong, Y. M. (2021). Topic extraction and sentiment analysis of a subreddit (r/coronavirus). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153247; en; SCSE20-0941; application/pdf; Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
description |
Human emotion and individual opinion are forms of subjective information that greatly affect how humans behave and interact [1]. Text such as online posts is one way of expressing what a person is thinking and feeling.
The coronavirus (COVID-19) pandemic has spread globally since the first outbreak in early 2020 and is the first global crisis since SARS in 2002. COVID-19 has negatively impacted the world in more ways than one and has completely changed the way people live.
This study explores and analyzes the subreddit /r/Coronavirus to observe and visualize trends in how COVID-related topics are discussed. Data was collected through Reddit’s API and managed in a MySQL database, and the findings were presented using Python. NLP techniques were applied during data analysis, namely text pre-processing, topic modelling, and sentiment analysis, each carried out with existing libraries.
The results showed that some negative sentiment was present among the topics discussed, and that vaccines were frequently mentioned as a key topic. These results could be applied further to improve how topics are identified and interpreted. |
author2 |
Anwitaman Datta |
author_facet |
Anwitaman Datta; Chong, You Min |
format |
Final Year Project |
author |
Chong, You Min |
title |
Topic extraction and sentiment analysis of a subreddit (r/coronavirus) |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/153247 |
_version_ |
1718368104461893632 |
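
The description in this record names three analysis steps: text pre-processing, topic modelling, and sentiment analysis. The record does not say which libraries or models the project used, so the following is only a minimal sketch of the general idea using the Python standard library. Term-frequency counting stands in for topic modelling and a word-list score stands in for sentiment analysis; neither is the project's actual method. The word lists, function names, and example posts are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical, tiny word lists for illustration only; a real analysis would
# use a full stop-word list and a proper sentiment lexicon or model.
STOP_WORDS = {"the", "a", "an", "is", "are", "was", "to", "of", "and",
              "in", "for", "my", "i", "it", "this", "so"}
POSITIVE = {"good", "great", "hope", "relief", "safe", "effective"}
NEGATIVE = {"bad", "worried", "fear", "death", "sick", "terrible"}


def preprocess(text):
    """Lower-case the text, keep alphabetic tokens, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def top_terms(posts, n=3):
    """Return the n most frequent terms across all posts.

    This is a crude stand-in for topic modelling, not a topic model.
    """
    counts = Counter(tok for post in posts for tok in preprocess(post))
    return [term for term, _ in counts.most_common(n)]


def sentiment_score(text):
    """Count positive words minus negative words; below zero suggests a negative tone."""
    tokens = preprocess(text)
    positives = sum(t in POSITIVE for t in tokens)
    negatives = sum(t in NEGATIVE for t in tokens)
    return positives - negatives


# Made-up example posts, not data from the study.
posts = [
    "The vaccine rollout is great news",
    "I am worried about the new variant, and so sick of lockdown",
    "Got my vaccine today, feeling safe",
]

if __name__ == "__main__":
    print("Most frequent terms:", top_terms(posts))
    for post in posts:
        print(sentiment_score(post), post)
```

In a fuller pipeline, the posts would come from Reddit's API and be stored in a database before this step, as the description outlines.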