Topic extraction and sentiment analysis of a subreddit (r/coronavirus)

Human emotion and individual opinion are subjective information that greatly affect how humans behave and interact [1]. Textual information such as online posting is one such way of expressing what a person is thinking and feeling. The coronavirus (COVID-19) pandemic has spread its roots globally s...

Full description

Saved in:
Bibliographic Details
Main Author: Chong, You Min
Other Authors: Anwitaman Datta
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/153247
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-153247
record_format dspace
spelling sg-ntu-dr.10356-1532472021-11-17T00:51:36Z Topic extraction and sentiment analysis of a subreddit (r/coronavirus) Chong, You Min Anwitaman Datta School of Computer Science and Engineering Anwitaman@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Human emotion and individual opinion are subjective information that greatly affect how humans behave and interact [1]. Textual information such as online posting is one such way of expressing what a person is thinking and feeling. The coronavirus (COVID-19) pandemic has spread its roots globally since the first outbreak in early 2020, and the first global crisis since SARS in 2002. COVID-19 has negatively impacted the world in more ways than one and has completely changed the way lives are being led. In this study, the objective is to explore and perform analysis on the subreddit /r/Coronavirus, to observe and visualize the trends in which covid-related topics are being discussed. This is implemented with the use of Reddit’s API for data collection, MySQL for database management, and Python to present the findings. NLP techniques were applied during data analysis, including text pre-processing, topic modelling, and sentiment analysis. In addition, various libraries were utilized to carry out the aforementioned NLP techniques. The results showed that some negative sentiments were present among topics discussed, and vaccines were also commonly mentioned as a key topic. Further application of these results may be implemented to improve the ways in which topics are being identified and interpreted. Bachelor of Engineering (Computer Science) 2021-11-17T00:51:36Z 2021-11-17T00:51:36Z 2021 Final Year Project (FYP) Chong, Y. M. (2021). Topic extraction and sentiment analysis of a subreddit (r/coronavirus). Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153247 https://hdl.handle.net/10356/153247 en SCSE20-0941 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Document and text processing
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Chong, You Min
Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
description Human emotion and individual opinion are subjective information that greatly affect how humans behave and interact [1]. Textual information such as online posting is one such way of expressing what a person is thinking and feeling. The coronavirus (COVID-19) pandemic has spread its roots globally since the first outbreak in early 2020, and the first global crisis since SARS in 2002. COVID-19 has negatively impacted the world in more ways than one and has completely changed the way lives are being led. In this study, the objective is to explore and perform analysis on the subreddit /r/Coronavirus, to observe and visualize the trends in which covid-related topics are being discussed. This is implemented with the use of Reddit’s API for data collection, MySQL for database management, and Python to present the findings. NLP techniques were applied during data analysis, including text pre-processing, topic modelling, and sentiment analysis. In addition, various libraries were utilized to carry out the aforementioned NLP techniques. The results showed that some negative sentiments were present among topics discussed, and vaccines were also commonly mentioned as a key topic. Further application of these results may be implemented to improve the ways in which topics are being identified and interpreted.
author2 Anwitaman Datta
author_facet Anwitaman Datta
Chong, You Min
format Final Year Project
author Chong, You Min
author_sort Chong, You Min
title Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
title_short Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
title_full Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
title_fullStr Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
title_full_unstemmed Topic extraction and sentiment analysis of a subreddit (r/coronavirus)
title_sort topic extraction and sentiment analysis of a subreddit (r/coronavirus)
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/153247
_version_ 1718368104461893632