Development of a classification system on big data set using machine learning techniques

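Millions of people around the world regularly use online social media platforms such as Facebook, Twitter and Reddit. As a result, a huge amount of text data is available online, which presents a good opportunity to study and analyse the sentiment of texts. This project aims to create classification models based on a Twitter dataset that classify Tweets into one of three sentiment classes: positive, negative, or neutral. Seven different classification models were explored and tuned, obtaining accuracies ranging from 55% to 70%. A Telegram bot that outputs the sentiment of user inputs using the trained classification models was built. Using Twitter APIs to stream Tweets, a real-time graph was also built that shows the sentiment of a specified keyword over time.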

Bibliographic Details
Main Author: Tan, Zhi Ler
Other Authors: Chan Chee Keong
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2021
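Subjects: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence; Engineering::Electrical and electronic engineering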
Online Access: https://hdl.handle.net/10356/149132
Institution: Nanyang Technological University
id sg-ntu-dr.10356-149132
record_format dspace
spelling sg-ntu-dr.10356-149132 2023-07-07T17:41:07Z
Development of a classification system on big data set using machine learning techniques
Tan, Zhi Ler
Chan Chee Keong
School of Electrical and Electronic Engineering
ECKCHAN@ntu.edu.sg
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
Millions of people around the world regularly use online social media platforms such as Facebook, Twitter and Reddit. As a result, a huge amount of text data is available online, which presents a good opportunity to study and analyse the sentiment of texts. This project aims to create classification models based on a Twitter dataset that classify Tweets into one of three sentiment classes: positive, negative, or neutral. Seven different classification models were explored and tuned, obtaining accuracies ranging from 55% to 70%. A Telegram bot that outputs the sentiment of user inputs using the trained classification models was built. Using Twitter APIs to stream Tweets, a real-time graph was also built that shows the sentiment of a specified keyword over time.
Bachelor of Engineering (Information Engineering and Media)
2021-05-27T06:54:54Z
2021-05-27T06:54:54Z
2021
Final Year Project (FYP)
Tan, Z. L. (2021). Development of a classification system on big data set using machine learning techniques. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149132
https://hdl.handle.net/10356/149132
en
application/pdf
Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
description Millions of people around the world regularly use online social media platforms such as Facebook, Twitter and Reddit. As a result, a huge amount of text data is available online, which presents a good opportunity to study and analyse the sentiment of texts. This project aims to create classification models based on a Twitter dataset that classify Tweets into one of three sentiment classes: positive, negative, or neutral. Seven different classification models were explored and tuned, obtaining accuracies ranging from 55% to 70%. A Telegram bot that outputs the sentiment of user inputs using the trained classification models was built. Using Twitter APIs to stream Tweets, a real-time graph was also built that shows the sentiment of a specified keyword over time.
author2 Chan Chee Keong
format Final Year Project
author Tan, Zhi Ler
title Development of a classification system on big data set using machine learning techniques
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/149132