Development of a classification system on big data set using machine learning techniques
In this day and age, there are millions of people all around the world who are regular users of online social media platforms like Facebook, Twitter and Reddit. This has resulted in a huge amount of text data to be available online and is a good opportunity to be used to study and analyse sentime...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/149132 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-149132 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1491322023-07-07T17:41:07Z Development of a classification system on big data set using machine learning techniques Tan, Zhi Ler Chan Chee Keong School of Electrical and Electronic Engineering ECKCHAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering In this day and age, there are millions of people all around the world who are regular users of online social media platforms like Facebook, Twitter and Reddit. This has resulted in a huge amount of text data to be available online and is a good opportunity to be used to study and analyse sentiments of texts. This project aims to create classification models based on a Twitter dataset to classify Tweets to their sentiment class of either positive, negative, or neutral. 7 different classification models were explored and tuned to obtain accuracies ranging from 55%-70%. A Telegram bot that can output the sentiment of user inputs by using the trained classification models was made. By using Twitter APIs to stream Tweets, a real-time graph was also made which shows sentiment over time of a specified keyword. Bachelor of Engineering (Information Engineering and Media) 2021-05-27T06:54:54Z 2021-05-27T06:54:54Z 2021 Final Year Project (FYP) Tan, Z. L. (2021). Development of a classification system on big data set using machine learning techniques. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149132 https://hdl.handle.net/10356/149132 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering Tan, Zhi Ler Development of a classification system on big data set using machine learning techniques |
description |
In this day and age, there are millions of people all around the world who are regular
users of online social media platforms like Facebook, Twitter and Reddit. This has
resulted in a huge amount of text data to be available online and is a good
opportunity to be used to study and analyse sentiments of texts.
This project aims to create classification models based on a Twitter dataset to
classify Tweets to their sentiment class of either positive, negative, or neutral. 7
different classification models were explored and tuned to obtain accuracies ranging
from 55%-70%.
A Telegram bot that can output the sentiment of user inputs by using the trained
classification models was made. By using Twitter APIs to stream Tweets, a real-time
graph was also made which shows sentiment over time of a specified keyword. |
author2 |
Chan Chee Keong |
author_facet |
Chan Chee Keong Tan, Zhi Ler |
format |
Final Year Project |
author |
Tan, Zhi Ler |
author_sort |
Tan, Zhi Ler |
title |
Development of a classification system on big data set using machine learning techniques |
title_short |
Development of a classification system on big data set using machine learning techniques |
title_full |
Development of a classification system on big data set using machine learning techniques |
title_fullStr |
Development of a classification system on big data set using machine learning techniques |
title_full_unstemmed |
Development of a classification system on big data set using machine learning techniques |
title_sort |
development of a classification system on big data set using machine learning techniques |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/149132 |
_version_ |
1772826135867949056 |