#TITLE_ALTERNATIVE#
The growth of machine learning technology in Indonesia, especially in text classification segment, goes rapidly. It comes with a lot of research that delivers many methods to optimize the performance of machine learning. To decide the method which has the best performance, one thing that should be d...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/22448 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:22448 |
---|---|
spelling |
id-itb.:224482017-09-29T14:32:53Z#TITLE_ALTERNATIVE# (NIM: 18213012), Hidayat Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/22448 The growth of machine learning technology in Indonesia, especially in text classification segment, goes rapidly. It comes with a lot of research that delivers many methods to optimize the performance of machine learning. To decide the method which has the best performance, one thing that should be done is to quantify the performance every single method. The quantifying process can be done by implementing them in sentiment analysis. Every feedback that’s been conveyed by Go-Jek’s customer in Twitter can be used as a dataset to analyze. Therefore, this research aims to decide which the alternative method best in doing sentiment analysis by using dataset about Go-Jek as tweet from Twitter in Indonesian. The approachment in text classification, generally, uses supervised learning. Some methods used in this research are Support Vector Machine, Naive Bayes Classifier, and Decision Tree Classifier. Every alternative methods used in this research is combined with weighting feature, such as TF and TF-IDF, and n-gram feature, such as unigram, bigram, and unigram+bigram. Naive Bayes Classifier combined with TF-IDF and unigram has best performance among the alternative methods used in this research with the 84% precision, 84% recall, and 84% F-Measure. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The growth of machine learning technology in Indonesia, especially in text classification segment, goes rapidly. It comes with a lot of research that delivers many methods to optimize the performance of machine learning. To decide the method which has the best performance, one thing that should be done is to quantify the performance every single method. The quantifying process can be done by implementing them in sentiment analysis. Every feedback that’s been conveyed by Go-Jek’s customer in Twitter can be used as a dataset to analyze. Therefore, this research aims to decide which the alternative method best in doing sentiment analysis by using dataset about Go-Jek as tweet from Twitter in Indonesian. The approachment in text classification, generally, uses supervised learning. Some methods used in this research are Support Vector Machine, Naive Bayes Classifier, and Decision Tree Classifier. Every alternative methods used in this research is combined with weighting feature, such as TF and TF-IDF, and n-gram feature, such as unigram, bigram, and unigram+bigram. Naive Bayes Classifier combined with TF-IDF and unigram has best performance among the alternative methods used in this research with the 84% precision, 84% recall, and 84% F-Measure. |
format |
Final Project |
author |
(NIM: 18213012), Hidayat |
spellingShingle |
(NIM: 18213012), Hidayat #TITLE_ALTERNATIVE# |
author_facet |
(NIM: 18213012), Hidayat |
author_sort |
(NIM: 18213012), Hidayat |
title |
#TITLE_ALTERNATIVE# |
title_short |
#TITLE_ALTERNATIVE# |
title_full |
#TITLE_ALTERNATIVE# |
title_fullStr |
#TITLE_ALTERNATIVE# |
title_full_unstemmed |
#TITLE_ALTERNATIVE# |
title_sort |
#title_alternative# |
url |
https://digilib.itb.ac.id/gdl/view/22448 |
_version_ |
1821120775675969536 |