#TITLE_ALTERNATIVE#

The growth of machine learning technology in Indonesia, especially in text classification segment, goes rapidly. It comes with a lot of research that delivers many methods to optimize the performance of machine learning. To decide the method which has the best performance, one thing that should be d...

Full description

Saved in:
Bibliographic Details
Main Author: (NIM: 18213012), Hidayat
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/22448
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:22448
spelling id-itb.:224482017-09-29T14:32:53Z#TITLE_ALTERNATIVE# (NIM: 18213012), Hidayat Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/22448 The growth of machine learning technology in Indonesia, especially in text classification segment, goes rapidly. It comes with a lot of research that delivers many methods to optimize the performance of machine learning. To decide the method which has the best performance, one thing that should be done is to quantify the performance every single method. The quantifying process can be done by implementing them in sentiment analysis. Every feedback that’s been conveyed by Go-Jek’s customer in Twitter can be used as a dataset to analyze. Therefore, this research aims to decide which the alternative method best in doing sentiment analysis by using dataset about Go-Jek as tweet from Twitter in Indonesian. The approachment in text classification, generally, uses supervised learning. Some methods used in this research are Support Vector Machine, Naive Bayes Classifier, and Decision Tree Classifier. Every alternative methods used in this research is combined with weighting feature, such as TF and TF-IDF, and n-gram feature, such as unigram, bigram, and unigram+bigram. Naive Bayes Classifier combined with TF-IDF and unigram has best performance among the alternative methods used in this research with the 84% precision, 84% recall, and 84% F-Measure. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description The growth of machine learning technology in Indonesia, especially in text classification segment, goes rapidly. It comes with a lot of research that delivers many methods to optimize the performance of machine learning. To decide the method which has the best performance, one thing that should be done is to quantify the performance every single method. The quantifying process can be done by implementing them in sentiment analysis. Every feedback that’s been conveyed by Go-Jek’s customer in Twitter can be used as a dataset to analyze. Therefore, this research aims to decide which the alternative method best in doing sentiment analysis by using dataset about Go-Jek as tweet from Twitter in Indonesian. The approachment in text classification, generally, uses supervised learning. Some methods used in this research are Support Vector Machine, Naive Bayes Classifier, and Decision Tree Classifier. Every alternative methods used in this research is combined with weighting feature, such as TF and TF-IDF, and n-gram feature, such as unigram, bigram, and unigram+bigram. Naive Bayes Classifier combined with TF-IDF and unigram has best performance among the alternative methods used in this research with the 84% precision, 84% recall, and 84% F-Measure.
format Final Project
author (NIM: 18213012), Hidayat
spellingShingle (NIM: 18213012), Hidayat
#TITLE_ALTERNATIVE#
author_facet (NIM: 18213012), Hidayat
author_sort (NIM: 18213012), Hidayat
title #TITLE_ALTERNATIVE#
title_short #TITLE_ALTERNATIVE#
title_full #TITLE_ALTERNATIVE#
title_fullStr #TITLE_ALTERNATIVE#
title_full_unstemmed #TITLE_ALTERNATIVE#
title_sort #title_alternative#
url https://digilib.itb.ac.id/gdl/view/22448
_version_ 1821120775675969536