GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD

Twitter has been widely used as a communication tool for emergency response when disasters occur in various countries. Emergency response teams or researchers use hashtags for emergency disaster searches. The use of disaster hashtags in particular Twitter does not have a standard format, this makes...

Full description

Saved in:
Bibliographic Details
Main Author: WIATI GUSTI - NIM: 23515041 , KHARISMA
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/28476
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:28476
spelling id-itb.:284762018-10-01T10:08:38ZGENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD WIATI GUSTI - NIM: 23515041 , KHARISMA Indonesia Theses INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/28476 Twitter has been widely used as a communication tool for emergency response when disasters occur in various countries. Emergency response teams or researchers use hashtags for emergency disaster searches. The use of disaster hashtags in particular Twitter does not have a standard format, this makes it difficult to search and collect data for emergency response. OCHA (Office for Coordination of Humanitarian Affairs) proposes to standardize the hashtag for emergency response. <br /> <br /> <br /> <br /> <br /> This study proposes to generate automatic disaster hashtag in accordance with OCHA standards. 2,685 tweets preprocessed and resulting 1,309 tweets in a clean dataset. The research uses the word representation method with Skip Gram model and SMOTE filter for handling imbalanced datasets. Then the classification of tweets into the category of emergency, non-emergency, and other uses various classifiers namely Naïve Bayes, Support Vector Machine, Instance-Based Learning, and Logistic Regression. <br /> <br /> <br /> <br /> <br /> Data emergency and non-emergency categories are used for the introduction of entities, names of disasters and disaster locations. Of the 257 relevant tweets, tokenization and labeled with BIO standardized. As many as 3,856 tokens become inputs for the introduction of named entities using the Conditional Random Field (CRF) model. Furthermore, automatic hashtag generation is performed using the results of the classification and introduction of named entities. <br /> <br /> <br /> <br /> <br /> The results show that the use of skip gram models can improve accuracy. The highest average accuracy of 83.9695% is obtained by using instance-based learning with k 15. The named entity recognition with 70.3% recall, 89.4% precision and 77.1% f-measure. Automatic hashtag generation has good results with an average of 61.2% recall, 87.4% precision and 66.9% f-measure. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Twitter has been widely used as a communication tool for emergency response when disasters occur in various countries. Emergency response teams or researchers use hashtags for emergency disaster searches. The use of disaster hashtags in particular Twitter does not have a standard format, this makes it difficult to search and collect data for emergency response. OCHA (Office for Coordination of Humanitarian Affairs) proposes to standardize the hashtag for emergency response. <br /> <br /> <br /> <br /> <br /> This study proposes to generate automatic disaster hashtag in accordance with OCHA standards. 2,685 tweets preprocessed and resulting 1,309 tweets in a clean dataset. The research uses the word representation method with Skip Gram model and SMOTE filter for handling imbalanced datasets. Then the classification of tweets into the category of emergency, non-emergency, and other uses various classifiers namely Naïve Bayes, Support Vector Machine, Instance-Based Learning, and Logistic Regression. <br /> <br /> <br /> <br /> <br /> Data emergency and non-emergency categories are used for the introduction of entities, names of disasters and disaster locations. Of the 257 relevant tweets, tokenization and labeled with BIO standardized. As many as 3,856 tokens become inputs for the introduction of named entities using the Conditional Random Field (CRF) model. Furthermore, automatic hashtag generation is performed using the results of the classification and introduction of named entities. <br /> <br /> <br /> <br /> <br /> The results show that the use of skip gram models can improve accuracy. The highest average accuracy of 83.9695% is obtained by using instance-based learning with k 15. The named entity recognition with 70.3% recall, 89.4% precision and 77.1% f-measure. Automatic hashtag generation has good results with an average of 61.2% recall, 87.4% precision and 66.9% f-measure.
format Theses
author WIATI GUSTI - NIM: 23515041 , KHARISMA
spellingShingle WIATI GUSTI - NIM: 23515041 , KHARISMA
GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
author_facet WIATI GUSTI - NIM: 23515041 , KHARISMA
author_sort WIATI GUSTI - NIM: 23515041 , KHARISMA
title GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_short GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_full GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_fullStr GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_full_unstemmed GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_sort generating of automatic disaster hashtag based on ocha standard
url https://digilib.itb.ac.id/gdl/view/28476
_version_ 1822922600775417856