GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD

Twitter has been widely used as a communication tool for emergency response when disasters occur in various countries. Emergency response teams or researchers use hashtags for emergency disaster searches. The use of disaster hashtags in particular Twitter does not have a standard format, this makes...

Full description

Saved in:

Bibliographic Details
Main Author:	WIATI GUSTI - NIM: 23515041 , KHARISMA
Format:	Theses
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/28476
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:28476
spelling	id-itb.:284762018-10-01T10:08:38ZGENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD WIATI GUSTI - NIM: 23515041 , KHARISMA Indonesia Theses INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/28476 Twitter has been widely used as a communication tool for emergency response when disasters occur in various countries. Emergency response teams or researchers use hashtags for emergency disaster searches. The use of disaster hashtags in particular Twitter does not have a standard format, this makes it difficult to search and collect data for emergency response. OCHA (Office for Coordination of Humanitarian Affairs) proposes to standardize the hashtag for emergency response. <br /> <br /> <br /> <br /> <br /> This study proposes to generate automatic disaster hashtag in accordance with OCHA standards. 2,685 tweets preprocessed and resulting 1,309 tweets in a clean dataset. The research uses the word representation method with Skip Gram model and SMOTE filter for handling imbalanced datasets. Then the classification of tweets into the category of emergency, non-emergency, and other uses various classifiers namely NaÃƒÂ¯ve Bayes, Support Vector Machine, Instance-Based Learning, and Logistic Regression. <br /> <br /> <br /> <br /> <br /> Data emergency and non-emergency categories are used for the introduction of entities, names of disasters and disaster locations. Of the 257 relevant tweets, tokenization and labeled with BIO standardized. As many as 3,856 tokens become inputs for the introduction of named entities using the Conditional Random Field (CRF) model. Furthermore, automatic hashtag generation is performed using the results of the classification and introduction of named entities. <br /> <br /> <br /> <br /> <br /> The results show that the use of skip gram models can improve accuracy. The highest average accuracy of 83.9695% is obtained by using instance-based learning with k 15. The named entity recognition with 70.3% recall, 89.4% precision and 77.1% f-measure. Automatic hashtag generation has good results with an average of 61.2% recall, 87.4% precision and 66.9% f-measure. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	Twitter has been widely used as a communication tool for emergency response when disasters occur in various countries. Emergency response teams or researchers use hashtags for emergency disaster searches. The use of disaster hashtags in particular Twitter does not have a standard format, this makes it difficult to search and collect data for emergency response. OCHA (Office for Coordination of Humanitarian Affairs) proposes to standardize the hashtag for emergency response. <br /> <br /> <br /> <br /> <br /> This study proposes to generate automatic disaster hashtag in accordance with OCHA standards. 2,685 tweets preprocessed and resulting 1,309 tweets in a clean dataset. The research uses the word representation method with Skip Gram model and SMOTE filter for handling imbalanced datasets. Then the classification of tweets into the category of emergency, non-emergency, and other uses various classifiers namely NaÃƒÂ¯ve Bayes, Support Vector Machine, Instance-Based Learning, and Logistic Regression. <br /> <br /> <br /> <br /> <br /> Data emergency and non-emergency categories are used for the introduction of entities, names of disasters and disaster locations. Of the 257 relevant tweets, tokenization and labeled with BIO standardized. As many as 3,856 tokens become inputs for the introduction of named entities using the Conditional Random Field (CRF) model. Furthermore, automatic hashtag generation is performed using the results of the classification and introduction of named entities. <br /> <br /> <br /> <br /> <br /> The results show that the use of skip gram models can improve accuracy. The highest average accuracy of 83.9695% is obtained by using instance-based learning with k 15. The named entity recognition with 70.3% recall, 89.4% precision and 77.1% f-measure. Automatic hashtag generation has good results with an average of 61.2% recall, 87.4% precision and 66.9% f-measure.
format	Theses
author	WIATI GUSTI - NIM: 23515041 , KHARISMA
spellingShingle	WIATI GUSTI - NIM: 23515041 , KHARISMA GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
author_facet	WIATI GUSTI - NIM: 23515041 , KHARISMA
author_sort	WIATI GUSTI - NIM: 23515041 , KHARISMA
title	GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_short	GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_full	GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_fullStr	GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_full_unstemmed	GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD
title_sort	generating of automatic disaster hashtag based on ocha standard
url	https://digilib.itb.ac.id/gdl/view/28476
_version_	1822922600775417856

GENERATING OF AUTOMATIC DISASTER HASHTAG BASED ON OCHA STANDARD

Similar Items