Impact of difficult negatives on Twitter crisis detection

Twitter has become an alternative information source during a crisis. However, the short, noisy nature of tweets hinders information extraction. While models trained with standard Twitter crisis datasets accomplished decent performance, it remained a challenge to generalize to unseen crisis events....

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHANG, Yuhao, LO, Siaw Ling, WIN MYINT, Phyo Yi
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Twitter Crisis Detection Difficult Negative Data Negative Mining Databases and Information Systems Social Media
Online Access:	https://ink.library.smu.edu.sg/sis_research/8007 https://ink.library.smu.edu.sg/context/sis_research/article/9010/viewcontent/LoSiawLing_2023_Impact_of_Difficult_Negatives_on_Twitter_Crisis_Detection.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-9010
record_format	dspace
spelling	sg-smu-ink.sis_research-90102023-08-15T01:57:15Z Impact of difficult negatives on Twitter crisis detection ZHANG, Yuhao LO, Siaw Ling WIN MYINT, Phyo Yi Twitter has become an alternative information source during a crisis. However, the short, noisy nature of tweets hinders information extraction. While models trained with standard Twitter crisis datasets accomplished decent performance, it remained a challenge to generalize to unseen crisis events. Thus, we proposed adding “difficult” negative examples during training to improve model generalization for Twitter crisis detection. Although adding random noise is a common practice, the impact of difficult negatives, i.e., negative data semantically similar to true examples, was never examined in NLP. Most of existing research focuses on the classification task, without considering the primary information need of crisis responders. In our study, we implemented multiple sequence tagging models and studied quantitatively and qualitatively the impact of difficult negatives on sequence tagging. We evaluated models on unseen events and showed that difficult negative forced models to generalize better, leading to more accurate information extraction in a real-world application. 2023-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8007 https://ink.library.smu.edu.sg/context/sis_research/article/9010/viewcontent/LoSiawLing_2023_Impact_of_Difficult_Negatives_on_Twitter_Crisis_Detection.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Twitter Crisis Detection Difficult Negative Data Negative Mining Databases and Information Systems Social Media
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Twitter Crisis Detection Difficult Negative Data Negative Mining Databases and Information Systems Social Media
spellingShingle	Twitter Crisis Detection Difficult Negative Data Negative Mining Databases and Information Systems Social Media ZHANG, Yuhao LO, Siaw Ling WIN MYINT, Phyo Yi Impact of difficult negatives on Twitter crisis detection
description	Twitter has become an alternative information source during a crisis. However, the short, noisy nature of tweets hinders information extraction. While models trained with standard Twitter crisis datasets accomplished decent performance, it remained a challenge to generalize to unseen crisis events. Thus, we proposed adding “difficult” negative examples during training to improve model generalization for Twitter crisis detection. Although adding random noise is a common practice, the impact of difficult negatives, i.e., negative data semantically similar to true examples, was never examined in NLP. Most of existing research focuses on the classification task, without considering the primary information need of crisis responders. In our study, we implemented multiple sequence tagging models and studied quantitatively and qualitatively the impact of difficult negatives on sequence tagging. We evaluated models on unseen events and showed that difficult negative forced models to generalize better, leading to more accurate information extraction in a real-world application.
format	text
author	ZHANG, Yuhao LO, Siaw Ling WIN MYINT, Phyo Yi
author_facet	ZHANG, Yuhao LO, Siaw Ling WIN MYINT, Phyo Yi
author_sort	ZHANG, Yuhao
title	Impact of difficult negatives on Twitter crisis detection
title_short	Impact of difficult negatives on Twitter crisis detection
title_full	Impact of difficult negatives on Twitter crisis detection
title_fullStr	Impact of difficult negatives on Twitter crisis detection
title_full_unstemmed	Impact of difficult negatives on Twitter crisis detection
title_sort	impact of difficult negatives on twitter crisis detection
publisher	Institutional Knowledge at Singapore Management University
publishDate	2023
url	https://ink.library.smu.edu.sg/sis_research/8007 https://ink.library.smu.edu.sg/context/sis_research/article/9010/viewcontent/LoSiawLing_2023_Impact_of_Difficult_Negatives_on_Twitter_Crisis_Detection.pdf
_version_	1779156934200393728

Impact of difficult negatives on Twitter crisis detection

Similar Items