Event detection based on on-line news clustering

In this dissertation, we propose a semi-supervised learning method to solve the disaster event detection task. This method achieved lower miss rate and false rate than the unsupervised learning method on disaster event detection task. The semi-supervised learning method contains a multi-class classi...

Full description

Saved in:
Bibliographic Details
Main Author: Huang, Song
Other Authors: Mao Kezhi
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/155130
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In this dissertation, we propose a semi-supervised learning method to solve the disaster event detection task. This method achieved lower miss rate and false rate than the unsupervised learning method on disaster event detection task. The semi-supervised learning method contains a multi-class classification model and a single-pass clustering model. A hierarchical single-pass clustering algorithm is also developed to overcome the deficiency of traditional single-pass clustering algorithm. For the text representation learning, a pre-trained BERT model is fine-tuned on our customized dataset, and achieves great performance on the classification problem. A NER model is introduced to extract the location and time features to help the clustering algorithm detect new events and track known events.