Event detection based on on-line news clustering
In this dissertation, we propose a semi-supervised learning method to solve the disaster event detection task. This method achieved lower miss rate and false rate than the unsupervised learning method on disaster event detection task. The semi-supervised learning method contains a multi-class classi...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/155130 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-155130 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1551302023-07-04T16:46:40Z Event detection based on on-line news clustering Huang, Song Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering In this dissertation, we propose a semi-supervised learning method to solve the disaster event detection task. This method achieved lower miss rate and false rate than the unsupervised learning method on disaster event detection task. The semi-supervised learning method contains a multi-class classification model and a single-pass clustering model. A hierarchical single-pass clustering algorithm is also developed to overcome the deficiency of traditional single-pass clustering algorithm. For the text representation learning, a pre-trained BERT model is fine-tuned on our customized dataset, and achieves great performance on the classification problem. A NER model is introduced to extract the location and time features to help the clustering algorithm detect new events and track known events. Master of Science (Computer Control and Automation) 2022-02-08T01:46:31Z 2022-02-08T01:46:31Z 2021 Thesis-Master by Coursework Huang, S. (2021). Event detection based on on-line news clustering. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/155130 https://hdl.handle.net/10356/155130 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Huang, Song Event detection based on on-line news clustering |
description |
In this dissertation, we propose a semi-supervised learning method to solve the disaster event detection task. This method achieved lower miss rate and false rate than the unsupervised learning method on disaster event detection task. The semi-supervised learning method contains a multi-class classification model and a single-pass clustering model. A hierarchical single-pass clustering algorithm is also developed to overcome the deficiency of traditional single-pass clustering algorithm. For the text representation learning, a pre-trained BERT model is fine-tuned on our customized dataset, and achieves great performance on the classification problem. A NER model is introduced to extract the location and time features to help the clustering algorithm detect new events and track known events. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Huang, Song |
format |
Thesis-Master by Coursework |
author |
Huang, Song |
author_sort |
Huang, Song |
title |
Event detection based on on-line news clustering |
title_short |
Event detection based on on-line news clustering |
title_full |
Event detection based on on-line news clustering |
title_fullStr |
Event detection based on on-line news clustering |
title_full_unstemmed |
Event detection based on on-line news clustering |
title_sort |
event detection based on on-line news clustering |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/155130 |
_version_ |
1772825999706161152 |