BERT named entity recognition on emergency response system
Named Entity Recognition (NER) is a natural language processing task to identify pre-defined categories called entities in a given sequence. An existing Emergency Response System is a NER-based application developed to aid call operators by extracting key information from the caller and replacing th...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156612 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156612 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1566122022-04-21T05:02:37Z BERT named entity recognition on emergency response system Chua, Clarita Wyn Kay Chng Eng Siong School of Computer Science and Engineering ASESChng@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Named Entity Recognition (NER) is a natural language processing task to identify pre-defined categories called entities in a given sequence. An existing Emergency Response System is a NER-based application developed to aid call operators by extracting key information from the caller and replacing the need for manual insertion by call operators into the command control system. This paper proposes the improvement of the NER model in the Emergency Response System by including medical and covid-related entities through finetuning and training the different BERT variant models onto a COVID dataset and General Emergency Response dataset. To aid in the development and deployment of a variety of BERT-based models, the paper also introduces an automated NER pipeline with modules to prepare data and run the NER model. This paper also explores the benefits of data augmentation from PEGASUS with experiments conducted on augmented and non-augmented datasets to obtain the best baseline NER model for each respective dataset. From the experiments, we have found that roBERTa is the best baseline model for original datasets, but performed around 10% lower on augmented datasets. Additionally, DistilBERT and BERT NER models have shown significant improvement within a range of 3- 7% in their performance after data augmentation. Bachelor of Engineering (Computer Science) 2022-04-21T05:02:37Z 2022-04-21T05:02:37Z 2022 Final Year Project (FYP) Chua, C. W. K. (2022). BERT named entity recognition on emergency response system. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156612 https://hdl.handle.net/10356/156612 en SCSE21-0068 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing Chua, Clarita Wyn Kay BERT named entity recognition on emergency response system |
description |
Named Entity Recognition (NER) is a natural language processing task to identify pre-defined categories called entities in a given sequence. An existing Emergency Response System is a NER-based application developed to aid call operators by extracting key information from the caller and replacing the need for manual insertion by call operators into the command control system. This paper proposes the improvement of the NER model in the Emergency Response System by including medical and covid-related entities through finetuning and training the different BERT variant models onto a COVID dataset and General Emergency Response dataset. To aid in the development and deployment of a variety of BERT-based models, the paper also introduces an automated NER pipeline with modules to prepare data and run the NER model. This paper also explores the benefits of data augmentation from PEGASUS with experiments conducted on augmented and non-augmented datasets to obtain the best baseline NER model for each respective dataset. From the experiments, we have found that roBERTa is the best baseline model for original datasets, but performed around 10% lower on augmented datasets. Additionally, DistilBERT and BERT NER models have shown significant improvement within a range of 3- 7% in their performance after data augmentation. |
author2 |
Chng Eng Siong |
author_facet |
Chng Eng Siong Chua, Clarita Wyn Kay |
format |
Final Year Project |
author |
Chua, Clarita Wyn Kay |
author_sort |
Chua, Clarita Wyn Kay |
title |
BERT named entity recognition on emergency response system |
title_short |
BERT named entity recognition on emergency response system |
title_full |
BERT named entity recognition on emergency response system |
title_fullStr |
BERT named entity recognition on emergency response system |
title_full_unstemmed |
BERT named entity recognition on emergency response system |
title_sort |
bert named entity recognition on emergency response system |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156612 |
_version_ |
1731235749410373632 |