Information extraction of hazard events
Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency resp...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/149108 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-149108 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1491082023-07-07T17:33:04Z Information extraction of hazard events Lim, Joanna Jia Yi Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency response teams to better allocate resources and support to affected areas. In this project, several deep learning models were explored to identify occurrences of information like Deaths, Injury, Location, Date and Time in hazard events related news sentences. News articles of hazard events like attacks, earthquakes, typhoon, hurricanes, road accidents were first identified and filtered using keywords. Next, sentences of interest from these news articles were isolated and labelled to form the hazard events database. The labelled training data is then used to train deep neural network models. Two schemes were explored in this project. In the first scheme, one single model was trained to handle multi-class samples. While in the second scheme, multiple binary classifiers were trained. Discussion and comparison of results between the two schemes were carried out. Finally, information like location, date and time was extracted using spaCy’s named entity recognition. Bachelor of Engineering (Information Engineering and Media) 2021-05-26T13:24:54Z 2021-05-26T13:24:54Z 2021 Final Year Project (FYP) Lim, J. J. Y. (2021). Information extraction of hazard events. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149108 https://hdl.handle.net/10356/149108 en A1104-201 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Lim, Joanna Jia Yi Information extraction of hazard events |
description |
Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency response teams to better allocate resources and support to affected areas.
In this project, several deep learning models were explored to identify occurrences of information like Deaths, Injury, Location, Date and Time in hazard events related news sentences. News articles of hazard events like attacks, earthquakes, typhoon, hurricanes, road accidents were first identified and filtered using keywords. Next, sentences of interest from these news articles were isolated and labelled to form the hazard events database. The labelled training data is then used to train deep neural network models. Two schemes were explored in this project.
In the first scheme, one single model was trained to handle multi-class samples. While in the second scheme, multiple binary classifiers were trained. Discussion and comparison of results between the two schemes were carried out. Finally, information like location, date and time was extracted using spaCy’s named entity recognition. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Lim, Joanna Jia Yi |
format |
Final Year Project |
author |
Lim, Joanna Jia Yi |
author_sort |
Lim, Joanna Jia Yi |
title |
Information extraction of hazard events |
title_short |
Information extraction of hazard events |
title_full |
Information extraction of hazard events |
title_fullStr |
Information extraction of hazard events |
title_full_unstemmed |
Information extraction of hazard events |
title_sort |
information extraction of hazard events |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/149108 |
_version_ |
1772826515038273536 |