Information extraction of hazard events

Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency resp...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Joanna Jia Yi
Other Authors: Mao Kezhi
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/149108
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-149108
record_format dspace
spelling sg-ntu-dr.10356-1491082023-07-07T17:33:04Z Information extraction of hazard events Lim, Joanna Jia Yi Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency response teams to better allocate resources and support to affected areas. In this project, several deep learning models were explored to identify occurrences of information like Deaths, Injury, Location, Date and Time in hazard events related news sentences. News articles of hazard events like attacks, earthquakes, typhoon, hurricanes, road accidents were first identified and filtered using keywords. Next, sentences of interest from these news articles were isolated and labelled to form the hazard events database. The labelled training data is then used to train deep neural network models. Two schemes were explored in this project. In the first scheme, one single model was trained to handle multi-class samples. While in the second scheme, multiple binary classifiers were trained. Discussion and comparison of results between the two schemes were carried out. Finally, information like location, date and time was extracted using spaCy’s named entity recognition. Bachelor of Engineering (Information Engineering and Media) 2021-05-26T13:24:54Z 2021-05-26T13:24:54Z 2021 Final Year Project (FYP) Lim, J. J. Y. (2021). Information extraction of hazard events. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149108 https://hdl.handle.net/10356/149108 en A1104-201 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Lim, Joanna Jia Yi
Information extraction of hazard events
description Information Extraction (IE) is the process of extracting structured information from unstructured text. Since news articles on hazard events consist of useful information and are usually reported in real-time, identifying, and extracting such information would allow the government and emergency response teams to better allocate resources and support to affected areas. In this project, several deep learning models were explored to identify occurrences of information like Deaths, Injury, Location, Date and Time in hazard events related news sentences. News articles of hazard events like attacks, earthquakes, typhoon, hurricanes, road accidents were first identified and filtered using keywords. Next, sentences of interest from these news articles were isolated and labelled to form the hazard events database. The labelled training data is then used to train deep neural network models. Two schemes were explored in this project. In the first scheme, one single model was trained to handle multi-class samples. While in the second scheme, multiple binary classifiers were trained. Discussion and comparison of results between the two schemes were carried out. Finally, information like location, date and time was extracted using spaCy’s named entity recognition.
author2 Mao Kezhi
author_facet Mao Kezhi
Lim, Joanna Jia Yi
format Final Year Project
author Lim, Joanna Jia Yi
author_sort Lim, Joanna Jia Yi
title Information extraction of hazard events
title_short Information extraction of hazard events
title_full Information extraction of hazard events
title_fullStr Information extraction of hazard events
title_full_unstemmed Information extraction of hazard events
title_sort information extraction of hazard events
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/149108
_version_ 1772826515038273536