Named entity recognition for information extraction

Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range o...

Full description

Saved in:
Bibliographic Details
Main Author: Wei, Mark Zi Yun
Other Authors: Hui Siu Cheung
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148635
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148635
record_format dspace
spelling sg-ntu-dr.10356-1486352021-05-18T05:59:38Z Named entity recognition for information extraction Wei, Mark Zi Yun Hui Siu Cheung School of Computer Science and Engineering ASSCHUI@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range of Natural Language Processing (NLP) applications, including machine translation, question answering, and more. In this report, we explore two paradigms for NER systems, namely RNN-based systems and Transformer-based systems. We review related work that has been done leading up to the conceptualization of these systems, and explore the implementation, characteristics, and limitations of each approach. We then compare four different approaches from RNN-based NER systems and Transformer-based systems to determine their efficacy on two semantically and structurally different datasets. From our experiments, this project determined that Transformer-based models generally perform better than the RNN-based models implemented. These results are then discussed and reasons are provided to substantiate the results observed. Bachelor of Engineering (Computer Science) 2021-05-10T01:39:16Z 2021-05-10T01:39:16Z 2021 Final Year Project (FYP) Wei, M. Z. Y. (2021). Named entity recognition for information extraction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148635 https://hdl.handle.net/10356/148635 en SCSE20-0340 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Document and text processing
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Wei, Mark Zi Yun
Named entity recognition for information extraction
description Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range of Natural Language Processing (NLP) applications, including machine translation, question answering, and more. In this report, we explore two paradigms for NER systems, namely RNN-based systems and Transformer-based systems. We review related work that has been done leading up to the conceptualization of these systems, and explore the implementation, characteristics, and limitations of each approach. We then compare four different approaches from RNN-based NER systems and Transformer-based systems to determine their efficacy on two semantically and structurally different datasets. From our experiments, this project determined that Transformer-based models generally perform better than the RNN-based models implemented. These results are then discussed and reasons are provided to substantiate the results observed.
author2 Hui Siu Cheung
author_facet Hui Siu Cheung
Wei, Mark Zi Yun
format Final Year Project
author Wei, Mark Zi Yun
author_sort Wei, Mark Zi Yun
title Named entity recognition for information extraction
title_short Named entity recognition for information extraction
title_full Named entity recognition for information extraction
title_fullStr Named entity recognition for information extraction
title_full_unstemmed Named entity recognition for information extraction
title_sort named entity recognition for information extraction
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/148635
_version_ 1701270486611132416