Named entity recognition for information extraction
Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range o...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/148635 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-148635 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1486352021-05-18T05:59:38Z Named entity recognition for information extraction Wei, Mark Zi Yun Hui Siu Cheung School of Computer Science and Engineering ASSCHUI@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range of Natural Language Processing (NLP) applications, including machine translation, question answering, and more. In this report, we explore two paradigms for NER systems, namely RNN-based systems and Transformer-based systems. We review related work that has been done leading up to the conceptualization of these systems, and explore the implementation, characteristics, and limitations of each approach. We then compare four different approaches from RNN-based NER systems and Transformer-based systems to determine their efficacy on two semantically and structurally different datasets. From our experiments, this project determined that Transformer-based models generally perform better than the RNN-based models implemented. These results are then discussed and reasons are provided to substantiate the results observed. Bachelor of Engineering (Computer Science) 2021-05-10T01:39:16Z 2021-05-10T01:39:16Z 2021 Final Year Project (FYP) Wei, M. Z. Y. (2021). Named entity recognition for information extraction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148635 https://hdl.handle.net/10356/148635 en SCSE20-0340 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing Wei, Mark Zi Yun Named entity recognition for information extraction |
description |
Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range of Natural Language Processing (NLP) applications, including machine translation, question answering, and more. In this report, we explore two paradigms for NER systems, namely RNN-based systems and Transformer-based systems. We review related work that has been done leading up to the conceptualization of these systems, and explore the implementation, characteristics, and limitations of each approach. We then compare four different approaches from RNN-based NER systems and Transformer-based systems to determine their efficacy on two semantically and structurally different datasets. From our experiments, this project determined that Transformer-based models generally perform better than the RNN-based models implemented. These results are then discussed and reasons are provided to substantiate the results observed. |
author2 |
Hui Siu Cheung |
author_facet |
Hui Siu Cheung Wei, Mark Zi Yun |
format |
Final Year Project |
author |
Wei, Mark Zi Yun |
author_sort |
Wei, Mark Zi Yun |
title |
Named entity recognition for information extraction |
title_short |
Named entity recognition for information extraction |
title_full |
Named entity recognition for information extraction |
title_fullStr |
Named entity recognition for information extraction |
title_full_unstemmed |
Named entity recognition for information extraction |
title_sort |
named entity recognition for information extraction |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/148635 |
_version_ |
1701270486611132416 |