Named entity recognition for information extraction
Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range o...
Saved in:
主要作者: | |
---|---|
其他作者: | |
格式: | Final Year Project |
語言: | English |
出版: |
Nanyang Technological University
2021
|
主題: | |
在線閱讀: | https://hdl.handle.net/10356/148635 |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|
id |
sg-ntu-dr.10356-148635 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1486352021-05-18T05:59:38Z Named entity recognition for information extraction Wei, Mark Zi Yun Hui Siu Cheung School of Computer Science and Engineering ASSCHUI@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range of Natural Language Processing (NLP) applications, including machine translation, question answering, and more. In this report, we explore two paradigms for NER systems, namely RNN-based systems and Transformer-based systems. We review related work that has been done leading up to the conceptualization of these systems, and explore the implementation, characteristics, and limitations of each approach. We then compare four different approaches from RNN-based NER systems and Transformer-based systems to determine their efficacy on two semantically and structurally different datasets. From our experiments, this project determined that Transformer-based models generally perform better than the RNN-based models implemented. These results are then discussed and reasons are provided to substantiate the results observed. Bachelor of Engineering (Computer Science) 2021-05-10T01:39:16Z 2021-05-10T01:39:16Z 2021 Final Year Project (FYP) Wei, M. Z. Y. (2021). Named entity recognition for information extraction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148635 https://hdl.handle.net/10356/148635 en SCSE20-0340 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing Wei, Mark Zi Yun Named entity recognition for information extraction |
description |
Named Entity Recognition (NER) refers to the task of examining unstructured text to extract real-world objects, peoples, terms, and more, termed ‘named’ entities, to be classified into predefined categories (Person, Location, Organization, etc.). NER serves as a crucial first step for a wide range of Natural Language Processing (NLP) applications, including machine translation, question answering, and more. In this report, we explore two paradigms for NER systems, namely RNN-based systems and Transformer-based systems. We review related work that has been done leading up to the conceptualization of these systems, and explore the implementation, characteristics, and limitations of each approach. We then compare four different approaches from RNN-based NER systems and Transformer-based systems to determine their efficacy on two semantically and structurally different datasets. From our experiments, this project determined that Transformer-based models generally perform better than the RNN-based models implemented. These results are then discussed and reasons are provided to substantiate the results observed. |
author2 |
Hui Siu Cheung |
author_facet |
Hui Siu Cheung Wei, Mark Zi Yun |
format |
Final Year Project |
author |
Wei, Mark Zi Yun |
author_sort |
Wei, Mark Zi Yun |
title |
Named entity recognition for information extraction |
title_short |
Named entity recognition for information extraction |
title_full |
Named entity recognition for information extraction |
title_fullStr |
Named entity recognition for information extraction |
title_full_unstemmed |
Named entity recognition for information extraction |
title_sort |
named entity recognition for information extraction |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/148635 |
_version_ |
1701270486611132416 |