Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
In the world we live today, data is the new oil. Data can reveal hidden knowledge that gives us an advantage over our competitors. However, data that are present in an unstructured form such as text documents are difficult to be processed by conventional machine learning algorithms. Therefore, in th...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Published: |
2023
|
Online Access: | http://scholars.utp.edu.my/id/eprint/34359/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145000035&doi=10.1007%2f978-3-031-20429-6_12&partnerID=40&md5=343151f1085f32b2981696e042b54683 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Petronas |
id |
oai:scholars.utp.edu.my:34359 |
---|---|
record_format |
eprints |
spelling |
oai:scholars.utp.edu.my:343592023-02-17T13:12:18Z http://scholars.utp.edu.my/id/eprint/34359/ Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus Abdullah, M.H.A. Aziz, N. Abdulkadir, S.J. Akhir, E.A.P. Talpur, N. In the world we live today, data is the new oil. Data can reveal hidden knowledge that gives us an advantage over our competitors. However, data that are present in an unstructured form such as text documents are difficult to be processed by conventional machine learning algorithms. Therefore, in this study, we attempted to perform information extraction from textual data using current and state-of-the-art models to understand their working mechanisms. To perform this study, we have chosen the GENIA corpus for evaluating the performance of each model. These selected event extraction models are evaluated based on specific measures which are precision, recall, and F-1 measure. The result of our study shows that the DeepEventMine model has scored the highest for trigger detection with a precision of 79.17, recall at 82.93, and F-1 measure at 81.01. Similarly, for event detection, the DeepEventMine model has scored highest among other models with a precision of 65.24, recall at 55.93, and F-1 measure at 60.23 based on the selected corpus. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG. 2023 Article NonPeerReviewed Abdullah, M.H.A. and Aziz, N. and Abdulkadir, S.J. and Akhir, E.A.P. and Talpur, N. (2023) Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus. Lecture Notes in Networks and Systems, 573 LN. pp. 118-127. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145000035&doi=10.1007%2f978-3-031-20429-6_12&partnerID=40&md5=343151f1085f32b2981696e042b54683 10.1007/978-3-031-20429-6₁₂ 10.1007/978-3-031-20429-6₁₂ |
institution |
Universiti Teknologi Petronas |
building |
UTP Resource Centre |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Petronas |
content_source |
UTP Institutional Repository |
url_provider |
http://eprints.utp.edu.my/ |
description |
In the world we live today, data is the new oil. Data can reveal hidden knowledge that gives us an advantage over our competitors. However, data that are present in an unstructured form such as text documents are difficult to be processed by conventional machine learning algorithms. Therefore, in this study, we attempted to perform information extraction from textual data using current and state-of-the-art models to understand their working mechanisms. To perform this study, we have chosen the GENIA corpus for evaluating the performance of each model. These selected event extraction models are evaluated based on specific measures which are precision, recall, and F-1 measure. The result of our study shows that the DeepEventMine model has scored the highest for trigger detection with a precision of 79.17, recall at 82.93, and F-1 measure at 81.01. Similarly, for event detection, the DeepEventMine model has scored highest among other models with a precision of 65.24, recall at 55.93, and F-1 measure at 60.23 based on the selected corpus. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG. |
format |
Article |
author |
Abdullah, M.H.A. Aziz, N. Abdulkadir, S.J. Akhir, E.A.P. Talpur, N. |
spellingShingle |
Abdullah, M.H.A. Aziz, N. Abdulkadir, S.J. Akhir, E.A.P. Talpur, N. Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus |
author_facet |
Abdullah, M.H.A. Aziz, N. Abdulkadir, S.J. Akhir, E.A.P. Talpur, N. |
author_sort |
Abdullah, M.H.A. |
title |
Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus |
title_short |
Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus |
title_full |
Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus |
title_fullStr |
Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus |
title_full_unstemmed |
Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus |
title_sort |
event detection and information extraction strategies from text: a preliminary study using genia corpus |
publishDate |
2023 |
url |
http://scholars.utp.edu.my/id/eprint/34359/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145000035&doi=10.1007%2f978-3-031-20429-6_12&partnerID=40&md5=343151f1085f32b2981696e042b54683 |
_version_ |
1758580621620281344 |