Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus

In the world we live today, data is the new oil. Data can reveal hidden knowledge that gives us an advantage over our competitors. However, data that are present in an unstructured form such as text documents are difficult to be processed by conventional machine learning algorithms. Therefore, in th...

Full description

Saved in:
Bibliographic Details
Main Authors: Abdullah, M.H.A., Aziz, N., Abdulkadir, S.J., Akhir, E.A.P., Talpur, N.
Format: Article
Published: 2023
Online Access:http://scholars.utp.edu.my/id/eprint/34359/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145000035&doi=10.1007%2f978-3-031-20429-6_12&partnerID=40&md5=343151f1085f32b2981696e042b54683
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Petronas
id oai:scholars.utp.edu.my:34359
record_format eprints
spelling oai:scholars.utp.edu.my:343592023-02-17T13:12:18Z http://scholars.utp.edu.my/id/eprint/34359/ Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus Abdullah, M.H.A. Aziz, N. Abdulkadir, S.J. Akhir, E.A.P. Talpur, N. In the world we live today, data is the new oil. Data can reveal hidden knowledge that gives us an advantage over our competitors. However, data that are present in an unstructured form such as text documents are difficult to be processed by conventional machine learning algorithms. Therefore, in this study, we attempted to perform information extraction from textual data using current and state-of-the-art models to understand their working mechanisms. To perform this study, we have chosen the GENIA corpus for evaluating the performance of each model. These selected event extraction models are evaluated based on specific measures which are precision, recall, and F-1 measure. The result of our study shows that the DeepEventMine model has scored the highest for trigger detection with a precision of 79.17, recall at 82.93, and F-1 measure at 81.01. Similarly, for event detection, the DeepEventMine model has scored highest among other models with a precision of 65.24, recall at 55.93, and F-1 measure at 60.23 based on the selected corpus. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG. 2023 Article NonPeerReviewed Abdullah, M.H.A. and Aziz, N. and Abdulkadir, S.J. and Akhir, E.A.P. and Talpur, N. (2023) Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus. Lecture Notes in Networks and Systems, 573 LN. pp. 118-127. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145000035&doi=10.1007%2f978-3-031-20429-6_12&partnerID=40&md5=343151f1085f32b2981696e042b54683 10.1007/978-3-031-20429-6₁₂ 10.1007/978-3-031-20429-6₁₂
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description In the world we live today, data is the new oil. Data can reveal hidden knowledge that gives us an advantage over our competitors. However, data that are present in an unstructured form such as text documents are difficult to be processed by conventional machine learning algorithms. Therefore, in this study, we attempted to perform information extraction from textual data using current and state-of-the-art models to understand their working mechanisms. To perform this study, we have chosen the GENIA corpus for evaluating the performance of each model. These selected event extraction models are evaluated based on specific measures which are precision, recall, and F-1 measure. The result of our study shows that the DeepEventMine model has scored the highest for trigger detection with a precision of 79.17, recall at 82.93, and F-1 measure at 81.01. Similarly, for event detection, the DeepEventMine model has scored highest among other models with a precision of 65.24, recall at 55.93, and F-1 measure at 60.23 based on the selected corpus. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
format Article
author Abdullah, M.H.A.
Aziz, N.
Abdulkadir, S.J.
Akhir, E.A.P.
Talpur, N.
spellingShingle Abdullah, M.H.A.
Aziz, N.
Abdulkadir, S.J.
Akhir, E.A.P.
Talpur, N.
Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
author_facet Abdullah, M.H.A.
Aziz, N.
Abdulkadir, S.J.
Akhir, E.A.P.
Talpur, N.
author_sort Abdullah, M.H.A.
title Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
title_short Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
title_full Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
title_fullStr Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
title_full_unstemmed Event Detection and Information Extraction Strategies from Text: A Preliminary Study Using GENIA Corpus
title_sort event detection and information extraction strategies from text: a preliminary study using genia corpus
publishDate 2023
url http://scholars.utp.edu.my/id/eprint/34359/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145000035&doi=10.1007%2f978-3-031-20429-6_12&partnerID=40&md5=343151f1085f32b2981696e042b54683
_version_ 1758580621620281344