DocEE: A large-scale and fine-grained benchmark for document-level event extraction
Event extraction aims to identify an event and then extract the arguments participating in the event. Despite the great success in sentencelevel event extraction, events are more naturally presented in the form of documents, with event arguments scattered in multiple sentences. However, a major barr...
Saved in:
Main Authors: | , , , , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7471 https://ink.library.smu.edu.sg/context/sis_research/article/8474/viewcontent/2022.naacl_main.291.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8474 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-84742022-10-20T07:06:28Z DocEE: A large-scale and fine-grained benchmark for document-level event extraction TONG, Meihan XU, Bin WANG, Shuai HAN, Meihuan CAO, Yixin ZHU, Jiangqi CHEN, Siyu HOU, Lei LI, Juanzi Event extraction aims to identify an event and then extract the arguments participating in the event. Despite the great success in sentencelevel event extraction, events are more naturally presented in the form of documents, with event arguments scattered in multiple sentences. However, a major barrier to promote documentlevel event extraction has been the lack of large-scale and practical training and evaluation datasets. In this paper, we present DocEE, a new document-level event extraction dataset including 27,000+ events, 180,000+ arguments. We highlight three features: largescale manual annotations, fine-grained argument types and application-oriented settings. Experiments show that there is still a big gap between state-of-the-art models and human beings (41% Vs 85% in F1 score), indicating that DocEE is an open issue. DocEE is now available at https://github.com/ tongmeihan1995/DocEE.git. 2022-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7471 info:doi/10.18653/v1/2022.naacl-main.291 https://ink.library.smu.edu.sg/context/sis_research/article/8474/viewcontent/2022.naacl_main.291.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems Graphics and Human Computer Interfaces |
spellingShingle |
Databases and Information Systems Graphics and Human Computer Interfaces TONG, Meihan XU, Bin WANG, Shuai HAN, Meihuan CAO, Yixin ZHU, Jiangqi CHEN, Siyu HOU, Lei LI, Juanzi DocEE: A large-scale and fine-grained benchmark for document-level event extraction |
description |
Event extraction aims to identify an event and then extract the arguments participating in the event. Despite the great success in sentencelevel event extraction, events are more naturally presented in the form of documents, with event arguments scattered in multiple sentences. However, a major barrier to promote documentlevel event extraction has been the lack of large-scale and practical training and evaluation datasets. In this paper, we present DocEE, a new document-level event extraction dataset including 27,000+ events, 180,000+ arguments. We highlight three features: largescale manual annotations, fine-grained argument types and application-oriented settings. Experiments show that there is still a big gap between state-of-the-art models and human beings (41% Vs 85% in F1 score), indicating that DocEE is an open issue. DocEE is now available at https://github.com/ tongmeihan1995/DocEE.git. |
format |
text |
author |
TONG, Meihan XU, Bin WANG, Shuai HAN, Meihuan CAO, Yixin ZHU, Jiangqi CHEN, Siyu HOU, Lei LI, Juanzi |
author_facet |
TONG, Meihan XU, Bin WANG, Shuai HAN, Meihuan CAO, Yixin ZHU, Jiangqi CHEN, Siyu HOU, Lei LI, Juanzi |
author_sort |
TONG, Meihan |
title |
DocEE: A large-scale and fine-grained benchmark for document-level event extraction |
title_short |
DocEE: A large-scale and fine-grained benchmark for document-level event extraction |
title_full |
DocEE: A large-scale and fine-grained benchmark for document-level event extraction |
title_fullStr |
DocEE: A large-scale and fine-grained benchmark for document-level event extraction |
title_full_unstemmed |
DocEE: A large-scale and fine-grained benchmark for document-level event extraction |
title_sort |
docee: a large-scale and fine-grained benchmark for document-level event extraction |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7471 https://ink.library.smu.edu.sg/context/sis_research/article/8474/viewcontent/2022.naacl_main.291.pdf |
_version_ |
1770576352321208320 |