DocEE: A large-scale and fine-grained benchmark for document-level event extraction

Event extraction aims to identify an event and then extract the arguments participating in the event. Despite the great success in sentencelevel event extraction, events are more naturally presented in the form of documents, with event arguments scattered in multiple sentences. However, a major barr...

Full description

Saved in:
Bibliographic Details
Main Authors: TONG, Meihan, XU, Bin, WANG, Shuai, HAN, Meihuan, CAO, Yixin, ZHU, Jiangqi, CHEN, Siyu, HOU, Lei, LI, Juanzi
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7471
https://ink.library.smu.edu.sg/context/sis_research/article/8474/viewcontent/2022.naacl_main.291.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8474
record_format dspace
spelling sg-smu-ink.sis_research-84742022-10-20T07:06:28Z DocEE: A large-scale and fine-grained benchmark for document-level event extraction TONG, Meihan XU, Bin WANG, Shuai HAN, Meihuan CAO, Yixin ZHU, Jiangqi CHEN, Siyu HOU, Lei LI, Juanzi Event extraction aims to identify an event and then extract the arguments participating in the event. Despite the great success in sentencelevel event extraction, events are more naturally presented in the form of documents, with event arguments scattered in multiple sentences. However, a major barrier to promote documentlevel event extraction has been the lack of large-scale and practical training and evaluation datasets. In this paper, we present DocEE, a new document-level event extraction dataset including 27,000+ events, 180,000+ arguments. We highlight three features: largescale manual annotations, fine-grained argument types and application-oriented settings. Experiments show that there is still a big gap between state-of-the-art models and human beings (41% Vs 85% in F1 score), indicating that DocEE is an open issue. DocEE is now available at https://github.com/ tongmeihan1995/DocEE.git. 2022-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7471 info:doi/10.18653/v1/2022.naacl-main.291 https://ink.library.smu.edu.sg/context/sis_research/article/8474/viewcontent/2022.naacl_main.291.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Databases and Information Systems
Graphics and Human Computer Interfaces
TONG, Meihan
XU, Bin
WANG, Shuai
HAN, Meihuan
CAO, Yixin
ZHU, Jiangqi
CHEN, Siyu
HOU, Lei
LI, Juanzi
DocEE: A large-scale and fine-grained benchmark for document-level event extraction
description Event extraction aims to identify an event and then extract the arguments participating in the event. Despite the great success in sentencelevel event extraction, events are more naturally presented in the form of documents, with event arguments scattered in multiple sentences. However, a major barrier to promote documentlevel event extraction has been the lack of large-scale and practical training and evaluation datasets. In this paper, we present DocEE, a new document-level event extraction dataset including 27,000+ events, 180,000+ arguments. We highlight three features: largescale manual annotations, fine-grained argument types and application-oriented settings. Experiments show that there is still a big gap between state-of-the-art models and human beings (41% Vs 85% in F1 score), indicating that DocEE is an open issue. DocEE is now available at https://github.com/ tongmeihan1995/DocEE.git.
format text
author TONG, Meihan
XU, Bin
WANG, Shuai
HAN, Meihuan
CAO, Yixin
ZHU, Jiangqi
CHEN, Siyu
HOU, Lei
LI, Juanzi
author_facet TONG, Meihan
XU, Bin
WANG, Shuai
HAN, Meihuan
CAO, Yixin
ZHU, Jiangqi
CHEN, Siyu
HOU, Lei
LI, Juanzi
author_sort TONG, Meihan
title DocEE: A large-scale and fine-grained benchmark for document-level event extraction
title_short DocEE: A large-scale and fine-grained benchmark for document-level event extraction
title_full DocEE: A large-scale and fine-grained benchmark for document-level event extraction
title_fullStr DocEE: A large-scale and fine-grained benchmark for document-level event extraction
title_full_unstemmed DocEE: A large-scale and fine-grained benchmark for document-level event extraction
title_sort docee: a large-scale and fine-grained benchmark for document-level event extraction
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7471
https://ink.library.smu.edu.sg/context/sis_research/article/8474/viewcontent/2022.naacl_main.291.pdf
_version_ 1770576352321208320