Document selection for extracting entity and relationship instances of terrorist events

In this chapter, we study the problem of selecting documents so as to extract terrorist event information from a collection of documents. We represent an event by its entity and relation instances. Very often, these entity and relation instances have to be extracted from multiple documents. We there...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Zhen, LIM, Ee Peng, CHANG, Kuiyu, Suryanto, Maggy Anastasia, Gunaratna, Rohan Kumar
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2008
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/848
https://ink.library.smu.edu.sg/context/sis_research/article/1847/viewcontent/Sun2008_Chapter_DocumentSelectionForExtracting.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1847
record_format dspace
spelling sg-smu-ink.sis_research-18472018-06-22T04:16:51Z Document selection for extracting entity and relationship instances of terrorist events SUN, Zhen LIM, Ee Peng CHANG, Kuiyu Suryanto, Maggy Anastasia Gunaratna, Rohan Kumar In this chapter, we study the problem of selecting documents so as to extract terrorist event information from a collection of documents. We represent an event by its entity and relation instances. Very often, these entity and relation instances have to be extracted from multiple documents. We therefore define an information extraction (IE) task as selecting documents and extracting from which entity and relation instances relevant to a user-specified event (aka domain specific event entity and relation extraction). We adopt domain specific IE patterns to extract potentially relevant entity and relation instances from documents, and develop a number of document ranking strategies using the extracted instances to address this extraction task. Each ranking strategy (aka pattern-based document ranking strategy) assigns a score to each document, which estimates the latter's contribution to the gain in event related instances. We conducted experiments on two document collection datasets constructed using two historical terrorism events. Experiments showed that our proposed patternbased document ranking strategies performed well on the domain specific event entity and relation extraction task for document collections of various sizes. 2008-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/848 info:doi/10.1007/978-0-387-71613-8_15 https://ink.library.smu.edu.sg/context/sis_research/article/1847/viewcontent/Sun2008_Chapter_DocumentSelectionForExtracting.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Defense and Security Studies Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Defense and Security Studies
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Defense and Security Studies
Numerical Analysis and Scientific Computing
SUN, Zhen
LIM, Ee Peng
CHANG, Kuiyu
Suryanto, Maggy Anastasia
Gunaratna, Rohan Kumar
Document selection for extracting entity and relationship instances of terrorist events
description In this chapter, we study the problem of selecting documents so as to extract terrorist event information from a collection of documents. We represent an event by its entity and relation instances. Very often, these entity and relation instances have to be extracted from multiple documents. We therefore define an information extraction (IE) task as selecting documents and extracting from which entity and relation instances relevant to a user-specified event (aka domain specific event entity and relation extraction). We adopt domain specific IE patterns to extract potentially relevant entity and relation instances from documents, and develop a number of document ranking strategies using the extracted instances to address this extraction task. Each ranking strategy (aka pattern-based document ranking strategy) assigns a score to each document, which estimates the latter's contribution to the gain in event related instances. We conducted experiments on two document collection datasets constructed using two historical terrorism events. Experiments showed that our proposed patternbased document ranking strategies performed well on the domain specific event entity and relation extraction task for document collections of various sizes.
format text
author SUN, Zhen
LIM, Ee Peng
CHANG, Kuiyu
Suryanto, Maggy Anastasia
Gunaratna, Rohan Kumar
author_facet SUN, Zhen
LIM, Ee Peng
CHANG, Kuiyu
Suryanto, Maggy Anastasia
Gunaratna, Rohan Kumar
author_sort SUN, Zhen
title Document selection for extracting entity and relationship instances of terrorist events
title_short Document selection for extracting entity and relationship instances of terrorist events
title_full Document selection for extracting entity and relationship instances of terrorist events
title_fullStr Document selection for extracting entity and relationship instances of terrorist events
title_full_unstemmed Document selection for extracting entity and relationship instances of terrorist events
title_sort document selection for extracting entity and relationship instances of terrorist events
publisher Institutional Knowledge at Singapore Management University
publishDate 2008
url https://ink.library.smu.edu.sg/sis_research/848
https://ink.library.smu.edu.sg/context/sis_research/article/1847/viewcontent/Sun2008_Chapter_DocumentSelectionForExtracting.pdf
_version_ 1770570737049927680