Using Support Vector Machines for Terrorism Information Extraction

Information extraction (IE) is of great importance in many applications including web intelligence, search engines, text understanding, etc. To extract information from text documents, most IE systems rely on a set of extraction patterns. Each extraction pattern is defined based on the syntactic and...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Aixin, NAING, Myo-Myo, LIM, Ee Peng, LAM, Wai
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2003
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/960
https://ink.library.smu.edu.sg/context/sis_research/article/1959/viewcontent/42d5ecdc9a7a5970b512ea0da4f183a88282.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1959
record_format dspace
spelling sg-smu-ink.sis_research-19592018-06-25T06:55:47Z Using Support Vector Machines for Terrorism Information Extraction SUN, Aixin NAING, Myo-Myo LIM, Ee Peng LAM, Wai Information extraction (IE) is of great importance in many applications including web intelligence, search engines, text understanding, etc. To extract information from text documents, most IE systems rely on a set of extraction patterns. Each extraction pattern is defined based on the syntactic and/or semantic constraints on the positions of desired entities within natural language sentences. The IE systems also provide a set of pattern templates that determines the kind of syntactic and semantic constraints to be considered. In this paper, we argue that such pattern templates restricts the kind of extraction patterns that can be learned by IE systems. To allow a wider range of context information to be considered in learning extraction patterns, we first propose to model the content and context information of a candidate entity to be extracted as a set of features. A classification model is then built for each category of entities using Support Vector Machines (SVM). We have conducted IE experiments to evaluate our proposed method on a text collection in the terrorism domain. From the preliminary experimental results, we conclude that our proposed method can deliver reasonable accuracies. 2003-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/960 info:doi/10.1007/3-540-44853-5_1 https://ink.library.smu.edu.sg/context/sis_research/article/1959/viewcontent/42d5ecdc9a7a5970b512ea0da4f183a88282.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
SUN, Aixin
NAING, Myo-Myo
LIM, Ee Peng
LAM, Wai
Using Support Vector Machines for Terrorism Information Extraction
description Information extraction (IE) is of great importance in many applications including web intelligence, search engines, text understanding, etc. To extract information from text documents, most IE systems rely on a set of extraction patterns. Each extraction pattern is defined based on the syntactic and/or semantic constraints on the positions of desired entities within natural language sentences. The IE systems also provide a set of pattern templates that determines the kind of syntactic and semantic constraints to be considered. In this paper, we argue that such pattern templates restricts the kind of extraction patterns that can be learned by IE systems. To allow a wider range of context information to be considered in learning extraction patterns, we first propose to model the content and context information of a candidate entity to be extracted as a set of features. A classification model is then built for each category of entities using Support Vector Machines (SVM). We have conducted IE experiments to evaluate our proposed method on a text collection in the terrorism domain. From the preliminary experimental results, we conclude that our proposed method can deliver reasonable accuracies.
format text
author SUN, Aixin
NAING, Myo-Myo
LIM, Ee Peng
LAM, Wai
author_facet SUN, Aixin
NAING, Myo-Myo
LIM, Ee Peng
LAM, Wai
author_sort SUN, Aixin
title Using Support Vector Machines for Terrorism Information Extraction
title_short Using Support Vector Machines for Terrorism Information Extraction
title_full Using Support Vector Machines for Terrorism Information Extraction
title_fullStr Using Support Vector Machines for Terrorism Information Extraction
title_full_unstemmed Using Support Vector Machines for Terrorism Information Extraction
title_sort using support vector machines for terrorism information extraction
publisher Institutional Knowledge at Singapore Management University
publishDate 2003
url https://ink.library.smu.edu.sg/sis_research/960
https://ink.library.smu.edu.sg/context/sis_research/article/1959/viewcontent/42d5ecdc9a7a5970b512ea0da4f183a88282.pdf
_version_ 1770570794496163840