Automatic identification of cross-document structural relationships

Analysis on inter-document relationship is one of the important studies in multi document analysis. In this paper, we will focus on some special properties that multi document articles hold, specifically news articles. Information across news articles reporting on the same story are often related. C...

Full description

Saved in:
Bibliographic Details
Main Authors: Jaya Kumar, Yogan, Salim, Naomie, Hamza, Ahmed, Abuobieda, Albarraa
Format: Book Section
Published: IEEE 2012
Subjects:
Online Access:http://eprints.utm.my/id/eprint/34547/
http://dx.doi.org/10.1109/InfRKM.2012.6204977
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Description
Summary:Analysis on inter-document relationship is one of the important studies in multi document analysis. In this paper, we will focus on some special properties that multi document articles hold, specifically news articles. Information across news articles reporting on the same story are often related. Cross-document Structure Theory (CST) gives the relationship between pairs of sentences from different documents. For example, two sentences might have relationships such as identical, overlapping or contradicting. Our aim here is to automatically identify some of these CST relationships. We applied the well known machine learning technique, SVMs for this purpose and obtained some comparable results.