Mapping entity sets in news archives across time

We propose a novel way of utilizing and accessing information stored in news archives as well as a new style of investigating the history. Our idea is to automatically generate similar entity pairs given two sets of entities, one from the past and one representing the present. This allows performing...

Full description

Saved in:
Bibliographic Details
Main Authors: Duan, Yijun, Jatowt, Adam, Bhowmick, Sourav S., Yoshikawa, Masatoshi
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/144030
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-144030
record_format dspace
spelling sg-ntu-dr.10356-1440302020-10-09T01:40:12Z Mapping entity sets in news archives across time Duan, Yijun Jatowt, Adam Bhowmick, Sourav S. Yoshikawa, Masatoshi School of Computer Science and Engineering Engineering::Computer science and engineering Comparable Entity Mining Typicality Analysis We propose a novel way of utilizing and accessing information stored in news archives as well as a new style of investigating the history. Our idea is to automatically generate similar entity pairs given two sets of entities, one from the past and one representing the present. This allows performing entity-oriented mapping between different times. We introduce an effective method to solve the aforementioned task based on a concise integer linear programming framework. In particular, our model first conducts typicality analysis to estimate entity representativeness. It next constructs orthogonal transformation between the two entity collections. The result is a set of typical across-time comparables. We demonstrate the effectiveness of our approach on the New York Times dataset through both qualitative and quantitative tests. Published version This research has been supported by JSPS KAKENHI grants (#17H01828, #18K19841). 2020-10-09T01:40:12Z 2020-10-09T01:40:12Z 2019 Journal Article Duan, Y., Jatowt, A., Bhowmick, S. S., & Yoshikawa, M. (2019). Mapping entity sets in news archives across time. Data Science and Engineering, 4(3), 208-222. doi:10.1007/s41019-019-00102-3 2364-1185 https://hdl.handle.net/10356/144030 10.1007/s41019-019-00102-3 3 4 208 222 en Data Science and Engineering © 2019 The Author(s). This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Comparable Entity Mining
Typicality Analysis
spellingShingle Engineering::Computer science and engineering
Comparable Entity Mining
Typicality Analysis
Duan, Yijun
Jatowt, Adam
Bhowmick, Sourav S.
Yoshikawa, Masatoshi
Mapping entity sets in news archives across time
description We propose a novel way of utilizing and accessing information stored in news archives as well as a new style of investigating the history. Our idea is to automatically generate similar entity pairs given two sets of entities, one from the past and one representing the present. This allows performing entity-oriented mapping between different times. We introduce an effective method to solve the aforementioned task based on a concise integer linear programming framework. In particular, our model first conducts typicality analysis to estimate entity representativeness. It next constructs orthogonal transformation between the two entity collections. The result is a set of typical across-time comparables. We demonstrate the effectiveness of our approach on the New York Times dataset through both qualitative and quantitative tests.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Duan, Yijun
Jatowt, Adam
Bhowmick, Sourav S.
Yoshikawa, Masatoshi
format Article
author Duan, Yijun
Jatowt, Adam
Bhowmick, Sourav S.
Yoshikawa, Masatoshi
author_sort Duan, Yijun
title Mapping entity sets in news archives across time
title_short Mapping entity sets in news archives across time
title_full Mapping entity sets in news archives across time
title_fullStr Mapping entity sets in news archives across time
title_full_unstemmed Mapping entity sets in news archives across time
title_sort mapping entity sets in news archives across time
publishDate 2020
url https://hdl.handle.net/10356/144030
_version_ 1681056668043968512