Joint representation learning of cross-lingual words and entities via attentive distant supervision

Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and ena...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Yixin, HOU, Lei, LI, Juanzi, LIU, Zhiyuan, LI, Chengjiang, CHEN, Xu, DONG, Tiansi
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7465
https://ink.library.smu.edu.sg/context/sis_research/article/8468/viewcontent/D18_1021.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8468
record_format dspace
spelling sg-smu-ink.sis_research-84682022-10-20T07:09:02Z Joint representation learning of cross-lingual words and entities via attentive distant supervision CAO, Yixin HOU, Lei LI, Juanzi LIU, Zhiyuan LI, Chengjiang CHEN, Xu DONG, Tiansi Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel corpora, and automatically generates comparable data via distant supervision using multi-lingual knowledge bases. We utilize two types of regularizers to align cross-lingual words and entities, and design knowledge attention and crosslingual attention to further reduce noises. We conducted a series of experiments on three tasks: word translation, entity relatedness, and cross-lingual entity linking. The results, both qualitatively and quantitatively, demonstrate the significance of our method. 2018-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7465 info:doi/10.18653/v1/D18-1021 https://ink.library.smu.edu.sg/context/sis_research/article/8468/viewcontent/D18_1021.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Databases and Information Systems
Graphics and Human Computer Interfaces
CAO, Yixin
HOU, Lei
LI, Juanzi
LIU, Zhiyuan
LI, Chengjiang
CHEN, Xu
DONG, Tiansi
Joint representation learning of cross-lingual words and entities via attentive distant supervision
description Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel corpora, and automatically generates comparable data via distant supervision using multi-lingual knowledge bases. We utilize two types of regularizers to align cross-lingual words and entities, and design knowledge attention and crosslingual attention to further reduce noises. We conducted a series of experiments on three tasks: word translation, entity relatedness, and cross-lingual entity linking. The results, both qualitatively and quantitatively, demonstrate the significance of our method.
format text
author CAO, Yixin
HOU, Lei
LI, Juanzi
LIU, Zhiyuan
LI, Chengjiang
CHEN, Xu
DONG, Tiansi
author_facet CAO, Yixin
HOU, Lei
LI, Juanzi
LIU, Zhiyuan
LI, Chengjiang
CHEN, Xu
DONG, Tiansi
author_sort CAO, Yixin
title Joint representation learning of cross-lingual words and entities via attentive distant supervision
title_short Joint representation learning of cross-lingual words and entities via attentive distant supervision
title_full Joint representation learning of cross-lingual words and entities via attentive distant supervision
title_fullStr Joint representation learning of cross-lingual words and entities via attentive distant supervision
title_full_unstemmed Joint representation learning of cross-lingual words and entities via attentive distant supervision
title_sort joint representation learning of cross-lingual words and entities via attentive distant supervision
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/7465
https://ink.library.smu.edu.sg/context/sis_research/article/8468/viewcontent/D18_1021.pdf
_version_ 1770576343103176704