Bridge text and knowledge by learning multi-prototype entity mention embedding

Integrating text and knowledge into a unified semantic space has attracted significant research interests recently. However, the ambiguity in the common space remains a challenge, namely that the same mention phrase usually refers to various entities. In this paper, to deal with the ambiguity of ent...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Yixin, HUANG, Lifu, JI, Heng, CHEN, Xu, LI, Juanzi
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7468
https://ink.library.smu.edu.sg/context/sis_research/article/8471/viewcontent/P17_1149.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8471
record_format dspace
spelling sg-smu-ink.sis_research-84712022-10-20T07:07:32Z Bridge text and knowledge by learning multi-prototype entity mention embedding CAO, Yixin HUANG, Lifu JI, Heng CHEN, Xu LI, Juanzi Integrating text and knowledge into a unified semantic space has attracted significant research interests recently. However, the ambiguity in the common space remains a challenge, namely that the same mention phrase usually refers to various entities. In this paper, to deal with the ambiguity of entity mentions, we propose a novel Multi-Prototype Mention Embedding model, which learns multiple sense embeddings for each mention by jointly modeling words from textual contexts and entities derived from a knowledge base. In addition, we further design an efficient language model based approach to disambiguate each mention to a specific sense. In experiments, both qualitative and quantitative analysis demonstrate the high quality of the word, entity and multi-prototype mention embeddings. Using entity linking as a study case, we apply our disambiguation method as well as the multi-prototype mention embeddings on the benchmark dataset, and achieve state-of-the-art performance. 2017-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7468 info:doi/10.18653/v1/P17-1149 https://ink.library.smu.edu.sg/context/sis_research/article/8471/viewcontent/P17_1149.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Databases and Information Systems
Graphics and Human Computer Interfaces
CAO, Yixin
HUANG, Lifu
JI, Heng
CHEN, Xu
LI, Juanzi
Bridge text and knowledge by learning multi-prototype entity mention embedding
description Integrating text and knowledge into a unified semantic space has attracted significant research interests recently. However, the ambiguity in the common space remains a challenge, namely that the same mention phrase usually refers to various entities. In this paper, to deal with the ambiguity of entity mentions, we propose a novel Multi-Prototype Mention Embedding model, which learns multiple sense embeddings for each mention by jointly modeling words from textual contexts and entities derived from a knowledge base. In addition, we further design an efficient language model based approach to disambiguate each mention to a specific sense. In experiments, both qualitative and quantitative analysis demonstrate the high quality of the word, entity and multi-prototype mention embeddings. Using entity linking as a study case, we apply our disambiguation method as well as the multi-prototype mention embeddings on the benchmark dataset, and achieve state-of-the-art performance.
format text
author CAO, Yixin
HUANG, Lifu
JI, Heng
CHEN, Xu
LI, Juanzi
author_facet CAO, Yixin
HUANG, Lifu
JI, Heng
CHEN, Xu
LI, Juanzi
author_sort CAO, Yixin
title Bridge text and knowledge by learning multi-prototype entity mention embedding
title_short Bridge text and knowledge by learning multi-prototype entity mention embedding
title_full Bridge text and knowledge by learning multi-prototype entity mention embedding
title_fullStr Bridge text and knowledge by learning multi-prototype entity mention embedding
title_full_unstemmed Bridge text and knowledge by learning multi-prototype entity mention embedding
title_sort bridge text and knowledge by learning multi-prototype entity mention embedding
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/7468
https://ink.library.smu.edu.sg/context/sis_research/article/8471/viewcontent/P17_1149.pdf
_version_ 1770576351840960512