TSDW : two-stage word sense disambiguation using Wikipedia

The semantic knowledge of Wikipedia has proved to be useful for many tasks, for example, named entity disambiguation. Among these applications, the task of identifying the word sense based on Wikipedia is a crucial component because the output of this component is often used in subsequent tasks. In...

全面介紹

Saved in:

書目詳細資料
Main Authors:	Li, Chenliang, Sun, Aixin, Datta, Anwitaman
其他作者:	School of Computer Engineering
格式:	Article
語言:	English
出版:	2013
主題:	DRNTU::Engineering::Computer science and engineering
在線閱讀:	https://hdl.handle.net/10356/107062 http://hdl.handle.net/10220/17700
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Nanyang Technological University
語言:	English

id	sg-ntu-dr.10356-107062
record_format	dspace
spelling	sg-ntu-dr.10356-1070622020-05-28T07:41:34Z TSDW : two-stage word sense disambiguation using Wikipedia Li, Chenliang Sun, Aixin Datta, Anwitaman School of Computer Engineering DRNTU::Engineering::Computer science and engineering The semantic knowledge of Wikipedia has proved to be useful for many tasks, for example, named entity disambiguation. Among these applications, the task of identifying the word sense based on Wikipedia is a crucial component because the output of this component is often used in subsequent tasks. In this article, we present a two-stage framework (called TSDW) for word sense disambiguation using knowledge latent in Wikipedia. The disambiguation of a given phrase is applied through a two-stage disambiguation process: (a) The first-stage disambiguation explores the contextual semantic information, where the noisy information is pruned for better effectiveness and efficiency; and (b) the second-stage disambiguation explores the disambiguated phrases of high confidence from the first stage to achieve better redisambiguation decisions for the phrases that are difficult to disambiguate in the first stage. Moreover, existing studies have addressed the disambiguation problem for English text only. Considering the popular usage of Wikipedia in different languages, we study the performance of TSDW and the existing state-of-the-art approaches over both English and Traditional Chinese articles. The experimental results show that TSDW generalizes well to different semantic relatedness measures and text in different languages. More important, TSDW significantly outperforms the state-of-the-art approaches with both better effectiveness and efficiency. 2013-11-15T06:55:31Z 2019-12-06T22:24:02Z 2013-11-15T06:55:31Z 2019-12-06T22:24:02Z 2013 2013 Journal Article Li, C., Sun, A., & Datta, A. (2013). TSDW: Two-stage word sense disambiguation using Wikipedia. Journal of the American Society for Information Science and Technology, 64(6), 1203-1223. https://hdl.handle.net/10356/107062 http://hdl.handle.net/10220/17700 10.1002/asi.22829 en Journal of the American society for information science and technology
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering
spellingShingle	DRNTU::Engineering::Computer science and engineering Li, Chenliang Sun, Aixin Datta, Anwitaman TSDW : two-stage word sense disambiguation using Wikipedia
description	The semantic knowledge of Wikipedia has proved to be useful for many tasks, for example, named entity disambiguation. Among these applications, the task of identifying the word sense based on Wikipedia is a crucial component because the output of this component is often used in subsequent tasks. In this article, we present a two-stage framework (called TSDW) for word sense disambiguation using knowledge latent in Wikipedia. The disambiguation of a given phrase is applied through a two-stage disambiguation process: (a) The first-stage disambiguation explores the contextual semantic information, where the noisy information is pruned for better effectiveness and efficiency; and (b) the second-stage disambiguation explores the disambiguated phrases of high confidence from the first stage to achieve better redisambiguation decisions for the phrases that are difficult to disambiguate in the first stage. Moreover, existing studies have addressed the disambiguation problem for English text only. Considering the popular usage of Wikipedia in different languages, we study the performance of TSDW and the existing state-of-the-art approaches over both English and Traditional Chinese articles. The experimental results show that TSDW generalizes well to different semantic relatedness measures and text in different languages. More important, TSDW significantly outperforms the state-of-the-art approaches with both better effectiveness and efficiency.
author2	School of Computer Engineering
author_facet	School of Computer Engineering Li, Chenliang Sun, Aixin Datta, Anwitaman
format	Article
author	Li, Chenliang Sun, Aixin Datta, Anwitaman
author_sort	Li, Chenliang
title	TSDW : two-stage word sense disambiguation using Wikipedia
title_short	TSDW : two-stage word sense disambiguation using Wikipedia
title_full	TSDW : two-stage word sense disambiguation using Wikipedia
title_fullStr	TSDW : two-stage word sense disambiguation using Wikipedia
title_full_unstemmed	TSDW : two-stage word sense disambiguation using Wikipedia
title_sort	tsdw : two-stage word sense disambiguation using wikipedia
publishDate	2013
url	https://hdl.handle.net/10356/107062 http://hdl.handle.net/10220/17700
_version_	1681059078385696768

TSDW : two-stage word sense disambiguation using Wikipedia

相似書籍