The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese

Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There are at least two kinds of semantic information: lexical semantics for words a...

Full description

Saved in:
Bibliographic Details
Main Authors: Tanaka, Takaaki, Bond, Francis, Fujita, Sanae
Other Authors: School of Humanities and Social Sciences
Format: Conference or Workshop Item
Language:English
Published: 2011
Subjects:
Online Access:https://hdl.handle.net/10356/79572
http://hdl.handle.net/10220/7278
http://dl.acm.org/citation.cfm?id=1641999
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-79572
record_format dspace
spelling sg-ntu-dr.10356-795722019-12-06T13:28:30Z The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese Tanaka, Takaaki Bond, Francis Fujita, Sanae School of Humanities and Social Sciences Workshop on Frontiers in Linguistically Annotated Corpora (2006 : Sydney, Australia) DRNTU::Humanities::Linguistics Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There are at least two kinds of semantic information: lexical semantics for words and phrases and structural semantics for phrases and sentences. We have built a Japanese corpus of over three million words with both lexical and structural semantic information. In this paper, we focus on our method of annotating the lexical semantics, that is building a word sense tagged corpus and its properties. Published version 2011-10-17T02:49:41Z 2019-12-06T13:28:30Z 2011-10-17T02:49:41Z 2019-12-06T13:28:30Z 2006 2006 Conference Paper Tanaka, T., Bond, F., & Fujita, S. (2006). The Hinoki Sensebank: A Large-Scale Word Sense Tagged Corpus of Japanese. Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora, pp.62-69. https://hdl.handle.net/10356/79572 http://hdl.handle.net/10220/7278 http://dl.acm.org/citation.cfm?id=1641999 155522 en © 2006 Association for Computational Linguistics. This paper was published in Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006 and is made available as an electronic reprint (preprint) with permission of Association for Computational Linguistics. The paper can be found at the following official URL: http://dl.acm.org/citation.cfm?id=1641999.  One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper is prohibited and is subject to penalties under law. 8 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Humanities::Linguistics
spellingShingle DRNTU::Humanities::Linguistics
Tanaka, Takaaki
Bond, Francis
Fujita, Sanae
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
description Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There are at least two kinds of semantic information: lexical semantics for words and phrases and structural semantics for phrases and sentences. We have built a Japanese corpus of over three million words with both lexical and structural semantic information. In this paper, we focus on our method of annotating the lexical semantics, that is building a word sense tagged corpus and its properties.
author2 School of Humanities and Social Sciences
author_facet School of Humanities and Social Sciences
Tanaka, Takaaki
Bond, Francis
Fujita, Sanae
format Conference or Workshop Item
author Tanaka, Takaaki
Bond, Francis
Fujita, Sanae
author_sort Tanaka, Takaaki
title The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
title_short The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
title_full The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
title_fullStr The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
title_full_unstemmed The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
title_sort hinoki sensebank : a large-scale word sense tagged corpus of japanese
publishDate 2011
url https://hdl.handle.net/10356/79572
http://hdl.handle.net/10220/7278
http://dl.acm.org/citation.cfm?id=1641999
_version_ 1681045106867568640