The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese
Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There are at least two kinds of semantic information: lexical semantics for words a...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2011
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/79572 http://hdl.handle.net/10220/7278 http://dl.acm.org/citation.cfm?id=1641999 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-79572 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-795722019-12-06T13:28:30Z The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese Tanaka, Takaaki Bond, Francis Fujita, Sanae School of Humanities and Social Sciences Workshop on Frontiers in Linguistically Annotated Corpora (2006 : Sydney, Australia) DRNTU::Humanities::Linguistics Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There are at least two kinds of semantic information: lexical semantics for words and phrases and structural semantics for phrases and sentences. We have built a Japanese corpus of over three million words with both lexical and structural semantic information. In this paper, we focus on our method of annotating the lexical semantics, that is building a word sense tagged corpus and its properties. Published version 2011-10-17T02:49:41Z 2019-12-06T13:28:30Z 2011-10-17T02:49:41Z 2019-12-06T13:28:30Z 2006 2006 Conference Paper Tanaka, T., Bond, F., & Fujita, S. (2006). The Hinoki Sensebank: A Large-Scale Word Sense Tagged Corpus of Japanese. Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora, pp.62-69. https://hdl.handle.net/10356/79572 http://hdl.handle.net/10220/7278 http://dl.acm.org/citation.cfm?id=1641999 155522 en © 2006 Association for Computational Linguistics. This paper was published in Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006 and is made available as an electronic reprint (preprint) with permission of Association for Computational Linguistics. The paper can be found at the following official URL: http://dl.acm.org/citation.cfm?id=1641999. One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper is prohibited and is subject to penalties under law. 8 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Humanities::Linguistics |
spellingShingle |
DRNTU::Humanities::Linguistics Tanaka, Takaaki Bond, Francis Fujita, Sanae The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese |
description |
Semantic information is important for precise word sense disambiguation system and the kind of semantic analysis used in sophisticated natural language processing such as machine translation, question answering, etc. There are at least two kinds of semantic information: lexical semantics for words and phrases and structural semantics for phrases and sentences.
We have built a Japanese corpus of over three million words with both lexical and structural semantic information. In this paper, we focus on our method of annotating the lexical semantics, that is building a word sense tagged corpus and its properties. |
author2 |
School of Humanities and Social Sciences |
author_facet |
School of Humanities and Social Sciences Tanaka, Takaaki Bond, Francis Fujita, Sanae |
format |
Conference or Workshop Item |
author |
Tanaka, Takaaki Bond, Francis Fujita, Sanae |
author_sort |
Tanaka, Takaaki |
title |
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese |
title_short |
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese |
title_full |
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese |
title_fullStr |
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese |
title_full_unstemmed |
The Hinoki sensebank : a large-scale word sense tagged corpus of Japanese |
title_sort |
hinoki sensebank : a large-scale word sense tagged corpus of japanese |
publishDate |
2011 |
url |
https://hdl.handle.net/10356/79572 http://hdl.handle.net/10220/7278 http://dl.acm.org/citation.cfm?id=1641999 |
_version_ |
1681045106867568640 |