Semantic annotation of a Japanese speech corpus

This paper describes the semantic annotations we are performing on the CallHome Japanese corpus of spontaneous, unscripted telephone conversations (LDC, 1996). Our annotations include (i) semantic classes for all nouns and verbs; (ii) verb senses for all main verbs; and (iii) relations between main...

Full description

Saved in:
Bibliographic Details
Main Authors: Bond, Francis, Fry, John.
Other Authors: School of Humanities and Social Sciences
Format: Conference or Workshop Item
Language:English
Published: 2010
Subjects:
Online Access:https://hdl.handle.net/10356/83172
http://hdl.handle.net/10220/6434
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-83172
record_format dspace
spelling sg-ntu-dr.10356-831722019-12-06T15:13:18Z Semantic annotation of a Japanese speech corpus Bond, Francis Fry, John. School of Humanities and Social Sciences Workshop on Semantic Annotation and Intelligent Content (2000 : Luxembourg) DRNTU::Humanities::Language::Japanese DRNTU::Humanities::Linguistics::Semantics This paper describes the semantic annotations we are performing on the CallHome Japanese corpus of spontaneous, unscripted telephone conversations (LDC, 1996). Our annotations include (i) semantic classes for all nouns and verbs; (ii) verb senses for all main verbs; and (iii) relations between main verbs and their complements in the same utterance. Our semantic tagset is taken from NTT's Goi-Taikei semantic lexicon and ontology (Ikehara et al., 1997). A pilot study demonstrates that the verb sense tagging can be efficiently performed by native Japanese speakers using computergenerated HTML forms, and that good interannotator reliability can be obtained in the right conditions. Accepted version 2010-09-08T01:34:56Z 2019-12-06T15:13:18Z 2010-09-08T01:34:56Z 2019-12-06T15:13:18Z 2000 2000 Conference Paper Fry, J., & Bond, F. (2000). Semantic annotation of a Japanese speech corpus. In COLING Workshop on Semantic Annotation and Intelligent Content: pp.1-8. https://hdl.handle.net/10356/83172 http://hdl.handle.net/10220/6434 155596 en © 2000 ACL This is the author created version of a work that has been peer reviewed and accepted for publication by In COLING Workshop on Semantic Annotation and Intelligent Content, Association for Computational Linguistics. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. 8 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Humanities::Language::Japanese
DRNTU::Humanities::Linguistics::Semantics
spellingShingle DRNTU::Humanities::Language::Japanese
DRNTU::Humanities::Linguistics::Semantics
Bond, Francis
Fry, John.
Semantic annotation of a Japanese speech corpus
description This paper describes the semantic annotations we are performing on the CallHome Japanese corpus of spontaneous, unscripted telephone conversations (LDC, 1996). Our annotations include (i) semantic classes for all nouns and verbs; (ii) verb senses for all main verbs; and (iii) relations between main verbs and their complements in the same utterance. Our semantic tagset is taken from NTT's Goi-Taikei semantic lexicon and ontology (Ikehara et al., 1997). A pilot study demonstrates that the verb sense tagging can be efficiently performed by native Japanese speakers using computergenerated HTML forms, and that good interannotator reliability can be obtained in the right conditions.
author2 School of Humanities and Social Sciences
author_facet School of Humanities and Social Sciences
Bond, Francis
Fry, John.
format Conference or Workshop Item
author Bond, Francis
Fry, John.
author_sort Bond, Francis
title Semantic annotation of a Japanese speech corpus
title_short Semantic annotation of a Japanese speech corpus
title_full Semantic annotation of a Japanese speech corpus
title_fullStr Semantic annotation of a Japanese speech corpus
title_full_unstemmed Semantic annotation of a Japanese speech corpus
title_sort semantic annotation of a japanese speech corpus
publishDate 2010
url https://hdl.handle.net/10356/83172
http://hdl.handle.net/10220/6434
_version_ 1681043107296051200