Multiword expressions : a pain in the neck for NLP

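Multiword expressions are a key problem for the development of large-scale, linguistically sound natural language processing technology. This paper surveys the problem and some currently available analytic techniques. The various kinds of multiword expressions should be analyzed in distinct ways, including listing “words with spaces”, hierarchically organized lexicons, restricted combinatoric rules, lexical selection, “idiomatic constructions” and simple statistical affinity. An adequate comprehensive analysis of multiword expressions must employ both symbolic and statistical techniques.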

Bibliographic Details
Main Authors: Sag, Ivan A.; Baldwin, Timothy; Bond, Francis; Copestake, Ann; Flickinger, Dan
Other Authors: School of Humanities and Social Sciences
Format: Conference or Workshop Item
Language: English
Published: 2002 (deposited 2011)
Subjects: DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
Online Access: https://hdl.handle.net/10356/79581
http://hdl.handle.net/10220/6828
Institution: Nanyang Technological University
id sg-ntu-dr.10356-79581
record_format dspace
spelling sg-ntu-dr.10356-79581
2020-03-07T12:10:36Z
Multiword expressions : a pain in the neck for NLP
Sag, Ivan A.
Baldwin, Timothy
Bond, Francis
Copestake, Ann
Flickinger, Dan
School of Humanities and Social Sciences
International Conference on Computational Linguistics and Intelligent Text Processing (3rd : 2002 : Mexico City, Mexico)
DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
Multiword expressions are a key problem for the development of large-scale, linguistically sound natural language processing technology. This paper surveys the problem and some currently available analytic techniques. The various kinds of multiword expressions should be analyzed in distinct ways, including listing “words with spaces”, hierarchically organized lexicons, restricted combinatoric rules, lexical selection, “idiomatic constructions” and simple statistical affinity. An adequate comprehensive analysis of multiword expressions must employ both symbolic and statistical techniques.
Accepted version
2011-06-13T08:49:25Z 2019-12-06T13:28:39Z 2011-06-13T08:49:25Z 2019-12-06T13:28:39Z
2002 2002
Conference Paper
Sag, I. A., Baldwin, T., Bond, F., Copestake, A., & Flickinger, D. (2002). Multiword expressions: A pain in the neck for NLP. Proceedings of Computational Linguistics and Intelligent Text Processing: Third International Conference: CICLing-2002, Lecture Notes in Computer Science, 2276, 1-15.
https://hdl.handle.net/10356/79581
http://hdl.handle.net/10220/6828
10.1007/3-540-45715-1_1
155580
en
© 2002 Springer. This is the author created version of a work that has been peer reviewed and accepted for publication by Proceedings of Computational Linguistics and Intelligent Text Processing: Third International Conference: CICLing-2002, LNCS, Springer. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [DOI: http://dx.doi.org/10.1007/3-540-45715-1_1].
15 p.
application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
spellingShingle DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
Sag, Ivan A.
Baldwin, Timothy
Bond, Francis
Copestake, Ann
Flickinger, Dan
Multiword expressions : a pain in the neck for NLP
description Multiword expressions are a key problem for the development of large-scale, linguistically sound natural language processing technology. This paper surveys the problem and some currently available analytic techniques. The various kinds of multiword expressions should be analyzed in distinct ways, including listing “words with spaces”, hierarchically organized lexicons, restricted combinatoric rules, lexical selection, “idiomatic constructions” and simple statistical affinity. An adequate comprehensive analysis of multiword expressions must employ both symbolic and statistical techniques.
author2 School of Humanities and Social Sciences
author_facet School of Humanities and Social Sciences
Sag, Ivan A.
Baldwin, Timothy
Bond, Francis
Copestake, Ann
Flickinger, Dan
format Conference or Workshop Item
author Sag, Ivan A.
Baldwin, Timothy
Bond, Francis
Copestake, Ann
Flickinger, Dan
author_sort Sag, Ivan A.
title Multiword expressions : a pain in the neck for NLP
title_short Multiword expressions : a pain in the neck for NLP
title_full Multiword expressions : a pain in the neck for NLP
title_fullStr Multiword expressions : a pain in the neck for NLP
title_full_unstemmed Multiword expressions : a pain in the neck for NLP
title_sort multiword expressions : a pain in the neck for nlp
publishDate 2011
url https://hdl.handle.net/10356/79581
http://hdl.handle.net/10220/6828
_version_ 1681038984389591040