Learning the countability of English nouns from corpus data
This paper describes a method for learning the countability preferences of English nouns from raw text corpora. The method maps the corpus-attested lexico-syntactic properties of each noun onto a feature vector, and uses a suite of memory-based classifiers t...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2011
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/92279 http://hdl.handle.net/10220/6825 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-92279 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-922792020-03-07T12:10:36Z Learning the countability of English nouns from corpus data Baldwin, Timothy Bond, Francis School of Humanities and Social Sciences Annual Meeting of the Association for Computational Linguistics (41st : 2003) DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics This paper describes a method for learning the countability preferences of English nouns from raw text corpora. The method maps the corpus-attested lexico-syntactic properties of each noun onto a feature vector, and uses a suite of memory-based classifiers to predict membership in 4 countability classes. We were able to assign countability to English nouns with a precision of 94.6%. Accepted version 2011-06-13T07:45:02Z 2019-12-06T18:20:34Z 2011-06-13T07:45:02Z 2019-12-06T18:20:34Z 2003 2003 Conference Paper Baldwin, T., & Bond, F. (2003). Learning the countability of English nouns from corpus data. Proceedings of 41st Annual Meeting of the Association for Computational Linguistics: ACL-2003, 463-470. https://hdl.handle.net/10356/92279 http://hdl.handle.net/10220/6825 10.3115/1075096.1075155 155550 en © 2003 ACL. This is the author created version of a work that has been peer reviewed and accepted for publication by Proceedings of 41st Annual Meeting of the Association for Computational Linguistics: ACL-2003, Association for Computational Linguistics. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [DOI: http://dx.doi.org/10.3115/1075096.1075155]. 8 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics |
spellingShingle |
DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics Baldwin, Timothy Bond, Francis Learning the countability of English nouns from corpus data |
description |
This paper describes a method for learning
the countability preferences of English
nouns from raw text corpora. The method
maps the corpus-attested lexico-syntactic
properties of each noun onto a feature
vector, and uses a suite of memory-based
classifiers to predict membership in 4
countability classes. We were able to assign
countability to English nouns with a
precision of 94.6%. |
author2 |
School of Humanities and Social Sciences |
author_facet |
School of Humanities and Social Sciences Baldwin, Timothy Bond, Francis |
format |
Conference or Workshop Item |
author |
Baldwin, Timothy Bond, Francis |
author_sort |
Baldwin, Timothy |
title |
Learning the countability of English nouns from corpus data |
title_short |
Learning the countability of English nouns from corpus data |
title_full |
Learning the countability of English nouns from corpus data |
title_fullStr |
Learning the countability of English nouns from corpus data |
title_full_unstemmed |
Learning the countability of English nouns from corpus data |
title_sort |
learning the countability of english nouns from corpus data |
publishDate |
2011 |
url |
https://hdl.handle.net/10356/92279 http://hdl.handle.net/10220/6825 |
_version_ |
1681035329821212672 |