Automatic identification of two-word terms in English text.

The study compares the effectiveness of two statistical formulae, the mutual information formula and the contextual information formula, for identifying collocations, or two-word terms, in English text. A modified version of the contextual information formula was found to perform better than the mutual information formula.
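As a rough illustration of the first of the two measures, the sketch below scores adjacent word pairs with the standard pointwise mutual information estimate, log2(P(x,y) / (P(x) P(y))). This is an assumption about the general form only: the record does not give the exact formulation used in the thesis, and it does not define the contextual information formula, so that formula is not sketched here. The function name `bigram_pmi` and the `min_count` parameter are hypothetical.

```python
import math
from collections import Counter


def bigram_pmi(tokens, min_count=1):
    """Score adjacent word pairs by pointwise mutual information (base 2).

    Illustrative only: probabilities are simple relative frequencies,
    which may differ from the estimates used in the thesis.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)  # number of adjacent pairs

    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue  # skip pairs seen fewer than min_count times
        p_xy = count / n_bi
        p_x = unigrams[w1] / n_uni
        p_y = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_xy / (p_x * p_y))
    return scores


if __name__ == "__main__":
    text = "new york is big and new york is old".split()
    ranked = sorted(bigram_pmi(text).items(), key=lambda kv: kv[1], reverse=True)
    for pair, score in ranked:
        print(pair, round(score, 3))
```

Mutual information of this kind tends to give high scores to pairs of rare words, so a frequency cutoff such as `min_count` is commonly applied before ranking candidate terms.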

Bibliographic Details
Main Author: Heng, Siok Tian.
Other Authors: Khoo, Christopher Soo Guan
Format: Theses and Dissertations
Published: 2008
Online Access: http://hdl.handle.net/10356/1963
Institution: Nanyang Technological University
School: Wee Kim Wee School of Communication and Information
Degree: Master of Science (Information Studies)
Thesis Date: 2002
Topic: DRNTU::Library and information science::Libraries::Technologies
File Format: application/pdf
Building: NTU Library
Country: Singapore
Collection: DR-NTU
Record ID: sg-ntu-dr.10356-1963