Lexicon splitting in lexical disambiguation for Malay morphological analysis and stemming
Lexical ambiguity is one of the problems faced by morphological analyser and stemmer. It is caused by ambiguous word form like homonym, which could direct the tools to produce incorrect output. Thus a method that can resolve ambiguity may improve the performance of such tools. Malay word affixation...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Advanced Institute of Convergence Information Technology
2013
|
Online Access: | http://psasir.upm.edu.my/id/eprint/30565/1/Lexicon%20splitting%20in%20lexical%20disambiguation%20for%20Malay%20morphological%20analysis%20and%20stemming.pdf http://psasir.upm.edu.my/id/eprint/30565/ http://www.globalcis.org/jnit/global/paper_detail.html?jname=JNIT&q=175 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Putra Malaysia |
Language: | English |
Summary: | Lexical ambiguity is one of the problems faced by morphological analyser and stemmer. It is caused
by ambiguous word form like homonym, which could direct the tools to produce incorrect output. Thus a method that can resolve ambiguity may improve the performance of such tools. Malay word affixation differentiates between monosyllable and multisyllable word. A disambiguation method is proposed for tools that use lexicon for analysis and stemming, by splitting the lexicon into monosyllable and multisyllable words. We found that this feature could help to resolve ambiguity involving monosyllable words, improve language’s exception handling and improve storage lookup.This would be useful for Malay morphological analysis and stemming as this method does not require
document-level context analysis of the analysed word. |
---|