Lexicon splitting in lexical disambiguation for Malay morphological analysis and stemming

Lexical ambiguity is one of the problems faced by morphological analyser and stemmer. It is caused by ambiguous word form like homonym, which could direct the tools to produce incorrect output. Thus a method that can resolve ambiguity may improve the performance of such tools. Malay word affixation...

Full description

Saved in:
Bibliographic Details
Main Authors: Sharum, Mohd Yunus, Abdullah, Muhamad Taufik, Sulaiman, Md Nasir, Azmi Murad, Masrah Azrifah, Zainon Hamzah, Zaitul Azma
Format: Article
Language:English
Published: Advanced Institute of Convergence Information Technology 2013
Online Access:http://psasir.upm.edu.my/id/eprint/30565/1/Lexicon%20splitting%20in%20lexical%20disambiguation%20for%20Malay%20morphological%20analysis%20and%20stemming.pdf
http://psasir.upm.edu.my/id/eprint/30565/
http://www.globalcis.org/jnit/global/paper_detail.html?jname=JNIT&q=175
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Putra Malaysia
Language: English
Description
Summary:Lexical ambiguity is one of the problems faced by morphological analyser and stemmer. It is caused by ambiguous word form like homonym, which could direct the tools to produce incorrect output. Thus a method that can resolve ambiguity may improve the performance of such tools. Malay word affixation differentiates between monosyllable and multisyllable word. A disambiguation method is proposed for tools that use lexicon for analysis and stemming, by splitting the lexicon into monosyllable and multisyllable words. We found that this feature could help to resolve ambiguity involving monosyllable words, improve language’s exception handling and improve storage lookup.This would be useful for Malay morphological analysis and stemming as this method does not require document-level context analysis of the analysed word.