Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli

Stemming is important thing to improve retrieval effectiveness. Stemming is used to reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to truncate the word into the root word that will reduce vocabulary size and improve recall. The Malay affixes consist of...

Full description

Saved in:
Bibliographic Details
Main Author: Ghazalli, Edatul Muliana
Format: Thesis
Language:English
Published: 2005
Online Access:http://ir.uitm.edu.my/id/eprint/1429/1/TB_EDATUL%20MULIANA%20GHAZALLI%20CS%2005_5%20P01.pdf
http://ir.uitm.edu.my/id/eprint/1429/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Mara
Language: English
id my.uitm.ir.1429
record_format eprints
spelling my.uitm.ir.14292019-04-05T07:12:54Z http://ir.uitm.edu.my/id/eprint/1429/ Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli Ghazalli, Edatul Muliana Stemming is important thing to improve retrieval effectiveness. Stemming is used to reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to truncate the word into the root word that will reduce vocabulary size and improve recall. The Malay affixes consist of four different types such as prefix, prefix-suffix, suffix and infix. An effective and powerfiil of Malay stemmer is it just not to move the suffixes rules only but it must remove all four types of affixes. Without removing all the affixes, the stem caimot be effectively used to index of Malay documents. So in order to get the best order of morphological rule for effective and powerfiil stemmer the researcher has to find out the best order of morphological rule to stem Malay words based on first character for each alphabet. This project involves the use of two combinations simultaneously. The words that could not stem correctly by the first combination of best order which is primary will shift to alternative combination of best order of morphological rule. The resuhs of experiment B, which is enhance project is better than experiment A, which is Rules-Application-Order (RAO) by Fatimah (1995) because that algorithm has successfully stemmed all word begin with alphabet "A" until "Z" that extracted fi-om Quran documents. 2005 Thesis NonPeerReviewed text en http://ir.uitm.edu.my/id/eprint/1429/1/TB_EDATUL%20MULIANA%20GHAZALLI%20CS%2005_5%20P01.pdf Ghazalli, Edatul Muliana (2005) Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli. Degree thesis, Universiti Teknologi MARA.
institution Universiti Teknologi Mara
building Tun Abdul Razak Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Mara
content_source UiTM Institutional Repository
url_provider http://ir.uitm.edu.my/
language English
description Stemming is important thing to improve retrieval effectiveness. Stemming is used to reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to truncate the word into the root word that will reduce vocabulary size and improve recall. The Malay affixes consist of four different types such as prefix, prefix-suffix, suffix and infix. An effective and powerfiil of Malay stemmer is it just not to move the suffixes rules only but it must remove all four types of affixes. Without removing all the affixes, the stem caimot be effectively used to index of Malay documents. So in order to get the best order of morphological rule for effective and powerfiil stemmer the researcher has to find out the best order of morphological rule to stem Malay words based on first character for each alphabet. This project involves the use of two combinations simultaneously. The words that could not stem correctly by the first combination of best order which is primary will shift to alternative combination of best order of morphological rule. The resuhs of experiment B, which is enhance project is better than experiment A, which is Rules-Application-Order (RAO) by Fatimah (1995) because that algorithm has successfully stemmed all word begin with alphabet "A" until "Z" that extracted fi-om Quran documents.
format Thesis
author Ghazalli, Edatul Muliana
spellingShingle Ghazalli, Edatul Muliana
Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
author_facet Ghazalli, Edatul Muliana
author_sort Ghazalli, Edatul Muliana
title Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
title_short Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
title_full Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
title_fullStr Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
title_full_unstemmed Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
title_sort enhancement of rules-application-order (rao) stemming algorithm based on the first character of malay word / edatul muliana ghazalli
publishDate 2005
url http://ir.uitm.edu.my/id/eprint/1429/1/TB_EDATUL%20MULIANA%20GHAZALLI%20CS%2005_5%20P01.pdf
http://ir.uitm.edu.my/id/eprint/1429/
_version_ 1685648142397079552