T2CMT: Tagalog-to-Cebuano machine translation

Bibliographic Details
Main Author: Fat, Jacqueline Gemillan
Format: text
Language: English
Published: Animo Repository 2004
Online Access: https://animorepository.dlsu.edu.ph/etd_masteral/3229
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=10067&context=etd_masteral
Institution: De La Salle University
Description
Summary: T2CMT is a unidirectional machine translation system that translates from Tagalog to Cebuano. Morphological analysis is based on TagSA (Tagalog Stemming Algorithm) and an affix-correspondence-based POS (part-of-speech) tagger. The POS tagger uses a new method, but it does not resolve ambiguity and is limited to a one-to-one mapping between words and parts of speech. The syntax analyzer accepts the output of the POS tagger according to the formal grammar defined by the system. Transfer is implemented through affix and root transfers. The rules used in morphological synthesis are the reverse of those used in morphological analysis. A Tagalog-to-Cebuano bilingual dictionary was developed and is used by the different components of the system. T2CMT was evaluated with the Book of Genesis as input, using GTM (General Text Matcher), which is based on precision and recall. The evaluation indicates good performance: a precision of 0.8027 (80.27%) and a recall of 0.7992 (79.92%).
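
To make the reported metrics concrete, the following is a minimal sketch of how word-level precision and recall can be computed between a system translation and a reference translation. It is a simplified unigram-overlap illustration, not the GTM tool itself and not the evaluation code used in the thesis; the function name `precision_recall` and the placeholder token strings are assumptions made for this example.

```python
from collections import Counter


def precision_recall(candidate: str, reference: str) -> tuple[float, float]:
    """Unigram precision and recall of a candidate against one reference.

    Hypothetical helper for illustration only; GTM's actual scoring
    is not reproduced here.
    """
    cand_tokens = candidate.split()
    ref_tokens = reference.split()

    # Clipped overlap: a reference token can be matched at most as many
    # times as it occurs in the reference.
    matches = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    # Precision: share of candidate tokens that were matched.
    precision = matches / len(cand_tokens) if cand_tokens else 0.0
    # Recall: share of reference tokens that were matched.
    recall = matches / len(ref_tokens) if ref_tokens else 0.0
    return precision, recall


# Placeholder tokens stand in for words of real translations.
p, r = precision_recall("a b c d", "a b d e f")
print(p, r)  # 0.75 0.6
```

In this toy case three of the four candidate tokens appear in the reference (precision 0.75), and three of the five reference tokens are covered (recall 0.6). The reported figures of 0.8027 and 0.7992 are read the same way, as proportions of matched material on the system side and the reference side respectively.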