T2CMT: Tagalog-to-Cebuano machine translation
Main Author: | |
---|---|
Format: | text |
Language: | English |
Published: | Animo Repository, 2004 |
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etd_masteral/3229 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=10067&context=etd_masteral |
Institution: | De La Salle University |
Summary: | T2CMT is a unidirectional machine translation system that translates from Tagalog to Cebuano. Morphological analysis is based on TagSA (Tagalog Stemming Algorithm) and a part-of-speech (POS) tagger based on affix correspondence. The POS tagger uses a new method, but it does not resolve ambiguity and is limited to a one-to-one mapping between words and parts of speech. The syntax analyzer accepts the data passed by the POS tagger according to the formal grammar defined by the system. Transfer is implemented through affix and root transfers. The rules used in morphological synthesis are the reverse of those used in morphological analysis. A Tagalog-to-Cebuano bilingual dictionary was developed and is used by the different components of the system. T2CMT was evaluated on the Book of Genesis using GTM (General Text Matcher), which is based on precision and recall. The evaluation indicates good performance, with a precision of 0.8027 (80.27%) and a recall of 0.7992 (79.92%). |
---|
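
The precision and recall figures in the summary can be illustrated with a minimal sketch. It assumes a simplified, bag-of-words overlap between a system translation and a reference translation; the full GTM measure can additionally reward longer contiguous matching runs, which this sketch omits. The function name `unigram_precision_recall` and the placeholder tokens are hypothetical and do not come from the thesis.

```python
from collections import Counter


def unigram_precision_recall(candidate: str, reference: str) -> tuple[float, float]:
    """Return (precision, recall) from clipped unigram overlap.

    Simplified illustration only: not the full GTM scoring procedure.
    """
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()
    if not cand_tokens or not ref_tokens:
        return 0.0, 0.0
    # Counter intersection keeps, for each word, the smaller of the two counts,
    # so a word repeated in the candidate is not credited more than it
    # appears in the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())
    precision = overlap / len(cand_tokens)  # share of system output that matches
    recall = overlap / len(ref_tokens)      # share of the reference that is covered
    return precision, recall


# Placeholder tokens stand in for a system output and a reference sentence.
precision, recall = unigram_precision_recall("a b c d e", "a b c f")
print(precision, recall)
```

Here three of the five candidate tokens appear in the reference, and three of the four reference tokens are covered, so precision is lower than recall.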