The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun

A computer-aided language translation using a Machine translation (MT) is an application performed by computers (machines) that translates one natural language to another. There are many online language translation tools, but thus far none offers a sequence of text translations for the under-resourc...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohd Shamrie Sainin, Minah Sintian, Suraya Alias, Asni Tahir
Format: Article
Language:English
English
Published: International Association of Educators and Researchers (IAER) 2023
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/38447/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/38447/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/38447/
https://doi.org/10.33166/AETiC.2023.05.002
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sabah
Language: English
English
id my.ums.eprints.38447
record_format eprints
spelling my.ums.eprints.384472024-03-06T07:39:59Z https://eprints.ums.edu.my/id/eprint/38447/ The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun Mohd Shamrie Sainin Minah Sintian Suraya Alias Asni Tahir P306-310 Translating and interpreting QA75.5-76.95 Electronic computers. Computer science A computer-aided language translation using a Machine translation (MT) is an application performed by computers (machines) that translates one natural language to another. There are many online language translation tools, but thus far none offers a sequence of text translations for the under-resourced Kadazandusun language. Although there are web-based and mobile applications of Kadazandusun dictionaries available, the systems do not translate more than one word. Hence, this paper aims to present the discussion of the preliminary translation of Malay to Kadazandusun. The basic word-to-word with dictionary alignment translation based on Direct Machine Translation (DMT) is selected to begin the exploration of the translation domain where DMT is one of the earliest translation methods which relies on the word-to-word approach (sequence-to-sequence model). This paper aims to investigate the under-resourced language and the task of translating from the Malay language to the Kadazandusun language or vice versa. This paper presents the application and the process as well as the results of the system according to the basic Kadazandusun word arrangement (Verb-Subject-Object) and its translation quality using the Bilingual Evaluation Understudy (BLEU) score. Several phases are involved during the process, including data collection (word pair translation), preprocessing, text selection, translation procedures, and performance evaluation. The preliminary language translation approach is proven to be capable of producing up to 0.5 BLEU scores which indicate that the translation is readable, however, requires post-editing for better comprehension. The findings are significant for the quality of the under-resourced language translation and as a starting point for other machine translation methodologies such as statistical or deep learning-based translation. International Association of Educators and Researchers (IAER) 2023 Article NonPeerReviewed text en https://eprints.ums.edu.my/id/eprint/38447/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/38447/2/FULL%20TEXT.pdf Mohd Shamrie Sainin and Minah Sintian and Suraya Alias and Asni Tahir (2023) The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun. Annals of Emerging Technologies in Computing (AETiC), 7 (5). pp. 11-24. ISSN 2516-0281 https://doi.org/10.33166/AETiC.2023.05.002
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
English
topic P306-310 Translating and interpreting
QA75.5-76.95 Electronic computers. Computer science
spellingShingle P306-310 Translating and interpreting
QA75.5-76.95 Electronic computers. Computer science
Mohd Shamrie Sainin
Minah Sintian
Suraya Alias
Asni Tahir
The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun
description A computer-aided language translation using a Machine translation (MT) is an application performed by computers (machines) that translates one natural language to another. There are many online language translation tools, but thus far none offers a sequence of text translations for the under-resourced Kadazandusun language. Although there are web-based and mobile applications of Kadazandusun dictionaries available, the systems do not translate more than one word. Hence, this paper aims to present the discussion of the preliminary translation of Malay to Kadazandusun. The basic word-to-word with dictionary alignment translation based on Direct Machine Translation (DMT) is selected to begin the exploration of the translation domain where DMT is one of the earliest translation methods which relies on the word-to-word approach (sequence-to-sequence model). This paper aims to investigate the under-resourced language and the task of translating from the Malay language to the Kadazandusun language or vice versa. This paper presents the application and the process as well as the results of the system according to the basic Kadazandusun word arrangement (Verb-Subject-Object) and its translation quality using the Bilingual Evaluation Understudy (BLEU) score. Several phases are involved during the process, including data collection (word pair translation), preprocessing, text selection, translation procedures, and performance evaluation. The preliminary language translation approach is proven to be capable of producing up to 0.5 BLEU scores which indicate that the translation is readable, however, requires post-editing for better comprehension. The findings are significant for the quality of the under-resourced language translation and as a starting point for other machine translation methodologies such as statistical or deep learning-based translation.
format Article
author Mohd Shamrie Sainin
Minah Sintian
Suraya Alias
Asni Tahir
author_facet Mohd Shamrie Sainin
Minah Sintian
Suraya Alias
Asni Tahir
author_sort Mohd Shamrie Sainin
title The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun
title_short The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun
title_full The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun
title_fullStr The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun
title_full_unstemmed The Application of Computer-Aided Under-Resourced Language Translation for Malay into Kadazandusun
title_sort application of computer-aided under-resourced language translation for malay into kadazandusun
publisher International Association of Educators and Researchers (IAER)
publishDate 2023
url https://eprints.ums.edu.my/id/eprint/38447/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/38447/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/38447/
https://doi.org/10.33166/AETiC.2023.05.002
_version_ 1793154685528440832