Building a Filipino colloquialism translator using sequence-to-sequence model

Colloquialism in the Philippines has been prominently used in day-to-day conversations. Its vast emergence is evident especially on social media platforms but poses issues in terms of understandability to certain groups. For this research, machine translators have been implemented to fill in that ga...

Full description

Saved in:
Bibliographic Details
Main Authors: Nocon, Nicco Louis S., Kho, Nyssa Michelle D., Arroyo, Jeniffer
Format: text
Published: Animo Repository 2018
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/66
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-1065
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-10652022-08-13T03:14:55Z Building a Filipino colloquialism translator using sequence-to-sequence model Nocon, Nicco Louis S. Kho, Nyssa Michelle D. Arroyo, Jeniffer Colloquialism in the Philippines has been prominently used in day-to-day conversations. Its vast emergence is evident especially on social media platforms but poses issues in terms of understandability to certain groups. For this research, machine translators have been implemented to fill in that gap. The translators cover Filipino Textspeak or Shortcuts, Swardspeak or Gay-lingo, Conyo, and Datkilab-implemented on Tensorflow library and Moses tool. Implementing in Tensorflow achieved 85.88 BLEU score when evaluated to the training data and 14.67 to the test data, while Moses garnered 95.27 BLEU score on training data and 79.91 on test data. Analyses on both implementations include advantages and disadvantages in using each one. Through the analyses and development of this research, it is recommended to implement the following in the future: addition of colloquialism samples, experimentation on sequence-to-sequence configurations, applying Graphical User Interface (GUI) to the translators, implementing the translators to Natural Language Processing (NLP) tools, and to deploy the translators as a web application. 2018-07-02T07:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/66 Faculty Research Work Animo Repository Machine translating Colloquial language--Translating Filipino language--Spoken Filipino--Translating Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Machine translating
Colloquial language--Translating
Filipino language--Spoken Filipino--Translating
Computer Sciences
spellingShingle Machine translating
Colloquial language--Translating
Filipino language--Spoken Filipino--Translating
Computer Sciences
Nocon, Nicco Louis S.
Kho, Nyssa Michelle D.
Arroyo, Jeniffer
Building a Filipino colloquialism translator using sequence-to-sequence model
description Colloquialism in the Philippines has been prominently used in day-to-day conversations. Its vast emergence is evident especially on social media platforms but poses issues in terms of understandability to certain groups. For this research, machine translators have been implemented to fill in that gap. The translators cover Filipino Textspeak or Shortcuts, Swardspeak or Gay-lingo, Conyo, and Datkilab-implemented on Tensorflow library and Moses tool. Implementing in Tensorflow achieved 85.88 BLEU score when evaluated to the training data and 14.67 to the test data, while Moses garnered 95.27 BLEU score on training data and 79.91 on test data. Analyses on both implementations include advantages and disadvantages in using each one. Through the analyses and development of this research, it is recommended to implement the following in the future: addition of colloquialism samples, experimentation on sequence-to-sequence configurations, applying Graphical User Interface (GUI) to the translators, implementing the translators to Natural Language Processing (NLP) tools, and to deploy the translators as a web application.
format text
author Nocon, Nicco Louis S.
Kho, Nyssa Michelle D.
Arroyo, Jeniffer
author_facet Nocon, Nicco Louis S.
Kho, Nyssa Michelle D.
Arroyo, Jeniffer
author_sort Nocon, Nicco Louis S.
title Building a Filipino colloquialism translator using sequence-to-sequence model
title_short Building a Filipino colloquialism translator using sequence-to-sequence model
title_full Building a Filipino colloquialism translator using sequence-to-sequence model
title_fullStr Building a Filipino colloquialism translator using sequence-to-sequence model
title_full_unstemmed Building a Filipino colloquialism translator using sequence-to-sequence model
title_sort building a filipino colloquialism translator using sequence-to-sequence model
publisher Animo Repository
publishDate 2018
url https://animorepository.dlsu.edu.ph/faculty_research/66
_version_ 1743177702908624896