2-step word alignment framework for Thai-English statistical machine translation

© 2017 International Information Institute. This paper presents a framework for a new word alignment process for statistical machine translation (SMT), together with a method that improves the translation table by tuning its lexical probabilities with a bilingual dictionary. First, the alignment method w...

Bibliographic Details
Main Authors: Prasert Luekhong, Taneth Ruangrajitpakorn, Rattasit Sukhahuta, Thepchai Supnithi
Format: Journal
Published: 2018
Subjects: Computer Science
Online Access: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85040814274&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/57064
Institution: Chiang Mai University
id th-cmuir.6653943832-57064
record_format dspace
spelling th-cmuir.6653943832-57064 2018-09-05T03:34:30Z 2-step word alignment framework for Thai-English statistical machine translation Prasert Luekhong Taneth Ruangrajitpakorn Rattasit Sukhahuta Thepchai Supnithi Computer Science 2018-09-05T03:34:30Z 2018-09-05T03:34:30Z 2017-09-01 Journal 13448994 13434500 2-s2.0-85040814274 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85040814274&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/57064
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Computer Science
description © 2017 International Information Institute. This paper presents a framework for a new word alignment process for statistical machine translation (SMT), together with a method that improves the translation table by tuning its lexical probabilities with a bilingual dictionary. The alignment method is designed to combine the quality of a dictionary used as prior knowledge with the ability of co-occurrence statistics to cover words the dictionary does not contain. To test the proposed framework against the widely used GIZA, we supplied the alignment models from both systems to Moses, to assess their usefulness in practical hierarchical phrase-based translation, and used the BLEU score as the measure. The case study focuses on Thai-to-English translation. The test results show that the proposed method outperforms GIZA IBM Model 4 by 2.09 BLEU points.
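The two-step idea in the description (dictionary links first, co-occurrence for the words the dictionary misses) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their scoring function, so the Dice coefficient is swapped in here as a simple co-occurrence measure, and the function names `dice_scores` and `align`, the romanized toy tokens, and the tiny dictionary are all hypothetical. Thai input is assumed to be word-segmented already.

```python
from collections import Counter
from itertools import product


def dice_scores(bitext):
    """Dice coefficient for every source/target word pair that shares a sentence pair."""
    src_count, tgt_count, pair_count = Counter(), Counter(), Counter()
    for src, tgt in bitext:
        src_set, tgt_set = set(src), set(tgt)
        src_count.update(src_set)
        tgt_count.update(tgt_set)
        pair_count.update(product(src_set, tgt_set))
    return {
        (s, t): 2 * c / (src_count[s] + tgt_count[t])
        for (s, t), c in pair_count.items()
    }


def align(src, tgt, dictionary, scores):
    """Return sorted (source_index, target_index) links for one sentence pair."""
    links, used_tgt = set(), set()

    # Step 1: link every word pair that the bilingual dictionary confirms.
    for i, s in enumerate(src):
        for j, t in enumerate(tgt):
            if t in dictionary.get(s, ()):
                links.add((i, j))
                used_tgt.add(j)
    aligned_src = {i for i, _ in links}

    # Step 2: for each source word still unaligned, take the free target
    # word with the highest co-occurrence score (ties go to the lower index).
    for i, s in enumerate(src):
        if i in aligned_src:
            continue
        candidates = [
            (scores.get((s, t), 0.0), j)
            for j, t in enumerate(tgt)
            if j not in used_tgt
        ]
        if candidates:
            best_score, best_j = max(candidates, key=lambda c: (c[0], -c[1]))
            if best_score > 0:
                links.add((i, best_j))
                used_tgt.add(best_j)
    return sorted(links)


# Hypothetical toy data: romanized Thai tokens, already word-segmented.
bitext = [
    (["maeo", "kin", "pla"], ["cat", "eats", "fish"]),
    (["ma", "kin", "pla"], ["dog", "eats", "fish"]),
    (["maeo", "non"], ["cat", "sleeps"]),
]
dictionary = {"maeo": {"cat"}, "pla": {"fish"}}  # deliberately incomplete
scores = dice_scores(bitext)

for src, tgt in bitext:
    print(src, tgt, align(src, tgt, dictionary, scores))
```

In this sketch the dictionary acts as a hard prior: its links are never overridden, and co-occurrence only fills the gaps it leaves. A real system would more likely treat the dictionary as a soft prior on lexical probabilities.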
format Journal
author Prasert Luekhong
Taneth Ruangrajitpakorn
Rattasit Sukhahuta
Thepchai Supnithi
title 2-step word alignment framework for Thai-English statistical machine translation
publishDate 2018
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85040814274&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/57064