2-step word alignment framework for Thai-English statistical machine translation
© 2017 International Information Institute. This paper presents a framework for a new word alignment process for SMT, together with a method that improves the translation table using a bilingual dictionary as a lexical probability tuning step. First, the alignment method w...
Main Authors: | Prasert Luekhong, Taneth Ruangrajitpakorn, Rattasit Sukhahuta, Thepchai Supnithi |
---|---|
Format: | Journal |
Published: | 2018 |
Subjects: | Agricultural and Biological Sciences |
Online Access: | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85040814274&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/46644 |
Institution: | Chiang Mai University |
id |
th-cmuir.6653943832-46644 |
---|---|
record_format |
dspace |
spelling |
th-cmuir.6653943832-46644; 2018-04-25T07:20:59Z; 2-step word alignment framework for Thai-English statistical machine translation; Prasert Luekhong; Taneth Ruangrajitpakorn; Rattasit Sukhahuta; Thepchai Supnithi; Agricultural and Biological Sciences; 2018-04-25T06:58:53Z; 2018-04-25T06:58:53Z; 2017-09-01; Journal; 13448994; 13434500; 2-s2.0-85040814274; https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85040814274&origin=inward; http://cmuir.cmu.ac.th/jspui/handle/6653943832/46644 |
institution |
Chiang Mai University |
building |
Chiang Mai University Library |
country |
Thailand |
collection |
CMU Intellectual Repository |
topic |
Agricultural and Biological Sciences |
description |
© 2017 International Information Institute. This paper presents a framework for a new word alignment process for SMT, together with a method that improves the translation table using a bilingual dictionary as a lexical probability tuning step. First, the alignment method was designed to combine the quality of a dictionary used as prior knowledge with the ability of co-occurrence to cover unknown words. To test the proposed framework against the widely used GIZA, we applied the alignment models from both systems to Moses to show their usefulness in practical hierarchical phrase-based translation, with BLEU score as the measure. The case study in this work focused on Thai to English translation. The test results showed that the proposed method can outperform GIZA IBM Model 4 by 2.09 BLEU points. |
format |
Journal |
author |
Prasert Luekhong, Taneth Ruangrajitpakorn, Rattasit Sukhahuta, Thepchai Supnithi |
author_sort |
Prasert Luekhong |
title |
2-step word alignment framework for Thai-English statistical machine translation |
publishDate |
2018 |
url |
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85040814274&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/46644 |
_version_ |
1681422912676954112 |
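
The description above outlines a two-step idea: use a bilingual dictionary as prior knowledge, then rely on co-occurrence to cover words the dictionary does not know. The record does not give the paper's actual algorithm, so the following is only a minimal sketch under stated assumptions: the Dice coefficient stands in for the unspecified co-occurrence measure, and the function names, the toy corpus, and the toy dictionary are all hypothetical.

```python
from collections import Counter
from itertools import product


def dice_scores(corpus):
    """Dice coefficient for each source/target word pair seen in the same sentence pair.

    Assumption: Dice is a stand-in; the record does not name the co-occurrence measure.
    """
    src_count, tgt_count, pair_count = Counter(), Counter(), Counter()
    for src, tgt in corpus:
        src_set, tgt_set = set(src), set(tgt)
        src_count.update(src_set)
        tgt_count.update(tgt_set)
        pair_count.update(product(src_set, tgt_set))
    return {
        (s, t): 2 * c / (src_count[s] + tgt_count[t])
        for (s, t), c in pair_count.items()
    }


def two_step_align(src, tgt, dictionary, dice):
    """Return sorted (source_index, target_index) links for one sentence pair."""
    links, unaligned = [], []

    # Step 1: link words whose dictionary translation appears in the target sentence.
    for i, s in enumerate(src):
        hits = [j for j, t in enumerate(tgt) if t in dictionary.get(s, ())]
        if hits:
            links.extend((i, j) for j in hits)
        else:
            unaligned.append(i)

    # Step 2: for words the dictionary missed, pick the still-free target word
    # with the highest co-occurrence score (ties go to the leftmost target word).
    used = {j for _, j in links}
    for i in unaligned:
        candidates = [
            (dice.get((src[i], t), 0.0), j)
            for j, t in enumerate(tgt)
            if j not in used
        ]
        if candidates:
            score, j = max(candidates, key=lambda c: (c[0], -c[1]))
            if score > 0:
                links.append((i, j))
                used.add(j)
    return sorted(links)


# Hypothetical toy data: pre-tokenized Thai/English sentence pairs and a tiny dictionary.
corpus = [
    (["แมว", "กิน", "ปลา"], ["cat", "eats", "fish"]),
    (["หมา", "กิน", "ปลา"], ["dog", "eats", "fish"]),
    (["แมว", "นอน"], ["cat", "sleeps"]),
]
dictionary = {"แมว": {"cat"}, "ปลา": {"fish"}}
dice = dice_scores(corpus)

for src, tgt in corpus:
    print(src, tgt, two_step_align(src, tgt, dictionary, dice))
```

In this sketch the dictionary links are fixed first and never overridden, which reflects the "prior knowledge" role described in the abstract; a real system would more likely fold the dictionary into the lexical probabilities rather than apply it as a hard constraint.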