Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801

In recent years, Machine Translation has shown promising results and received much interest of researchers. Two approaches that have been widely used for machine translation are Phrase-based Statistical Machine Translation (PBSMT) and Neural Machine Translation (NMT). During translation, both appro...

Full description

Saved in:

Bibliographic Details
Main Author:	Nguyễn, Minh Thuận, d1993-
Other Authors:	Nguyễn, Phương Thái
Format:	Theses and Dissertations
Language:	English
Published:	2019
Subjects:	Công nghệ thông tin 6.3
Online Access:	http://repository.vnu.edu.vn/handle/VNU_123/65766
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Vietnam National University, Hanoi
Language:	English

id	oai:112.137.131.14:VNU_123-65766
record_format	dspace
spelling	oai:112.137.131.14:VNU_123-657662019-08-08T07:33:53Z Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801 Nguyễn, Minh Thuận, d1993- Nguyễn, Phương Thái Công nghệ thông tin 6.3 In recent years, Machine Translation has shown promising results and received much interest of researchers. Two approaches that have been widely used for machine translation are Phrase-based Statistical Machine Translation (PBSMT) and Neural Machine Translation (NMT). During translation, both approaches rely heavily on large amounts of bilingual corpora which require much effort and financial support. The lack of bilingual data leads to a poor phrase-table, which is one of the main components of PBSMT, and the unknown word problem in NMT. In contrast, monolingual data are available for most of the languages. Thanks to the advantage, many models of word embedding and cross-lingual word embedding have been appeared to improve the quality of various tasks in natural language processing. The purpose of this thesis is to propose two models for using cross-lingual word embedding models to address the above impediment. The first model enhances the quality of the phrase-table in SMT, and the remaining model tackles the unknown word problem in NMT. Publications: ? Minh-Thuan Nguyen, Van-Tan Bui, Huy-Hien Vu, Phuong-Thai Nguyen and Chi-Mai Luong. Enhancing the quality of Phrase-table in Statistical Machine Translation for Less-Common and Low-Resource Languages. In the 2018 International Conference on Asian Language Processing (IALP 2018). Luận văn ThS. Khoa học máy tính: 848010101 -- Trường Đại học Công nghệ. Đại học Quốc gia Hà Nội, 2018 2019-07-16T05:12:40Z 2019-07-16T05:12:40Z 2018 Thesis http://repository.vnu.edu.vn/handle/VNU_123/65766 006.3 NG-T 2018 / 00051000326 eng 43 tr. application/pdf
institution	Vietnam National University, Hanoi
building	VNU Library & Information Center
country	Vietnam
collection	VNU Digital Repository
language	English
topic	Công nghệ thông tin 6.3
spellingShingle	Công nghệ thông tin 6.3 Nguyễn, Minh Thuận, d1993- Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801
description	In recent years, Machine Translation has shown promising results and received much interest of researchers. Two approaches that have been widely used for machine translation are Phrase-based Statistical Machine Translation (PBSMT) and Neural Machine Translation (NMT). During translation, both approaches rely heavily on large amounts of bilingual corpora which require much effort and financial support. The lack of bilingual data leads to a poor phrase-table, which is one of the main components of PBSMT, and the unknown word problem in NMT. In contrast, monolingual data are available for most of the languages. Thanks to the advantage, many models of word embedding and cross-lingual word embedding have been appeared to improve the quality of various tasks in natural language processing. The purpose of this thesis is to propose two models for using cross-lingual word embedding models to address the above impediment. The first model enhances the quality of the phrase-table in SMT, and the remaining model tackles the unknown word problem in NMT. Publications: ? Minh-Thuan Nguyen, Van-Tan Bui, Huy-Hien Vu, Phuong-Thai Nguyen and Chi-Mai Luong. Enhancing the quality of Phrase-table in Statistical Machine Translation for Less-Common and Low-Resource Languages. In the 2018 International Conference on Asian Language Processing (IALP 2018).
author2	Nguyễn, Phương Thái
author_facet	Nguyễn, Phương Thái Nguyễn, Minh Thuận, d1993-
format	Theses and Dissertations
author	Nguyễn, Minh Thuận, d1993-
author_sort	Nguyễn, Minh Thuận, d1993-
title	Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801
title_short	Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801
title_full	Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801
title_fullStr	Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801
title_full_unstemmed	Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801
title_sort	enhancing the quality of machine translation system using cross-lingual word embedding models = nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. luận văn ths. máy tính: 84801
publishDate	2019
url	http://repository.vnu.edu.vn/handle/VNU_123/65766
_version_	1680966506000678912

Enhancing the quality of Machine Translation System Using Cross-Lingual Word Embedding Models = Nâng cao chất lượng của hệ thống dịch máy dựa trên các mô hình vector nhúng biểu diễn từ giữa hai ngôn ngữ. Luận văn ThS. Máy tính: 84801

Similar Items