Discourse structure in machine translation evaluation

In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then...

Full description

Saved in:

Bibliographic Details
Main Authors:	Joty, Shafiq, Guzmán, Francisco, Màrquez, Lluís, Nakov, Preslav
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2018
Subjects:	Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-88461
record_format	dspace
spelling	sg-ntu-dr.10356-884612020-03-07T11:48:59Z Discourse structure in machine translation evaluation Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav School of Computer Science and Engineering Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then, we show that a simple linear combination with these measures can help improve various existing machine translation evaluation metrics regarding correlation with human judgments both at the segment level and at the system level. This suggests that discourse information is complementary to the information used by many of the existing evaluation metrics, and thus it could be taken into account when developing richer evaluation metrics, such as the WMT-14 winning combined metric DiscoTKparty. We also provide a detailed analysis of the relevance of various discourse elements and relations from the RST parse trees for machine translation evaluation. In particular, we show that (i) all aspects of the RST tree are relevant, (ii) nuclearity is more useful than relation type, and (iii) the similarity of the translation RST tree to the reference RST tree is positively correlated with translation quality. Published version 2018-12-12T08:08:35Z 2019-12-06T17:03:49Z 2018-12-12T08:08:35Z 2019-12-06T17:03:49Z 2017 Journal Article Joty, S., Guzmán, F., Màrquez, L., & Nakov, P. (2017). Discourse structure in machine translation evaluation. Computational Linguistics, 43(4), 683-722. doi:10.1162/COLI_a_00298 0891-2017 https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925 10.1162/COLI_a_00298 en Computational Linguistics © 2017 Association for Computational Linguistics. Published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license 40 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering
spellingShingle	Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav Discourse structure in machine translation evaluation
description	In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then, we show that a simple linear combination with these measures can help improve various existing machine translation evaluation metrics regarding correlation with human judgments both at the segment level and at the system level. This suggests that discourse information is complementary to the information used by many of the existing evaluation metrics, and thus it could be taken into account when developing richer evaluation metrics, such as the WMT-14 winning combined metric DiscoTKparty. We also provide a detailed analysis of the relevance of various discourse elements and relations from the RST parse trees for machine translation evaluation. In particular, we show that (i) all aspects of the RST tree are relevant, (ii) nuclearity is more useful than relation type, and (iii) the similarity of the translation RST tree to the reference RST tree is positively correlated with translation quality.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav
format	Article
author	Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav
author_sort	Joty, Shafiq
title	Discourse structure in machine translation evaluation
title_short	Discourse structure in machine translation evaluation
title_full	Discourse structure in machine translation evaluation
title_fullStr	Discourse structure in machine translation evaluation
title_full_unstemmed	Discourse structure in machine translation evaluation
title_sort	discourse structure in machine translation evaluation
publishDate	2018
url	https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925
_version_	1681044913425219584

Discourse structure in machine translation evaluation

Similar Items