Discourse structure in machine translation evaluation

In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then...

Full description

Saved in:
Bibliographic Details
Main Authors: Joty, Shafiq, Guzmán, Francisco, Màrquez, Lluís, Nakov, Preslav
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10356/88461
http://hdl.handle.net/10220/46925
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-88461
record_format dspace
spelling sg-ntu-dr.10356-884612020-03-07T11:48:59Z Discourse structure in machine translation evaluation Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav School of Computer Science and Engineering Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then, we show that a simple linear combination with these measures can help improve various existing machine translation evaluation metrics regarding correlation with human judgments both at the segment level and at the system level. This suggests that discourse information is complementary to the information used by many of the existing evaluation metrics, and thus it could be taken into account when developing richer evaluation metrics, such as the WMT-14 winning combined metric DiscoTKparty. We also provide a detailed analysis of the relevance of various discourse elements and relations from the RST parse trees for machine translation evaluation. In particular, we show that (i) all aspects of the RST tree are relevant, (ii) nuclearity is more useful than relation type, and (iii) the similarity of the translation RST tree to the reference RST tree is positively correlated with translation quality. Published version 2018-12-12T08:08:35Z 2019-12-06T17:03:49Z 2018-12-12T08:08:35Z 2019-12-06T17:03:49Z 2017 Journal Article Joty, S., Guzmán, F., Màrquez, L., & Nakov, P. (2017). Discourse structure in machine translation evaluation. Computational Linguistics, 43(4), 683-722. doi:10.1162/COLI_a_00298 0891-2017 https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925 10.1162/COLI_a_00298 en Computational Linguistics © 2017 Association for Computational Linguistics. Published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license 40 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Computer Aided Language Translation
Machine Translation Evaluation
DRNTU::Engineering::Computer science and engineering
spellingShingle Computer Aided Language Translation
Machine Translation Evaluation
DRNTU::Engineering::Computer science and engineering
Joty, Shafiq
Guzmán, Francisco
Màrquez, Lluís
Nakov, Preslav
Discourse structure in machine translation evaluation
description In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then, we show that a simple linear combination with these measures can help improve various existing machine translation evaluation metrics regarding correlation with human judgments both at the segment level and at the system level. This suggests that discourse information is complementary to the information used by many of the existing evaluation metrics, and thus it could be taken into account when developing richer evaluation metrics, such as the WMT-14 winning combined metric DiscoTKparty. We also provide a detailed analysis of the relevance of various discourse elements and relations from the RST parse trees for machine translation evaluation. In particular, we show that (i) all aspects of the RST tree are relevant, (ii) nuclearity is more useful than relation type, and (iii) the similarity of the translation RST tree to the reference RST tree is positively correlated with translation quality.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Joty, Shafiq
Guzmán, Francisco
Màrquez, Lluís
Nakov, Preslav
format Article
author Joty, Shafiq
Guzmán, Francisco
Màrquez, Lluís
Nakov, Preslav
author_sort Joty, Shafiq
title Discourse structure in machine translation evaluation
title_short Discourse structure in machine translation evaluation
title_full Discourse structure in machine translation evaluation
title_fullStr Discourse structure in machine translation evaluation
title_full_unstemmed Discourse structure in machine translation evaluation
title_sort discourse structure in machine translation evaluation
publishDate 2018
url https://hdl.handle.net/10356/88461
http://hdl.handle.net/10220/46925
_version_ 1681044913425219584