Discourse structure in machine translation evaluation
In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2018
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-88461 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-884612020-03-07T11:48:59Z Discourse structure in machine translation evaluation Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav School of Computer Science and Engineering Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then, we show that a simple linear combination with these measures can help improve various existing machine translation evaluation metrics regarding correlation with human judgments both at the segment level and at the system level. This suggests that discourse information is complementary to the information used by many of the existing evaluation metrics, and thus it could be taken into account when developing richer evaluation metrics, such as the WMT-14 winning combined metric DiscoTKparty. We also provide a detailed analysis of the relevance of various discourse elements and relations from the RST parse trees for machine translation evaluation. In particular, we show that (i) all aspects of the RST tree are relevant, (ii) nuclearity is more useful than relation type, and (iii) the similarity of the translation RST tree to the reference RST tree is positively correlated with translation quality. Published version 2018-12-12T08:08:35Z 2019-12-06T17:03:49Z 2018-12-12T08:08:35Z 2019-12-06T17:03:49Z 2017 Journal Article Joty, S., Guzmán, F., Màrquez, L., & Nakov, P. (2017). Discourse structure in machine translation evaluation. Computational Linguistics, 43(4), 683-722. doi:10.1162/COLI_a_00298 0891-2017 https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925 10.1162/COLI_a_00298 en Computational Linguistics © 2017 Association for Computational Linguistics. Published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license 40 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering |
spellingShingle |
Computer Aided Language Translation Machine Translation Evaluation DRNTU::Engineering::Computer science and engineering Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav Discourse structure in machine translation evaluation |
description |
In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory (RST). Then, we show that a simple linear combination with these measures can help improve various existing machine translation evaluation metrics regarding correlation with human judgments both at the segment level and at the system level. This suggests that discourse information is complementary to the information used by many of the existing evaluation metrics, and thus it could be taken into account when developing richer evaluation metrics, such as the WMT-14 winning combined metric DiscoTKparty. We also provide a detailed analysis of the relevance of various discourse elements and relations from the RST parse trees for machine translation evaluation. In particular, we show that (i) all aspects of the RST tree are relevant, (ii) nuclearity is more useful than relation type, and (iii) the similarity of the translation RST tree to the reference RST tree is positively correlated with translation quality. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav |
format |
Article |
author |
Joty, Shafiq Guzmán, Francisco Màrquez, Lluís Nakov, Preslav |
author_sort |
Joty, Shafiq |
title |
Discourse structure in machine translation evaluation |
title_short |
Discourse structure in machine translation evaluation |
title_full |
Discourse structure in machine translation evaluation |
title_fullStr |
Discourse structure in machine translation evaluation |
title_full_unstemmed |
Discourse structure in machine translation evaluation |
title_sort |
discourse structure in machine translation evaluation |
publishDate |
2018 |
url |
https://hdl.handle.net/10356/88461 http://hdl.handle.net/10220/46925 |
_version_ |
1681044913425219584 |