Factual consistency evaluation for text summarization via counterfactual estimation

Despite significant progress has been achieved in text summarization, factual inconsistency in generated summaries still severely limits its practical applications. Among the key factors to ensure factual consistency, a reliable automatic evaluation metric is the first and the most crucial one. Howe...

Full description

Saved in:

Bibliographic Details
Main Authors:	XIE, Yuexiang, SUN, Fei, DENG, Yang, LI, Yaliang, DING, Bolin
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2021
Subjects:	Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/9144 https://ink.library.smu.edu.sg/context/sis_research/article/10147/viewcontent/2021.findings_emnlp.10.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10147
record_format	dspace
spelling	sg-smu-ink.sis_research-101472024-08-01T09:19:45Z Factual consistency evaluation for text summarization via counterfactual estimation XIE, Yuexiang SUN, Fei DENG, Yang LI, Yaliang DING, Bolin Despite significant progress has been achieved in text summarization, factual inconsistency in generated summaries still severely limits its practical applications. Among the key factors to ensure factual consistency, a reliable automatic evaluation metric is the first and the most crucial one. However, existing metrics either neglect the intrinsic cause of the factual inconsistency or rely on auxiliary tasks, leading to an unsatisfied correlation with human judgments or increasing the inconvenience of usage in practice. In light of these challenges, we propose a novel metric to evaluate the factual consistency in text summarization via counterfactual estimation, which formulates the causal relationship among the source document, the generated summary, and the language prior. We remove the effect of language prior, which can cause factual inconsistency, from the total causal effect on the generated summary, and provides a simple yet effective way to evaluate consistency without relying on other auxiliary tasks. We conduct a series of experiments on three public abstractive text summarization datasets, and demonstrate the advantages of the proposed metric in both improving the correlation with human judgments and the convenience of usage. The source code is available at https://github.com/xieyxclack/factual_coco. 2021-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9144 info:doi/10.18653/v1/2021.findings-emnlp.10 https://ink.library.smu.edu.sg/context/sis_research/article/10147/viewcontent/2021.findings_emnlp.10.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Databases and Information Systems
spellingShingle	Databases and Information Systems XIE, Yuexiang SUN, Fei DENG, Yang LI, Yaliang DING, Bolin Factual consistency evaluation for text summarization via counterfactual estimation
description	Despite significant progress has been achieved in text summarization, factual inconsistency in generated summaries still severely limits its practical applications. Among the key factors to ensure factual consistency, a reliable automatic evaluation metric is the first and the most crucial one. However, existing metrics either neglect the intrinsic cause of the factual inconsistency or rely on auxiliary tasks, leading to an unsatisfied correlation with human judgments or increasing the inconvenience of usage in practice. In light of these challenges, we propose a novel metric to evaluate the factual consistency in text summarization via counterfactual estimation, which formulates the causal relationship among the source document, the generated summary, and the language prior. We remove the effect of language prior, which can cause factual inconsistency, from the total causal effect on the generated summary, and provides a simple yet effective way to evaluate consistency without relying on other auxiliary tasks. We conduct a series of experiments on three public abstractive text summarization datasets, and demonstrate the advantages of the proposed metric in both improving the correlation with human judgments and the convenience of usage. The source code is available at https://github.com/xieyxclack/factual_coco.
format	text
author	XIE, Yuexiang SUN, Fei DENG, Yang LI, Yaliang DING, Bolin
author_facet	XIE, Yuexiang SUN, Fei DENG, Yang LI, Yaliang DING, Bolin
author_sort	XIE, Yuexiang
title	Factual consistency evaluation for text summarization via counterfactual estimation
title_short	Factual consistency evaluation for text summarization via counterfactual estimation
title_full	Factual consistency evaluation for text summarization via counterfactual estimation
title_fullStr	Factual consistency evaluation for text summarization via counterfactual estimation
title_full_unstemmed	Factual consistency evaluation for text summarization via counterfactual estimation
title_sort	factual consistency evaluation for text summarization via counterfactual estimation
publisher	Institutional Knowledge at Singapore Management University
publishDate	2021
url	https://ink.library.smu.edu.sg/sis_research/9144 https://ink.library.smu.edu.sg/context/sis_research/article/10147/viewcontent/2021.findings_emnlp.10.pdf
_version_	1814047754787225600

Factual consistency evaluation for text summarization via counterfactual estimation

Similar Items