R2F: A general retrieval, reading and fusion framework for document-level natural language inference
Document-level natural language inference (DocNLI) is a new challenging task in natural language processing, aiming at judging the entailment relationship between a pair of hypothesis and premise documents. Current datasets and baselines largely follow sentence-level settings, but fail to address th...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7481 https://ink.library.smu.edu.sg/context/sis_research/article/8484/viewcontent/docnli_emnlp2022.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8484 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-84842023-03-22T03:45:18Z R2F: A general retrieval, reading and fusion framework for document-level natural language inference WANG, Hao CAO, Yixin LI, Yangguang HUANG, Zhen WANG, Kun SHAO, Jing Document-level natural language inference (DocNLI) is a new challenging task in natural language processing, aiming at judging the entailment relationship between a pair of hypothesis and premise documents. Current datasets and baselines largely follow sentence-level settings, but fail to address the issues raised by longer documents. In this paper, we establish a general solution, named Retrieval, Reading and Fusion (R2F) framework, and a new setting, by analyzing the main challenges of DocNLI: interpretability, long-range dependency, and cross-sentence inference. The basic idea of the framework is to simplify document-level task into a set of sentence-level tasks, and improve both performance and interpretability with the power of evidence. For each hypothesis sentence, the framework retrieves evidence sentences from the premise, and reads to estimate its credibility. Then the sentence-level results are fused to judge the relationship between the documents. For the setting, we contribute complementary evidence and entailment label annotation on hypothesis sentences, for interpretability study. Our experimental results show that R2F framework can obtain state-of-the-art performance and is robust for diverse evidence retrieval methods. 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7481 https://ink.library.smu.edu.sg/context/sis_research/article/8484/viewcontent/docnli_emnlp2022.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Natural language processing systems Language processing Interpretability Databases and Information Systems Programming Languages and Compilers |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Natural language processing systems Language processing Interpretability Databases and Information Systems Programming Languages and Compilers |
spellingShingle |
Natural language processing systems Language processing Interpretability Databases and Information Systems Programming Languages and Compilers WANG, Hao CAO, Yixin LI, Yangguang HUANG, Zhen WANG, Kun SHAO, Jing R2F: A general retrieval, reading and fusion framework for document-level natural language inference |
description |
Document-level natural language inference (DocNLI) is a new challenging task in natural language processing, aiming at judging the entailment relationship between a pair of hypothesis and premise documents. Current datasets and baselines largely follow sentence-level settings, but fail to address the issues raised by longer documents. In this paper, we establish a general solution, named Retrieval, Reading and Fusion (R2F) framework, and a new setting, by analyzing the main challenges of DocNLI: interpretability, long-range dependency, and cross-sentence inference. The basic idea of the framework is to simplify document-level task into a set of sentence-level tasks, and improve both performance and interpretability with the power of evidence. For each hypothesis sentence, the framework retrieves evidence sentences from the premise, and reads to estimate its credibility. Then the sentence-level results are fused to judge the relationship between the documents. For the setting, we contribute complementary evidence and entailment label annotation on hypothesis sentences, for interpretability study. Our experimental results show that R2F framework can obtain state-of-the-art performance and is robust for diverse evidence retrieval methods. |
format |
text |
author |
WANG, Hao CAO, Yixin LI, Yangguang HUANG, Zhen WANG, Kun SHAO, Jing |
author_facet |
WANG, Hao CAO, Yixin LI, Yangguang HUANG, Zhen WANG, Kun SHAO, Jing |
author_sort |
WANG, Hao |
title |
R2F: A general retrieval, reading and fusion framework for document-level natural language inference |
title_short |
R2F: A general retrieval, reading and fusion framework for document-level natural language inference |
title_full |
R2F: A general retrieval, reading and fusion framework for document-level natural language inference |
title_fullStr |
R2F: A general retrieval, reading and fusion framework for document-level natural language inference |
title_full_unstemmed |
R2F: A general retrieval, reading and fusion framework for document-level natural language inference |
title_sort |
r2f: a general retrieval, reading and fusion framework for document-level natural language inference |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7481 https://ink.library.smu.edu.sg/context/sis_research/article/8484/viewcontent/docnli_emnlp2022.pdf |
_version_ |
1770576354096447488 |