R3: Reinforced Ranker-Reader for open-domain Question Answering
In recent years researchers have achieved considerable success applyingneural network methods to question answering (QA). These approaches haveachieved state of the art results in simplified closed-domain settings such asthe SQuAD (Rajpurkar et al., 2016) dataset, which provides a pre-selectedpassag...
Saved in:
Main Authors: | , , , , , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2018
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4237 https://ink.library.smu.edu.sg/context/sis_research/article/5240/viewcontent/Reinforced_Reader_Ranker_2018_AAAI_afv.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Summary: | In recent years researchers have achieved considerable success applyingneural network methods to question answering (QA). These approaches haveachieved state of the art results in simplified closed-domain settings such asthe SQuAD (Rajpurkar et al., 2016) dataset, which provides a pre-selectedpassage, from which the answer to a given question may be extracted. Morerecently, researchers have begun to tackle open-domain QA, in which the modelis given a question and access to a large corpus (e.g., wikipedia) instead of apre-selected passage (Chen et al., 2017a). This setting is more complex as itrequires large-scale search for relevant passages by an information retrievalcomponent, combined with a reading comprehension model that "reads" thepassages to generate an answer to the question. Performance in this settinglags considerably behind closed-domain performance. In this paper, we present anovel open-domain QA system called Reinforced Ranker-Reader (R3), based ontwo algorithmic innovations. First, we propose a new pipeline for open-domainQA with a Ranker component, which learns to rank retrieved passages in terms oflikelihood of generating the ground-truth answer to a given question. Second,we propose a novel method that jointly trains the Ranker along with ananswer-generation Reader model, based on reinforcement learning. We reportextensive experimental results showing that our method significantly improveson the state of the art for multiple open-domain QA datasets. |
---|