PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9139 https://ink.library.smu.edu.sg/context/sis_research/article/10142/viewcontent/2210.08817v2.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10142 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-101422024-08-01T09:24:57Z PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance DENG, Yang LEI, Wenqiang ZHANG, Wenxuan LAM, Wai CHUA, Tat-Seng To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task is defined accordingly to study Proactive Conversational Question Answering (PCQA), which combines clarification question generation and CQA. In addition, we propose a novel method, namely UniPCQA, to adapt a hybrid format of input and output content in PCQA into the Seq2Seq problem, including the reformulation of the numerical reasoning process as code generation. UniPCQA performs multi-task learning over all sub-tasks in PCQA and incorporates a simple ensemble strategy to alleviate the error propagation issue in the multi-task learning by cross-validating top-k sampled Seq2Seq outputs. We benchmark the PACIFIC dataset with extensive baselines and provide comprehensive evaluations on each sub-task of PCQA. 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9139 info:doi/10.18653/v1/2022.emnlp-main.469 https://ink.library.smu.edu.sg/context/sis_research/article/10142/viewcontent/2210.08817v2.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Hybrid formats Key feature Multitask learning Novel methods Numerical reasoning Proactivity Question Answering Subtask Tabular data Textual data Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Hybrid formats Key feature Multitask learning Novel methods Numerical reasoning Proactivity Question Answering Subtask Tabular data Textual data Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Hybrid formats Key feature Multitask learning Novel methods Numerical reasoning Proactivity Question Answering Subtask Tabular data Textual data Databases and Information Systems Numerical Analysis and Scientific Computing DENG, Yang LEI, Wenqiang ZHANG, Wenxuan LAM, Wai CHUA, Tat-Seng PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance |
description |
To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task is defined accordingly to study Proactive Conversational Question Answering (PCQA), which combines clarification question generation and CQA. In addition, we propose a novel method, namely UniPCQA, to adapt a hybrid format of input and output content in PCQA into the Seq2Seq problem, including the reformulation of the numerical reasoning process as code generation. UniPCQA performs multi-task learning over all sub-tasks in PCQA and incorporates a simple ensemble strategy to alleviate the error propagation issue in the multi-task learning by cross-validating top-k sampled Seq2Seq outputs. We benchmark the PACIFIC dataset with extensive baselines and provide comprehensive evaluations on each sub-task of PCQA. |
format |
text |
author |
DENG, Yang LEI, Wenqiang ZHANG, Wenxuan LAM, Wai CHUA, Tat-Seng |
author_facet |
DENG, Yang LEI, Wenqiang ZHANG, Wenxuan LAM, Wai CHUA, Tat-Seng |
author_sort |
DENG, Yang |
title |
PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance |
title_short |
PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance |
title_full |
PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance |
title_fullStr |
PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance |
title_full_unstemmed |
PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance |
title_sort |
pacific: towards proactive conversational question answering over tabular and textual data in finance |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/9139 https://ink.library.smu.edu.sg/context/sis_research/article/10142/viewcontent/2210.08817v2.pdf |
_version_ |
1814047753358016512 |