PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance

To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task...

Full description

Saved in:
Bibliographic Details
Main Authors: DENG, Yang, LEI, Wenqiang, ZHANG, Wenxuan, LAM, Wai, CHUA, Tat-Seng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9139
https://ink.library.smu.edu.sg/context/sis_research/article/10142/viewcontent/2210.08817v2.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10142
record_format dspace
spelling sg-smu-ink.sis_research-101422024-08-01T09:24:57Z PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance DENG, Yang LEI, Wenqiang ZHANG, Wenxuan LAM, Wai CHUA, Tat-Seng To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task is defined accordingly to study Proactive Conversational Question Answering (PCQA), which combines clarification question generation and CQA. In addition, we propose a novel method, namely UniPCQA, to adapt a hybrid format of input and output content in PCQA into the Seq2Seq problem, including the reformulation of the numerical reasoning process as code generation. UniPCQA performs multi-task learning over all sub-tasks in PCQA and incorporates a simple ensemble strategy to alleviate the error propagation issue in the multi-task learning by cross-validating top-k sampled Seq2Seq outputs. We benchmark the PACIFIC dataset with extensive baselines and provide comprehensive evaluations on each sub-task of PCQA. 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9139 info:doi/10.18653/v1/2022.emnlp-main.469 https://ink.library.smu.edu.sg/context/sis_research/article/10142/viewcontent/2210.08817v2.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Hybrid formats Key feature Multitask learning Novel methods Numerical reasoning Proactivity Question Answering Subtask Tabular data Textual data Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Hybrid formats
Key feature
Multitask learning
Novel methods
Numerical reasoning
Proactivity
Question Answering
Subtask
Tabular data
Textual data
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Hybrid formats
Key feature
Multitask learning
Novel methods
Numerical reasoning
Proactivity
Question Answering
Subtask
Tabular data
Textual data
Databases and Information Systems
Numerical Analysis and Scientific Computing
DENG, Yang
LEI, Wenqiang
ZHANG, Wenxuan
LAM, Wai
CHUA, Tat-Seng
PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
description To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC exhibits three key features: (i) proactivity, (ii) numerical reasoning, and (iii) hybrid context of tables and text. A new task is defined accordingly to study Proactive Conversational Question Answering (PCQA), which combines clarification question generation and CQA. In addition, we propose a novel method, namely UniPCQA, to adapt a hybrid format of input and output content in PCQA into the Seq2Seq problem, including the reformulation of the numerical reasoning process as code generation. UniPCQA performs multi-task learning over all sub-tasks in PCQA and incorporates a simple ensemble strategy to alleviate the error propagation issue in the multi-task learning by cross-validating top-k sampled Seq2Seq outputs. We benchmark the PACIFIC dataset with extensive baselines and provide comprehensive evaluations on each sub-task of PCQA.
format text
author DENG, Yang
LEI, Wenqiang
ZHANG, Wenxuan
LAM, Wai
CHUA, Tat-Seng
author_facet DENG, Yang
LEI, Wenqiang
ZHANG, Wenxuan
LAM, Wai
CHUA, Tat-Seng
author_sort DENG, Yang
title PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
title_short PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
title_full PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
title_fullStr PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
title_full_unstemmed PACIFIC: Towards proactive conversational question answering over tabular and textual data in finance
title_sort pacific: towards proactive conversational question answering over tabular and textual data in finance
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/9139
https://ink.library.smu.edu.sg/context/sis_research/article/10142/viewcontent/2210.08817v2.pdf
_version_ 1814047753358016512