Chart-to-text : a large-scale benchmark for chart summarisation

Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-T...

Full description

Saved in:
Bibliographic Details
Main Author: Leong, Tiffany Ko Rixie
Other Authors: Joty Shafiq Rayhan
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/153167
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-Text, a large-scale benchmark with two datasets containing a total of 44,096 charts covering a diverse range of topics and chart types. We present the dataset construction process and an analysis of the datasets. We also formally define the Chart-to-Text task with two variations: one which assumes the availability of the underlying data table and another which does not. To tackle this problem, we introduce several state-of-the-art neural models, that utilise computer vision and data-to-text generation techniques, as baselines. Through a combination of automatic and human evaluation, we show that while our best models usually generate fluent summaries and yield reasonable BLEU scores, they unfortunately suffer from hallucinations and factual errors as well as difficulties in accurately describing complex patterns and trends in charts.