Chart-to-text : a large-scale benchmark for chart summarisation

Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-T...

Full description

Saved in:
Bibliographic Details
Main Author: Leong, Tiffany Ko Rixie
Other Authors: Joty Shafiq Rayhan
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/153167
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-153167
record_format dspace
spelling sg-ntu-dr.10356-1531672021-11-14T12:02:08Z Chart-to-text : a large-scale benchmark for chart summarisation Leong, Tiffany Ko Rixie Joty Shafiq Rayhan School of Computer Science and Engineering srjoty@ntu.edu.sg Engineering::Computer science and engineering Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-Text, a large-scale benchmark with two datasets containing a total of 44,096 charts covering a diverse range of topics and chart types. We present the dataset construction process and an analysis of the datasets. We also formally define the Chart-to-Text task with two variations: one which assumes the availability of the underlying data table and another which does not. To tackle this problem, we introduce several state-of-the-art neural models, that utilise computer vision and data-to-text generation techniques, as baselines. Through a combination of automatic and human evaluation, we show that while our best models usually generate fluent summaries and yield reasonable BLEU scores, they unfortunately suffer from hallucinations and factual errors as well as difficulties in accurately describing complex patterns and trends in charts. Bachelor of Engineering (Computer Science) 2021-11-14T12:02:08Z 2021-11-14T12:02:08Z 2021 Final Year Project (FYP) Leong, T. K. R. (2021). Chart-to-text : a large-scale benchmark for chart summarisation. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153167 https://hdl.handle.net/10356/153167 en SCSE20-0768 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Leong, Tiffany Ko Rixie
Chart-to-text : a large-scale benchmark for chart summarisation
description Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-Text, a large-scale benchmark with two datasets containing a total of 44,096 charts covering a diverse range of topics and chart types. We present the dataset construction process and an analysis of the datasets. We also formally define the Chart-to-Text task with two variations: one which assumes the availability of the underlying data table and another which does not. To tackle this problem, we introduce several state-of-the-art neural models, that utilise computer vision and data-to-text generation techniques, as baselines. Through a combination of automatic and human evaluation, we show that while our best models usually generate fluent summaries and yield reasonable BLEU scores, they unfortunately suffer from hallucinations and factual errors as well as difficulties in accurately describing complex patterns and trends in charts.
author2 Joty Shafiq Rayhan
author_facet Joty Shafiq Rayhan
Leong, Tiffany Ko Rixie
format Final Year Project
author Leong, Tiffany Ko Rixie
author_sort Leong, Tiffany Ko Rixie
title Chart-to-text : a large-scale benchmark for chart summarisation
title_short Chart-to-text : a large-scale benchmark for chart summarisation
title_full Chart-to-text : a large-scale benchmark for chart summarisation
title_fullStr Chart-to-text : a large-scale benchmark for chart summarisation
title_full_unstemmed Chart-to-text : a large-scale benchmark for chart summarisation
title_sort chart-to-text : a large-scale benchmark for chart summarisation
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/153167
_version_ 1718368068576477184