Chart-to-text : a large-scale benchmark for chart summarisation
Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-T...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/153167 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-153167 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1531672021-11-14T12:02:08Z Chart-to-text : a large-scale benchmark for chart summarisation Leong, Tiffany Ko Rixie Joty Shafiq Rayhan School of Computer Science and Engineering srjoty@ntu.edu.sg Engineering::Computer science and engineering Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights. In this report, we present Chart-to-Text, a large-scale benchmark with two datasets containing a total of 44,096 charts covering a diverse range of topics and chart types. We present the dataset construction process and an analysis of the datasets. We also formally define the Chart-to-Text task with two variations: one which assumes the availability of the underlying data table and another which does not. To tackle this problem, we introduce several state-of-the-art neural models, that utilise computer vision and data-to-text generation techniques, as baselines. Through a combination of automatic and human evaluation, we show that while our best models usually generate fluent summaries and yield reasonable BLEU scores, they unfortunately suffer from hallucinations and factual errors as well as difficulties in accurately describing complex patterns and trends in charts. Bachelor of Engineering (Computer Science) 2021-11-14T12:02:08Z 2021-11-14T12:02:08Z 2021 Final Year Project (FYP) Leong, T. K. R. (2021). Chart-to-text : a large-scale benchmark for chart summarisation. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153167 https://hdl.handle.net/10356/153167 en SCSE20-0768 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Leong, Tiffany Ko Rixie Chart-to-text : a large-scale benchmark for chart summarisation |
description |
Charts are popularly used for communicating insights and exploring data. However, interpreting charts can be a challenging task. Automatically generated natural language summaries can help readers more easily understand charts by identifying the key insights.
In this report, we present Chart-to-Text, a large-scale benchmark with two datasets containing a total of 44,096 charts covering a diverse range of topics and chart types. We present the dataset construction process and an analysis of the datasets. We also formally define the Chart-to-Text task with two variations: one which assumes the availability of the underlying data table and another which does not. To tackle this problem, we introduce several state-of-the-art neural models, that utilise computer vision and data-to-text generation techniques, as baselines. Through a combination of automatic and human evaluation, we show that while our best models usually generate fluent summaries and yield reasonable BLEU scores, they unfortunately suffer from hallucinations and factual errors as well as difficulties in accurately describing complex patterns and trends in charts. |
author2 |
Joty Shafiq Rayhan |
author_facet |
Joty Shafiq Rayhan Leong, Tiffany Ko Rixie |
format |
Final Year Project |
author |
Leong, Tiffany Ko Rixie |
author_sort |
Leong, Tiffany Ko Rixie |
title |
Chart-to-text : a large-scale benchmark for chart summarisation |
title_short |
Chart-to-text : a large-scale benchmark for chart summarisation |
title_full |
Chart-to-text : a large-scale benchmark for chart summarisation |
title_fullStr |
Chart-to-text : a large-scale benchmark for chart summarisation |
title_full_unstemmed |
Chart-to-text : a large-scale benchmark for chart summarisation |
title_sort |
chart-to-text : a large-scale benchmark for chart summarisation |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/153167 |
_version_ |
1718368068576477184 |