Language models are domain-specific chart analysts

With the advancement of multi-modal Large Language Models (LLMs) such as GPT-4, expectations for the cognitive capabilities of models are rising. Meanwhile, as LLM training becomes more expensive, a gap has opened between the conventional pretrain-finetune paradigm and the LLM prompting paradigm...

Bibliographic Details
Main Author:	Zhao, Yinjie
Other Authors:	Wen Bihan
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2023
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/167416
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-167416
record_format	dspace
spelling	sg-ntu-dr.10356-167416 2023-07-07T15:43:37Z Language models are domain-specific chart analysts Zhao, Yinjie Wen Bihan School of Electrical and Electronic Engineering bihan.wen@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence With the advancement of multi-modal Large Language Models (LLMs) such as GPT-4, expectations for the cognitive capabilities of models are rising. Meanwhile, as LLM training becomes more expensive, a gap has opened between the conventional pretrain-finetune paradigm and the LLM prompting paradigm in model design. To close this gap, we propose an AI model engineering pipeline, the Cost-efficient C2T Pipeline (C2P), aimed at building the cognitive capabilities of C2T models for Chart Domain-specific Analyzing (CDA). A 41.5 million parameter model was trained under C2P, achieving significantly higher cost-efficiency than other models while delivering comparable performance. To validate the pipeline experimentally, we propose a new dataset, EconCharts, a domain-specific dataset on economics. C2P explores the domain-specific cognitive capabilities of C2T and LLM models and aims to fill the engineering gap between expensive LLMs and lightweight C2T models. Bachelor of Engineering (Electrical and Electronic Engineering) 2023-05-28T11:04:56Z 2023-05-28T11:04:56Z 2023 Final Year Project (FYP) Zhao, Y. (2023). Language models are domain-specific chart analysts. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167416 https://hdl.handle.net/10356/167416 en A3271-221 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Zhao, Yinjie Language models are domain-specific chart analysts
description	With the advancement of multi-modal Large Language Models (LLMs) such as GPT-4, expectations for the cognitive capabilities of models are rising. Meanwhile, as LLM training becomes more expensive, a gap has opened between the conventional pretrain-finetune paradigm and the LLM prompting paradigm in model design. To close this gap, we propose an AI model engineering pipeline, the Cost-efficient C2T Pipeline (C2P), aimed at building the cognitive capabilities of C2T models for Chart Domain-specific Analyzing (CDA). A 41.5 million parameter model was trained under C2P, achieving significantly higher cost-efficiency than other models while delivering comparable performance. To validate the pipeline experimentally, we propose a new dataset, EconCharts, a domain-specific dataset on economics. C2P explores the domain-specific cognitive capabilities of C2T and LLM models and aims to fill the engineering gap between expensive LLMs and lightweight C2T models.
author2	Wen Bihan
author_facet	Wen Bihan Zhao, Yinjie
format	Final Year Project
author	Zhao, Yinjie
author_sort	Zhao, Yinjie
title	Language models are domain-specific chart analysts
title_sort	language models are domain-specific chart analysts
publisher	Nanyang Technological University
publishDate	2023
url	https://hdl.handle.net/10356/167416
_version_	1772827791225520128

Similar Items