ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction

arge language models (LLMs), such as GPT-3 and ChatGPT, have demonstrated remarkable results in various natural language processing (NLP) tasks with in-context learning, which involves inference based on a few demonstration examples. Despite their successes in NLP tasks, no investigation has been co...

Full description

Saved in:
Bibliographic Details
Main Authors: HE, Jiabang, WANG, Lei, HU, Yi, LIU, Ning, LIU, Hui, XU, Xing, SHEN, Heng Tao
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8718
https://ink.library.smu.edu.sg/context/sis_research/article/9721/viewcontent/He_ICL_D3IE_In_Context_Learning_with_Diverse_Demonstrations_Updating_for_Document_Information_ICCV_2023_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9721
record_format dspace
spelling sg-smu-ink.sis_research-97212024-04-15T08:35:37Z ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction HE, Jiabang WANG, Lei HU, Yi LIU, Ning LIU, Hui XU, Xing SHEN, Heng Tao arge language models (LLMs), such as GPT-3 and ChatGPT, have demonstrated remarkable results in various natural language processing (NLP) tasks with in-context learning, which involves inference based on a few demonstration examples. Despite their successes in NLP tasks, no investigation has been conducted to assess the ability of LLMs to perform document information extraction (DIE) using in-context learning. Applying LLMs to DIE poses two challenges: the modality and task gap. To this end, we propose a simple but effective in-context learning framework called ICL-D3IE, which enables LLMs to perform DIE with different types of demonstration examples. Specifically, we extract the most difficult and distinct segments from hard training documents as hard demonstrations for benefiting all test instances. We design demonstrations describing relationships that enable LLMs to understand positional relationships. We introduce formatting demonstrations for easy answer extraction. Additionally, the framework improves diverse demonstrations by updating them iteratively. Our experiments on three widely used benchmark datasets demonstrate that the ICL-D3IE framework enables Davinci-003/ChatGPT to achieve superior performance when compared to previous pre-trained methods fine-tuned with full training in both the in-distribution (ID) setting and in the out-of-distribution (OOD) setting. Code is available at https://github.com/MAEHCM/ICL-D3IE. 2023-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8718 info:doi/10.1109/ICCV51070.2023.01785 https://ink.library.smu.edu.sg/context/sis_research/article/9721/viewcontent/He_ICL_D3IE_In_Context_Learning_with_Diverse_Demonstrations_Updating_for_Document_Information_ICCV_2023_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
Numerical Analysis and Scientific Computing
spellingShingle Artificial Intelligence and Robotics
Numerical Analysis and Scientific Computing
HE, Jiabang
WANG, Lei
HU, Yi
LIU, Ning
LIU, Hui
XU, Xing
SHEN, Heng Tao
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
description arge language models (LLMs), such as GPT-3 and ChatGPT, have demonstrated remarkable results in various natural language processing (NLP) tasks with in-context learning, which involves inference based on a few demonstration examples. Despite their successes in NLP tasks, no investigation has been conducted to assess the ability of LLMs to perform document information extraction (DIE) using in-context learning. Applying LLMs to DIE poses two challenges: the modality and task gap. To this end, we propose a simple but effective in-context learning framework called ICL-D3IE, which enables LLMs to perform DIE with different types of demonstration examples. Specifically, we extract the most difficult and distinct segments from hard training documents as hard demonstrations for benefiting all test instances. We design demonstrations describing relationships that enable LLMs to understand positional relationships. We introduce formatting demonstrations for easy answer extraction. Additionally, the framework improves diverse demonstrations by updating them iteratively. Our experiments on three widely used benchmark datasets demonstrate that the ICL-D3IE framework enables Davinci-003/ChatGPT to achieve superior performance when compared to previous pre-trained methods fine-tuned with full training in both the in-distribution (ID) setting and in the out-of-distribution (OOD) setting. Code is available at https://github.com/MAEHCM/ICL-D3IE.
format text
author HE, Jiabang
WANG, Lei
HU, Yi
LIU, Ning
LIU, Hui
XU, Xing
SHEN, Heng Tao
author_facet HE, Jiabang
WANG, Lei
HU, Yi
LIU, Ning
LIU, Hui
XU, Xing
SHEN, Heng Tao
author_sort HE, Jiabang
title ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
title_short ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
title_full ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
title_fullStr ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
title_full_unstemmed ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
title_sort icl-d3ie: in-context learning with diverse demonstrations updating for document information extraction
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8718
https://ink.library.smu.edu.sg/context/sis_research/article/9721/viewcontent/He_ICL_D3IE_In_Context_Learning_with_Diverse_Demonstrations_Updating_for_Document_Information_ICCV_2023_av.pdf
_version_ 1814047474834210816