CoVariance-based causal debiasing for entity and relation extraction

Joint entity and relation extraction tasks aim to recognize named entities and extract relations simultaneously. Suffering from a variety of data biases, such as data selection bias, and distribution bias (out of distribution, long-tail distribution), serious concerns can be witnessed to threaten th...

Full description

Saved in:
Bibliographic Details
Main Authors: REN, Lin, LIU, Yongbin, CAO, Yixin, OUYANG, Chunping
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8395
https://ink.library.smu.edu.sg/context/sis_research/article/9398/viewcontent/2023.findings_emnlp.173.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9398
record_format dspace
spelling sg-smu-ink.sis_research-93982024-01-09T03:52:45Z CoVariance-based causal debiasing for entity and relation extraction REN, Lin LIU, Yongbin CAO, Yixin OUYANG, Chunping Joint entity and relation extraction tasks aim to recognize named entities and extract relations simultaneously. Suffering from a variety of data biases, such as data selection bias, and distribution bias (out of distribution, long-tail distribution), serious concerns can be witnessed to threaten the model’s transferability, robustness, and generalization. In this work, we address the above problems from a causality perspective. We propose a novel causal framework called covariance and variance optimization framework (OVO) to optimize feature representations and conduct general debiasing. In particular, the proposed covariance optimizing (COP) minimizes characterizing features’ covariance for alleviating the selection and distribution bias and enhances feature representation in the feature space. Furthermore, based on the causal backdoor adjustment, we propose variance optimizing (VOP) separates samples in terms of label information and minimizes the variance of each dimension in the feature vectors of the same class label for mitigating the distribution bias further. By applying it to three strong baselines in two widely used datasets, the results demonstrate the effectiveness and generalization of OVO for joint entity and relation extraction tasks. Furthermore, a fine-grained analysis reveals that OVO possesses the capability to mitigate the impact of long-tail distribution. The code is available at https://github.com/HomuraT/OVO. 2023-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8395 info:doi/10.18653/v1/2023.findings-emnlp.173 https://ink.library.smu.edu.sg/context/sis_research/article/9398/viewcontent/2023.findings_emnlp.173.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
REN, Lin
LIU, Yongbin
CAO, Yixin
OUYANG, Chunping
CoVariance-based causal debiasing for entity and relation extraction
description Joint entity and relation extraction tasks aim to recognize named entities and extract relations simultaneously. Suffering from a variety of data biases, such as data selection bias, and distribution bias (out of distribution, long-tail distribution), serious concerns can be witnessed to threaten the model’s transferability, robustness, and generalization. In this work, we address the above problems from a causality perspective. We propose a novel causal framework called covariance and variance optimization framework (OVO) to optimize feature representations and conduct general debiasing. In particular, the proposed covariance optimizing (COP) minimizes characterizing features’ covariance for alleviating the selection and distribution bias and enhances feature representation in the feature space. Furthermore, based on the causal backdoor adjustment, we propose variance optimizing (VOP) separates samples in terms of label information and minimizes the variance of each dimension in the feature vectors of the same class label for mitigating the distribution bias further. By applying it to three strong baselines in two widely used datasets, the results demonstrate the effectiveness and generalization of OVO for joint entity and relation extraction tasks. Furthermore, a fine-grained analysis reveals that OVO possesses the capability to mitigate the impact of long-tail distribution. The code is available at https://github.com/HomuraT/OVO.
format text
author REN, Lin
LIU, Yongbin
CAO, Yixin
OUYANG, Chunping
author_facet REN, Lin
LIU, Yongbin
CAO, Yixin
OUYANG, Chunping
author_sort REN, Lin
title CoVariance-based causal debiasing for entity and relation extraction
title_short CoVariance-based causal debiasing for entity and relation extraction
title_full CoVariance-based causal debiasing for entity and relation extraction
title_fullStr CoVariance-based causal debiasing for entity and relation extraction
title_full_unstemmed CoVariance-based causal debiasing for entity and relation extraction
title_sort covariance-based causal debiasing for entity and relation extraction
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8395
https://ink.library.smu.edu.sg/context/sis_research/article/9398/viewcontent/2023.findings_emnlp.173.pdf
_version_ 1787590768324509696