CoVariance-based causal debiasing for entity and relation extraction
Joint entity and relation extraction tasks aim to recognize named entities and extract relations simultaneously. Suffering from a variety of data biases, such as data selection bias, and distribution bias (out of distribution, long-tail distribution), serious concerns can be witnessed to threaten th...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8395 https://ink.library.smu.edu.sg/context/sis_research/article/9398/viewcontent/2023.findings_emnlp.173.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9398 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-93982024-01-09T03:52:45Z CoVariance-based causal debiasing for entity and relation extraction REN, Lin LIU, Yongbin CAO, Yixin OUYANG, Chunping Joint entity and relation extraction tasks aim to recognize named entities and extract relations simultaneously. Suffering from a variety of data biases, such as data selection bias, and distribution bias (out of distribution, long-tail distribution), serious concerns can be witnessed to threaten the model’s transferability, robustness, and generalization. In this work, we address the above problems from a causality perspective. We propose a novel causal framework called covariance and variance optimization framework (OVO) to optimize feature representations and conduct general debiasing. In particular, the proposed covariance optimizing (COP) minimizes characterizing features’ covariance for alleviating the selection and distribution bias and enhances feature representation in the feature space. Furthermore, based on the causal backdoor adjustment, we propose variance optimizing (VOP) separates samples in terms of label information and minimizes the variance of each dimension in the feature vectors of the same class label for mitigating the distribution bias further. By applying it to three strong baselines in two widely used datasets, the results demonstrate the effectiveness and generalization of OVO for joint entity and relation extraction tasks. Furthermore, a fine-grained analysis reveals that OVO possesses the capability to mitigate the impact of long-tail distribution. The code is available at https://github.com/HomuraT/OVO. 2023-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8395 info:doi/10.18653/v1/2023.findings-emnlp.173 https://ink.library.smu.edu.sg/context/sis_research/article/9398/viewcontent/2023.findings_emnlp.173.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems REN, Lin LIU, Yongbin CAO, Yixin OUYANG, Chunping CoVariance-based causal debiasing for entity and relation extraction |
description |
Joint entity and relation extraction tasks aim to recognize named entities and extract relations simultaneously. Suffering from a variety of data biases, such as data selection bias, and distribution bias (out of distribution, long-tail distribution), serious concerns can be witnessed to threaten the model’s transferability, robustness, and generalization. In this work, we address the above problems from a causality perspective. We propose a novel causal framework called covariance and variance optimization framework (OVO) to optimize feature representations and conduct general debiasing. In particular, the proposed covariance optimizing (COP) minimizes characterizing features’ covariance for alleviating the selection and distribution bias and enhances feature representation in the feature space. Furthermore, based on the causal backdoor adjustment, we propose variance optimizing (VOP) separates samples in terms of label information and minimizes the variance of each dimension in the feature vectors of the same class label for mitigating the distribution bias further. By applying it to three strong baselines in two widely used datasets, the results demonstrate the effectiveness and generalization of OVO for joint entity and relation extraction tasks. Furthermore, a fine-grained analysis reveals that OVO possesses the capability to mitigate the impact of long-tail distribution. The code is available at https://github.com/HomuraT/OVO. |
format |
text |
author |
REN, Lin LIU, Yongbin CAO, Yixin OUYANG, Chunping |
author_facet |
REN, Lin LIU, Yongbin CAO, Yixin OUYANG, Chunping |
author_sort |
REN, Lin |
title |
CoVariance-based causal debiasing for entity and relation extraction |
title_short |
CoVariance-based causal debiasing for entity and relation extraction |
title_full |
CoVariance-based causal debiasing for entity and relation extraction |
title_fullStr |
CoVariance-based causal debiasing for entity and relation extraction |
title_full_unstemmed |
CoVariance-based causal debiasing for entity and relation extraction |
title_sort |
covariance-based causal debiasing for entity and relation extraction |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8395 https://ink.library.smu.edu.sg/context/sis_research/article/9398/viewcontent/2023.findings_emnlp.173.pdf |
_version_ |
1787590768324509696 |