Deep neural networks for identifying causal relations in texts
Causal relation identification is an important task that facilitates many downstream tasks such as why-question answering, event prediction, information retrieval, sentiment analysis, etc. The goal of causal relation identification is to determine whether there's causal relation between 2 event...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171180 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-171180 |
---|---|
record_format |
dspace |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Chen, Siyuan Deep neural networks for identifying causal relations in texts |
description |
Causal relation identification is an important task that facilitates many downstream tasks such as why-question answering, event prediction, information retrieval, sentiment analysis, etc. The goal of causal relation identification is to determine whether there's causal relation between 2 events or entities. With the prosperity of deep learning, end-to-end deep neural models have made great progress. Such approaches mainly focus on learning causal patterns from the training data, i.e. data-oriented. However, causal relation identification is a highly difficult task due to the flexible linguistic expressions and the background knowledge needed for identification. Despite of the necessity of learning from training data, only relying on it may suffer from poor generalizability. Hence, it's important to incorporate external knowledge into the neural network to improve its generalizability. In this thesis, we explore this idea in 2 subfields of causal relation identification, namely emotion-cause pair extraction and event causality identification.
Emotion-Cause Pair Extraction (ECPE) aims to extract emotion and cause clauses underlying a text and pair them. Most of the recent approaches to this problem adopt deep neural networks to learn the inter-clause dependencies from training data, without making full use of information of all granularities and external knowledge. In this thesis, we propose a model that utilizes external knowledge and multi-granular information, including word-level, clause-level, and document-level information, to facilitate emotion-cause pair extraction. More specifically, at clause-level, two fully-connected clause graphs are constructed, namely emotion graph and cause graph. Graph attention is applied to learn emotion-specific and cause-specific representations which are then used to generate document-level representations. To further exploit the mutual indication between emotion and cause clauses, a cross-graph co-attention mechanism is proposed. At word-level, external knowledge of emotional and causal cues is incorporated to provide word-level indicative information for emotion-cause pair extraction. The proposed model is tested on both Chinese [1] and English [2] datasets, and the results show that our model achieves state-of-the-art performance on both datasets.
Event Causality Identification (ECI) aims at detecting the causal relation between 2 events, which is a challenging task due to the complexity of causal expressions and the background knowledge needed for identifying certain causal relations. Considerable work has been done on the learning of context and the incorporation of external knowledge. However, few of them combines the data-oriented model with both explicit and implicit causal knowledge. In this thesis, we propose an integrative model to integrate causal indicator words, implicit causal knowledge and data-oriented model to enhance causality identification. For the data-oriented model, an event pair graph is constructed and Relational Graph Convolutional Network (R-GCN) is employed to better capture interactions between individual pairs. For the causal indicators, their word embeddings are used to initialize the filters of convolutional neural network so as to capture the clues indicating causal relation. For the causal facts, we design a cause-effect matching mechanism to measure the possibility of causal relation holding between 2 events, based on the possible causes and effects automatically generated by COMET. The proposed method is evaluated on 3 datasets [3-5], and experimental results demonstrate the effectiveness of the proposed method. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Chen, Siyuan |
format |
Thesis-Master by Research |
author |
Chen, Siyuan |
author_sort |
Chen, Siyuan |
title |
Deep neural networks for identifying causal relations in texts |
title_short |
Deep neural networks for identifying causal relations in texts |
title_full |
Deep neural networks for identifying causal relations in texts |
title_fullStr |
Deep neural networks for identifying causal relations in texts |
title_full_unstemmed |
Deep neural networks for identifying causal relations in texts |
title_sort |
deep neural networks for identifying causal relations in texts |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/171180 |
_version_ |
1781793797564268544 |
spelling |
sg-ntu-dr.10356-1711802023-11-02T02:20:48Z Deep neural networks for identifying causal relations in texts Chen, Siyuan Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Causal relation identification is an important task that facilitates many downstream tasks such as why-question answering, event prediction, information retrieval, sentiment analysis, etc. The goal of causal relation identification is to determine whether there's causal relation between 2 events or entities. With the prosperity of deep learning, end-to-end deep neural models have made great progress. Such approaches mainly focus on learning causal patterns from the training data, i.e. data-oriented. However, causal relation identification is a highly difficult task due to the flexible linguistic expressions and the background knowledge needed for identification. Despite of the necessity of learning from training data, only relying on it may suffer from poor generalizability. Hence, it's important to incorporate external knowledge into the neural network to improve its generalizability. In this thesis, we explore this idea in 2 subfields of causal relation identification, namely emotion-cause pair extraction and event causality identification. Emotion-Cause Pair Extraction (ECPE) aims to extract emotion and cause clauses underlying a text and pair them. Most of the recent approaches to this problem adopt deep neural networks to learn the inter-clause dependencies from training data, without making full use of information of all granularities and external knowledge. In this thesis, we propose a model that utilizes external knowledge and multi-granular information, including word-level, clause-level, and document-level information, to facilitate emotion-cause pair extraction. More specifically, at clause-level, two fully-connected clause graphs are constructed, namely emotion graph and cause graph. Graph attention is applied to learn emotion-specific and cause-specific representations which are then used to generate document-level representations. To further exploit the mutual indication between emotion and cause clauses, a cross-graph co-attention mechanism is proposed. At word-level, external knowledge of emotional and causal cues is incorporated to provide word-level indicative information for emotion-cause pair extraction. The proposed model is tested on both Chinese [1] and English [2] datasets, and the results show that our model achieves state-of-the-art performance on both datasets. Event Causality Identification (ECI) aims at detecting the causal relation between 2 events, which is a challenging task due to the complexity of causal expressions and the background knowledge needed for identifying certain causal relations. Considerable work has been done on the learning of context and the incorporation of external knowledge. However, few of them combines the data-oriented model with both explicit and implicit causal knowledge. In this thesis, we propose an integrative model to integrate causal indicator words, implicit causal knowledge and data-oriented model to enhance causality identification. For the data-oriented model, an event pair graph is constructed and Relational Graph Convolutional Network (R-GCN) is employed to better capture interactions between individual pairs. For the causal indicators, their word embeddings are used to initialize the filters of convolutional neural network so as to capture the clues indicating causal relation. For the causal facts, we design a cause-effect matching mechanism to measure the possibility of causal relation holding between 2 events, based on the possible causes and effects automatically generated by COMET. The proposed method is evaluated on 3 datasets [3-5], and experimental results demonstrate the effectiveness of the proposed method. Master of Engineering 2023-10-18T01:09:02Z 2023-10-18T01:09:02Z 2023 Thesis-Master by Research Chen, S. (2023). Deep neural networks for identifying causal relations in texts. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171180 https://hdl.handle.net/10356/171180 10.32657/10356/171180 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |