Deep neural networks for identifying causal relations in texts

Causal relation identification is an important task that facilitates many downstream tasks such as why-question answering, event prediction, information retrieval, sentiment analysis, etc. The goal of causal relation identification is to determine whether there's causal relation between 2 event...

Full description

Saved in:
Bibliographic Details
Main Author: Chen, Siyuan
Other Authors: Mao Kezhi
Format: Thesis-Master by Research
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/171180
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-171180
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Chen, Siyuan
Deep neural networks for identifying causal relations in texts
description Causal relation identification is an important task that facilitates many downstream tasks such as why-question answering, event prediction, information retrieval, sentiment analysis, etc. The goal of causal relation identification is to determine whether there's causal relation between 2 events or entities. With the prosperity of deep learning, end-to-end deep neural models have made great progress. Such approaches mainly focus on learning causal patterns from the training data, i.e. data-oriented. However, causal relation identification is a highly difficult task due to the flexible linguistic expressions and the background knowledge needed for identification. Despite of the necessity of learning from training data, only relying on it may suffer from poor generalizability. Hence, it's important to incorporate external knowledge into the neural network to improve its generalizability. In this thesis, we explore this idea in 2 subfields of causal relation identification, namely emotion-cause pair extraction and event causality identification. Emotion-Cause Pair Extraction (ECPE) aims to extract emotion and cause clauses underlying a text and pair them. Most of the recent approaches to this problem adopt deep neural networks to learn the inter-clause dependencies from training data, without making full use of information of all granularities and external knowledge. In this thesis, we propose a model that utilizes external knowledge and multi-granular information, including word-level, clause-level, and document-level information, to facilitate emotion-cause pair extraction. More specifically, at clause-level, two fully-connected clause graphs are constructed, namely emotion graph and cause graph. Graph attention is applied to learn emotion-specific and cause-specific representations which are then used to generate document-level representations. To further exploit the mutual indication between emotion and cause clauses, a cross-graph co-attention mechanism is proposed. At word-level, external knowledge of emotional and causal cues is incorporated to provide word-level indicative information for emotion-cause pair extraction. The proposed model is tested on both Chinese [1] and English [2] datasets, and the results show that our model achieves state-of-the-art performance on both datasets. Event Causality Identification (ECI) aims at detecting the causal relation between 2 events, which is a challenging task due to the complexity of causal expressions and the background knowledge needed for identifying certain causal relations. Considerable work has been done on the learning of context and the incorporation of external knowledge. However, few of them combines the data-oriented model with both explicit and implicit causal knowledge. In this thesis, we propose an integrative model to integrate causal indicator words, implicit causal knowledge and data-oriented model to enhance causality identification. For the data-oriented model, an event pair graph is constructed and Relational Graph Convolutional Network (R-GCN) is employed to better capture interactions between individual pairs. For the causal indicators, their word embeddings are used to initialize the filters of convolutional neural network so as to capture the clues indicating causal relation. For the causal facts, we design a cause-effect matching mechanism to measure the possibility of causal relation holding between 2 events, based on the possible causes and effects automatically generated by COMET. The proposed method is evaluated on 3 datasets [3-5], and experimental results demonstrate the effectiveness of the proposed method.
author2 Mao Kezhi
author_facet Mao Kezhi
Chen, Siyuan
format Thesis-Master by Research
author Chen, Siyuan
author_sort Chen, Siyuan
title Deep neural networks for identifying causal relations in texts
title_short Deep neural networks for identifying causal relations in texts
title_full Deep neural networks for identifying causal relations in texts
title_fullStr Deep neural networks for identifying causal relations in texts
title_full_unstemmed Deep neural networks for identifying causal relations in texts
title_sort deep neural networks for identifying causal relations in texts
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/171180
_version_ 1781793797564268544
spelling sg-ntu-dr.10356-1711802023-11-02T02:20:48Z Deep neural networks for identifying causal relations in texts Chen, Siyuan Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Causal relation identification is an important task that facilitates many downstream tasks such as why-question answering, event prediction, information retrieval, sentiment analysis, etc. The goal of causal relation identification is to determine whether there's causal relation between 2 events or entities. With the prosperity of deep learning, end-to-end deep neural models have made great progress. Such approaches mainly focus on learning causal patterns from the training data, i.e. data-oriented. However, causal relation identification is a highly difficult task due to the flexible linguistic expressions and the background knowledge needed for identification. Despite of the necessity of learning from training data, only relying on it may suffer from poor generalizability. Hence, it's important to incorporate external knowledge into the neural network to improve its generalizability. In this thesis, we explore this idea in 2 subfields of causal relation identification, namely emotion-cause pair extraction and event causality identification. Emotion-Cause Pair Extraction (ECPE) aims to extract emotion and cause clauses underlying a text and pair them. Most of the recent approaches to this problem adopt deep neural networks to learn the inter-clause dependencies from training data, without making full use of information of all granularities and external knowledge. In this thesis, we propose a model that utilizes external knowledge and multi-granular information, including word-level, clause-level, and document-level information, to facilitate emotion-cause pair extraction. More specifically, at clause-level, two fully-connected clause graphs are constructed, namely emotion graph and cause graph. Graph attention is applied to learn emotion-specific and cause-specific representations which are then used to generate document-level representations. To further exploit the mutual indication between emotion and cause clauses, a cross-graph co-attention mechanism is proposed. At word-level, external knowledge of emotional and causal cues is incorporated to provide word-level indicative information for emotion-cause pair extraction. The proposed model is tested on both Chinese [1] and English [2] datasets, and the results show that our model achieves state-of-the-art performance on both datasets. Event Causality Identification (ECI) aims at detecting the causal relation between 2 events, which is a challenging task due to the complexity of causal expressions and the background knowledge needed for identifying certain causal relations. Considerable work has been done on the learning of context and the incorporation of external knowledge. However, few of them combines the data-oriented model with both explicit and implicit causal knowledge. In this thesis, we propose an integrative model to integrate causal indicator words, implicit causal knowledge and data-oriented model to enhance causality identification. For the data-oriented model, an event pair graph is constructed and Relational Graph Convolutional Network (R-GCN) is employed to better capture interactions between individual pairs. For the causal indicators, their word embeddings are used to initialize the filters of convolutional neural network so as to capture the clues indicating causal relation. For the causal facts, we design a cause-effect matching mechanism to measure the possibility of causal relation holding between 2 events, based on the possible causes and effects automatically generated by COMET. The proposed method is evaluated on 3 datasets [3-5], and experimental results demonstrate the effectiveness of the proposed method. Master of Engineering 2023-10-18T01:09:02Z 2023-10-18T01:09:02Z 2023 Thesis-Master by Research Chen, S. (2023). Deep neural networks for identifying causal relations in texts. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171180 https://hdl.handle.net/10356/171180 10.32657/10356/171180 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University