Refining latent multi-view graph for relation extraction

Relation extraction (RE) is to predict the relation type of two entities that are mentioned in a piece of text, e.g. a sentence or a dialogue. According to various text input formats, Relation extraction can be divided into different categories, such as sentence-level RE, document-level RE and dialo...

Full description

Saved in:
Bibliographic Details
Main Author: Xue, Fuzhao
Other Authors: Chng Eng Siong
Format: Thesis-Master by Research
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/151541
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-151541
record_format dspace
spelling sg-ntu-dr.10356-1515412021-07-08T16:01:20Z Refining latent multi-view graph for relation extraction Xue, Fuzhao Chng Eng Siong Sun Aixin School of Computer Science and Engineering ASESChng@ntu.edu.sg, AXSun@ntu.edu.sg Engineering::Computer science and engineering Relation extraction (RE) is to predict the relation type of two entities that are mentioned in a piece of text, e.g. a sentence or a dialogue. According to various text input formats, Relation extraction can be divided into different categories, such as sentence-level RE, document-level RE and dialogue-level RE. In this thesis, we focus on addressing sentence-level RE and dialogue-level RE with a unified model. For both sentence-level RE and dialogue-level RE, the main challenge arise when the given text is long. It is difficult to identify indicative words for the relation prediction. Recent advances on RE tasks are from BERT-based models. These models predict relation by sentence-level semantic representation. Hierarchical representations are missing in BERT-based models so that the words weakly related to the target relation induce extra noise. Therefore, in this thesis, we propose a unified model, GDPNet (Gaussian Dynamic Time Warping Pooling Net), to improve relation extraction by identifying indicative words. GDPNet contains a BERT module and a Graph Module. In the BERT module, we obtain token representations from the raw text. Then, in the Graph Module, we propose to construct a latent multi-view graph to capture various possible relationships among tokens. We then refine this graph to select important words for relation prediction. Finally, the representation of the refined graph and the BERT-based sequence representation are concatenated for relation extraction. Specifically, in our proposed GDPNet, we utilize Gaussian Graph Generator (GGG) to generate edges of the multi-view graph. The graph is then refined by Dynamic Time Warping Pooling (DTWPool). On DialogRE and TACRED, we show that GDPNet achieves the best performance on dialogue-level RE, and comparable performance with the state-of-the-arts on sentence-level RE. We also perform a quantitative study to show that our model is capable of finding indicative words in an unsupervised manner. Master of Engineering 2021-06-25T00:03:05Z 2021-06-25T00:03:05Z 2021 Thesis-Master by Research Xue, F. (2021). Refining latent multi-view graph for relation extraction. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/151541 https://hdl.handle.net/10356/151541 10.32657/10356/151541 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Xue, Fuzhao
Refining latent multi-view graph for relation extraction
description Relation extraction (RE) is to predict the relation type of two entities that are mentioned in a piece of text, e.g. a sentence or a dialogue. According to various text input formats, Relation extraction can be divided into different categories, such as sentence-level RE, document-level RE and dialogue-level RE. In this thesis, we focus on addressing sentence-level RE and dialogue-level RE with a unified model. For both sentence-level RE and dialogue-level RE, the main challenge arise when the given text is long. It is difficult to identify indicative words for the relation prediction. Recent advances on RE tasks are from BERT-based models. These models predict relation by sentence-level semantic representation. Hierarchical representations are missing in BERT-based models so that the words weakly related to the target relation induce extra noise. Therefore, in this thesis, we propose a unified model, GDPNet (Gaussian Dynamic Time Warping Pooling Net), to improve relation extraction by identifying indicative words. GDPNet contains a BERT module and a Graph Module. In the BERT module, we obtain token representations from the raw text. Then, in the Graph Module, we propose to construct a latent multi-view graph to capture various possible relationships among tokens. We then refine this graph to select important words for relation prediction. Finally, the representation of the refined graph and the BERT-based sequence representation are concatenated for relation extraction. Specifically, in our proposed GDPNet, we utilize Gaussian Graph Generator (GGG) to generate edges of the multi-view graph. The graph is then refined by Dynamic Time Warping Pooling (DTWPool). On DialogRE and TACRED, we show that GDPNet achieves the best performance on dialogue-level RE, and comparable performance with the state-of-the-arts on sentence-level RE. We also perform a quantitative study to show that our model is capable of finding indicative words in an unsupervised manner.
author2 Chng Eng Siong
author_facet Chng Eng Siong
Xue, Fuzhao
format Thesis-Master by Research
author Xue, Fuzhao
author_sort Xue, Fuzhao
title Refining latent multi-view graph for relation extraction
title_short Refining latent multi-view graph for relation extraction
title_full Refining latent multi-view graph for relation extraction
title_fullStr Refining latent multi-view graph for relation extraction
title_full_unstemmed Refining latent multi-view graph for relation extraction
title_sort refining latent multi-view graph for relation extraction
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/151541
_version_ 1705151305149841408