Refining latent multi-view graph for relation extraction
Relation extraction (RE) is to predict the relation type of two entities that are mentioned in a piece of text, e.g. a sentence or a dialogue. According to various text input formats, Relation extraction can be divided into different categories, such as sentence-level RE, document-level RE and dialo...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/151541 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-151541 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1515412021-07-08T16:01:20Z Refining latent multi-view graph for relation extraction Xue, Fuzhao Chng Eng Siong Sun Aixin School of Computer Science and Engineering ASESChng@ntu.edu.sg, AXSun@ntu.edu.sg Engineering::Computer science and engineering Relation extraction (RE) is to predict the relation type of two entities that are mentioned in a piece of text, e.g. a sentence or a dialogue. According to various text input formats, Relation extraction can be divided into different categories, such as sentence-level RE, document-level RE and dialogue-level RE. In this thesis, we focus on addressing sentence-level RE and dialogue-level RE with a unified model. For both sentence-level RE and dialogue-level RE, the main challenge arise when the given text is long. It is difficult to identify indicative words for the relation prediction. Recent advances on RE tasks are from BERT-based models. These models predict relation by sentence-level semantic representation. Hierarchical representations are missing in BERT-based models so that the words weakly related to the target relation induce extra noise. Therefore, in this thesis, we propose a unified model, GDPNet (Gaussian Dynamic Time Warping Pooling Net), to improve relation extraction by identifying indicative words. GDPNet contains a BERT module and a Graph Module. In the BERT module, we obtain token representations from the raw text. Then, in the Graph Module, we propose to construct a latent multi-view graph to capture various possible relationships among tokens. We then refine this graph to select important words for relation prediction. Finally, the representation of the refined graph and the BERT-based sequence representation are concatenated for relation extraction. Specifically, in our proposed GDPNet, we utilize Gaussian Graph Generator (GGG) to generate edges of the multi-view graph. The graph is then refined by Dynamic Time Warping Pooling (DTWPool). On DialogRE and TACRED, we show that GDPNet achieves the best performance on dialogue-level RE, and comparable performance with the state-of-the-arts on sentence-level RE. We also perform a quantitative study to show that our model is capable of finding indicative words in an unsupervised manner. Master of Engineering 2021-06-25T00:03:05Z 2021-06-25T00:03:05Z 2021 Thesis-Master by Research Xue, F. (2021). Refining latent multi-view graph for relation extraction. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/151541 https://hdl.handle.net/10356/151541 10.32657/10356/151541 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Xue, Fuzhao Refining latent multi-view graph for relation extraction |
description |
Relation extraction (RE) is to predict the relation type of two entities that are mentioned in a piece of text, e.g. a sentence or a dialogue. According to various text input formats, Relation extraction can be divided into different categories, such as sentence-level RE, document-level RE and dialogue-level RE. In this thesis, we focus on addressing sentence-level RE and dialogue-level RE with a unified model. For both sentence-level RE and dialogue-level RE, the main challenge arise when the given text is long. It is difficult to identify indicative words for the relation prediction.
Recent advances on RE tasks are from BERT-based models. These models predict relation by sentence-level semantic representation. Hierarchical representations are missing in BERT-based models so that the words weakly related to the target relation induce extra noise. Therefore, in this thesis, we propose a unified model, GDPNet (Gaussian Dynamic Time Warping Pooling Net), to improve relation extraction by identifying indicative words. GDPNet contains a BERT module and a Graph Module. In the BERT module, we obtain token representations from the raw text. Then, in the Graph Module, we propose to construct a latent multi-view graph to capture various possible relationships among tokens. We then refine this graph to select important words for relation prediction. Finally, the representation of the refined graph and the BERT-based sequence representation are concatenated for relation extraction. Specifically, in our proposed GDPNet, we utilize Gaussian Graph Generator (GGG) to generate edges of the multi-view graph. The graph is then refined by Dynamic Time Warping Pooling (DTWPool).
On DialogRE and TACRED, we show that GDPNet achieves the best performance on dialogue-level RE, and comparable performance with the state-of-the-arts on sentence-level RE. We also perform a quantitative study to show that our model is capable of finding indicative words in an unsupervised manner. |
author2 |
Chng Eng Siong |
author_facet |
Chng Eng Siong Xue, Fuzhao |
format |
Thesis-Master by Research |
author |
Xue, Fuzhao |
author_sort |
Xue, Fuzhao |
title |
Refining latent multi-view graph for relation extraction |
title_short |
Refining latent multi-view graph for relation extraction |
title_full |
Refining latent multi-view graph for relation extraction |
title_fullStr |
Refining latent multi-view graph for relation extraction |
title_full_unstemmed |
Refining latent multi-view graph for relation extraction |
title_sort |
refining latent multi-view graph for relation extraction |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/151541 |
_version_ |
1705151305149841408 |