Semi-supervised generative open information extraction
Open Information Extraction (OpenIE) extracts facts in the form of n-ary relation tuples, i.e., (arg1, predicate, arg2, …, argn), from unstructured text without relying on predefined ontology schema. It has the potential to handle heterogeneous corpora with minimal human intervention. With OpenIE, W...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171930 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-171930 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1719302023-12-01T01:52:37Z Semi-supervised generative open information extraction Zhou, Shaowen Long Cheng School of Computer Science and Engineering c.long@ntu.edu.sg Engineering::Computer science and engineering Open Information Extraction (OpenIE) extracts facts in the form of n-ary relation tuples, i.e., (arg1, predicate, arg2, …, argn), from unstructured text without relying on predefined ontology schema. It has the potential to handle heterogeneous corpora with minimal human intervention. With OpenIE, Web-scale unconstrained IE systems can be developed to acquire large quantities of knowledge. The knowledge gathered can then be integrated and used in various natural language processing (NLP) applications, such as textual entailment, summarization, question answering, and explicit reasoning. Firstly, we surveyed neural OpenIE. Thanks to the rapid development of deep learning technologies, numerous neural OpenIE architectures have been proposed, achieving considerable performance improvement. In this survey, we provide an extensive overview of the state-of-the-art neural OpenIE models, their key design decisions, strengths, and weaknesses. Then, we discuss the limitations of current solutions and the open issues in the OpenIE problem itself. Finally, we list recent trends that could help expand its scope and applicability, setting up promising directions for future research in OpenIE. This paper is the first review of neural OpenIE. Secondly, we consider semi-supervised generative OpenIE to boost the performance by utilizing unlabeled texts. Deep learning-based OpenIE systems need more high-quality training data. Moreover, as mentioned in previous work, different OpenIE applications focus on different types of information. Existing unsupervised data bootstrapping solutions need help producing high-quality training data and being more flexible in adjusting the data to be suitable for specific applications. Recently, generative methods have shown great potential in various NLP tasks. We leverage a mean teacher-based semi-supervised learning method to boost the performance of a generative OpenIE model. We first pre-train the model using existing training data and then conduct semi-supervised training using gold data and unlabeled corpus. To filter out non-trustworthy training examples generated during the semi-supervised learning phase, we introduce a verification mechanism to improve the quality of the examples. Instead of training two separate models, we use prompts to train a joint model with extraction and verification capabilities. We conduct experiments on OpenIE benchmarks and show the promising results achieved by our proposed method. Finally, we conduct a few experiments evaluating ChatGPT’s zero-shot capability, discuss possible future work directions, and conclude the report. Master of Engineering 2023-11-16T08:47:56Z 2023-11-16T08:47:56Z 2023 Thesis-Master by Research Zhou, S. (2023). Semi-supervised generative open information extraction. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171930 https://hdl.handle.net/10356/171930 10.32657/10356/171930 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Zhou, Shaowen Semi-supervised generative open information extraction |
description |
Open Information Extraction (OpenIE) extracts facts in the form of n-ary relation tuples, i.e., (arg1, predicate, arg2, …, argn), from unstructured text without relying on predefined ontology schema. It has the potential to handle heterogeneous corpora with minimal human intervention. With OpenIE, Web-scale unconstrained IE systems can be developed to acquire large quantities of knowledge. The knowledge gathered can then be integrated and used in various natural language processing (NLP) applications, such as textual entailment, summarization, question answering, and explicit reasoning.
Firstly, we surveyed neural OpenIE. Thanks to the rapid development of deep learning technologies, numerous neural OpenIE architectures have been proposed, achieving considerable performance improvement. In this survey, we provide an
extensive overview of the state-of-the-art neural OpenIE models, their key design decisions, strengths, and weaknesses. Then, we discuss the limitations of current solutions and the open issues in the OpenIE problem itself. Finally, we list recent trends that could help expand its scope and applicability, setting up promising directions for future research in OpenIE. This paper is the first review of neural OpenIE.
Secondly, we consider semi-supervised generative OpenIE to boost the performance by utilizing unlabeled texts. Deep learning-based OpenIE systems need more high-quality training data. Moreover, as mentioned in previous work, different OpenIE applications focus on different types of information. Existing unsupervised data bootstrapping solutions need help producing high-quality training data and being more flexible in adjusting the data to be suitable for specific applications. Recently, generative methods have shown great potential in various NLP tasks. We leverage a mean teacher-based semi-supervised learning method to boost the performance of a generative OpenIE model. We first pre-train the model using existing training data and then conduct semi-supervised training using gold data and unlabeled corpus. To filter out non-trustworthy training examples generated during the semi-supervised learning phase, we introduce a verification mechanism to improve the quality of the examples. Instead of training two separate models, we use prompts to train a joint model with extraction and verification capabilities. We conduct experiments on OpenIE benchmarks and show the promising results achieved by our proposed method.
Finally, we conduct a few experiments evaluating ChatGPT’s zero-shot capability, discuss possible future work directions, and conclude the report. |
author2 |
Long Cheng |
author_facet |
Long Cheng Zhou, Shaowen |
format |
Thesis-Master by Research |
author |
Zhou, Shaowen |
author_sort |
Zhou, Shaowen |
title |
Semi-supervised generative open information extraction |
title_short |
Semi-supervised generative open information extraction |
title_full |
Semi-supervised generative open information extraction |
title_fullStr |
Semi-supervised generative open information extraction |
title_full_unstemmed |
Semi-supervised generative open information extraction |
title_sort |
semi-supervised generative open information extraction |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/171930 |
_version_ |
1784855574898278400 |