Semi-supervised generative open information extraction

Open Information Extraction (OpenIE) extracts facts in the form of n-ary relation tuples, i.e., (arg1, predicate, arg2, …, argn), from unstructured text without relying on predefined ontology schema. It has the potential to handle heterogeneous corpora with minimal human intervention. With OpenIE, W...

Full description

Saved in:
Bibliographic Details
Main Author: Zhou, Shaowen
Other Authors: Long Cheng
Format: Thesis-Master by Research
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/171930
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-171930
record_format dspace
spelling sg-ntu-dr.10356-1719302023-12-01T01:52:37Z Semi-supervised generative open information extraction Zhou, Shaowen Long Cheng School of Computer Science and Engineering c.long@ntu.edu.sg Engineering::Computer science and engineering Open Information Extraction (OpenIE) extracts facts in the form of n-ary relation tuples, i.e., (arg1, predicate, arg2, …, argn), from unstructured text without relying on predefined ontology schema. It has the potential to handle heterogeneous corpora with minimal human intervention. With OpenIE, Web-scale unconstrained IE systems can be developed to acquire large quantities of knowledge. The knowledge gathered can then be integrated and used in various natural language processing (NLP) applications, such as textual entailment, summarization, question answering, and explicit reasoning. Firstly, we surveyed neural OpenIE. Thanks to the rapid development of deep learning technologies, numerous neural OpenIE architectures have been proposed, achieving considerable performance improvement. In this survey, we provide an extensive overview of the state-of-the-art neural OpenIE models, their key design decisions, strengths, and weaknesses. Then, we discuss the limitations of current solutions and the open issues in the OpenIE problem itself. Finally, we list recent trends that could help expand its scope and applicability, setting up promising directions for future research in OpenIE. This paper is the first review of neural OpenIE. Secondly, we consider semi-supervised generative OpenIE to boost the performance by utilizing unlabeled texts. Deep learning-based OpenIE systems need more high-quality training data. Moreover, as mentioned in previous work, different OpenIE applications focus on different types of information. Existing unsupervised data bootstrapping solutions need help producing high-quality training data and being more flexible in adjusting the data to be suitable for specific applications. Recently, generative methods have shown great potential in various NLP tasks. We leverage a mean teacher-based semi-supervised learning method to boost the performance of a generative OpenIE model. We first pre-train the model using existing training data and then conduct semi-supervised training using gold data and unlabeled corpus. To filter out non-trustworthy training examples generated during the semi-supervised learning phase, we introduce a verification mechanism to improve the quality of the examples. Instead of training two separate models, we use prompts to train a joint model with extraction and verification capabilities. We conduct experiments on OpenIE benchmarks and show the promising results achieved by our proposed method. Finally, we conduct a few experiments evaluating ChatGPT’s zero-shot capability, discuss possible future work directions, and conclude the report. Master of Engineering 2023-11-16T08:47:56Z 2023-11-16T08:47:56Z 2023 Thesis-Master by Research Zhou, S. (2023). Semi-supervised generative open information extraction. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171930 https://hdl.handle.net/10356/171930 10.32657/10356/171930 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Zhou, Shaowen
Semi-supervised generative open information extraction
description Open Information Extraction (OpenIE) extracts facts in the form of n-ary relation tuples, i.e., (arg1, predicate, arg2, …, argn), from unstructured text without relying on predefined ontology schema. It has the potential to handle heterogeneous corpora with minimal human intervention. With OpenIE, Web-scale unconstrained IE systems can be developed to acquire large quantities of knowledge. The knowledge gathered can then be integrated and used in various natural language processing (NLP) applications, such as textual entailment, summarization, question answering, and explicit reasoning. Firstly, we surveyed neural OpenIE. Thanks to the rapid development of deep learning technologies, numerous neural OpenIE architectures have been proposed, achieving considerable performance improvement. In this survey, we provide an extensive overview of the state-of-the-art neural OpenIE models, their key design decisions, strengths, and weaknesses. Then, we discuss the limitations of current solutions and the open issues in the OpenIE problem itself. Finally, we list recent trends that could help expand its scope and applicability, setting up promising directions for future research in OpenIE. This paper is the first review of neural OpenIE. Secondly, we consider semi-supervised generative OpenIE to boost the performance by utilizing unlabeled texts. Deep learning-based OpenIE systems need more high-quality training data. Moreover, as mentioned in previous work, different OpenIE applications focus on different types of information. Existing unsupervised data bootstrapping solutions need help producing high-quality training data and being more flexible in adjusting the data to be suitable for specific applications. Recently, generative methods have shown great potential in various NLP tasks. We leverage a mean teacher-based semi-supervised learning method to boost the performance of a generative OpenIE model. We first pre-train the model using existing training data and then conduct semi-supervised training using gold data and unlabeled corpus. To filter out non-trustworthy training examples generated during the semi-supervised learning phase, we introduce a verification mechanism to improve the quality of the examples. Instead of training two separate models, we use prompts to train a joint model with extraction and verification capabilities. We conduct experiments on OpenIE benchmarks and show the promising results achieved by our proposed method. Finally, we conduct a few experiments evaluating ChatGPT’s zero-shot capability, discuss possible future work directions, and conclude the report.
author2 Long Cheng
author_facet Long Cheng
Zhou, Shaowen
format Thesis-Master by Research
author Zhou, Shaowen
author_sort Zhou, Shaowen
title Semi-supervised generative open information extraction
title_short Semi-supervised generative open information extraction
title_full Semi-supervised generative open information extraction
title_fullStr Semi-supervised generative open information extraction
title_full_unstemmed Semi-supervised generative open information extraction
title_sort semi-supervised generative open information extraction
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/171930
_version_ 1784855574898278400