Are missing links predictable? An inferential benchmark for knowledge graph completion

We present InferWiki, a Knowledge Graph Completion (KGC) dataset that improves upon existing benchmarks in inferential ability, assumptions, and patterns. First, each testing sample is predictable with supportive data in the training set. To ensure it, we propose to utilize rule-guided train/test ge...

Full description

Saved in:

Bibliographic Details
Main Authors:	CAO, Yixin, JI, Xiang, LV, Xin, LI, Juanzi, WEN, Yonggang, ZHANG, Hanwang
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2021
Subjects:	Databases and Information Systems Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/7316 https://ink.library.smu.edu.sg/context/sis_research/article/8319/viewcontent/2021.acl_long.534.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8319
record_format	dspace
spelling	sg-smu-ink.sis_research-83192022-09-29T06:01:14Z Are missing links predictable? An inferential benchmark for knowledge graph completion CAO, Yixin JI, Xiang LV, Xin LI, Juanzi WEN, Yonggang ZHANG, Hanwang We present InferWiki, a Knowledge Graph Completion (KGC) dataset that improves upon existing benchmarks in inferential ability, assumptions, and patterns. First, each testing sample is predictable with supportive data in the training set. To ensure it, we propose to utilize rule-guided train/test generation, instead of conventional random split. Second, InferWiki initiates the evaluation following the open-world assumption and improves the inferential difficulty of the closed-world assumption, by providing manually annotated negative and unknown triples. Third, we include various inference patterns (e.g., reasoning path length and types) for comprehensive evaluation. In experiments, we curate two settings of InferWiki varying in sizes and structures, and apply the construction process on CoDEx as comparative datasets. The results and empirical analyses demonstrate the necessity and high-quality of InferWiki. Nevertheless, the performance gap among various inferential assumptions and patterns presents the difficulty and inspires future research direction. Our datasets can be found in https://github. com/TaoMiner/inferwiki. 2021-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7316 https://ink.library.smu.edu.sg/context/sis_research/article/8319/viewcontent/2021.acl_long.534.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Graphics and Human Computer Interfaces
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Databases and Information Systems Graphics and Human Computer Interfaces
spellingShingle	Databases and Information Systems Graphics and Human Computer Interfaces CAO, Yixin JI, Xiang LV, Xin LI, Juanzi WEN, Yonggang ZHANG, Hanwang Are missing links predictable? An inferential benchmark for knowledge graph completion
description	We present InferWiki, a Knowledge Graph Completion (KGC) dataset that improves upon existing benchmarks in inferential ability, assumptions, and patterns. First, each testing sample is predictable with supportive data in the training set. To ensure it, we propose to utilize rule-guided train/test generation, instead of conventional random split. Second, InferWiki initiates the evaluation following the open-world assumption and improves the inferential difficulty of the closed-world assumption, by providing manually annotated negative and unknown triples. Third, we include various inference patterns (e.g., reasoning path length and types) for comprehensive evaluation. In experiments, we curate two settings of InferWiki varying in sizes and structures, and apply the construction process on CoDEx as comparative datasets. The results and empirical analyses demonstrate the necessity and high-quality of InferWiki. Nevertheless, the performance gap among various inferential assumptions and patterns presents the difficulty and inspires future research direction. Our datasets can be found in https://github. com/TaoMiner/inferwiki.
format	text
author	CAO, Yixin JI, Xiang LV, Xin LI, Juanzi WEN, Yonggang ZHANG, Hanwang
author_facet	CAO, Yixin JI, Xiang LV, Xin LI, Juanzi WEN, Yonggang ZHANG, Hanwang
author_sort	CAO, Yixin
title	Are missing links predictable? An inferential benchmark for knowledge graph completion
title_short	Are missing links predictable? An inferential benchmark for knowledge graph completion
title_full	Are missing links predictable? An inferential benchmark for knowledge graph completion
title_fullStr	Are missing links predictable? An inferential benchmark for knowledge graph completion
title_full_unstemmed	Are missing links predictable? An inferential benchmark for knowledge graph completion
title_sort	are missing links predictable? an inferential benchmark for knowledge graph completion
publisher	Institutional Knowledge at Singapore Management University
publishDate	2021
url	https://ink.library.smu.edu.sg/sis_research/7316 https://ink.library.smu.edu.sg/context/sis_research/article/8319/viewcontent/2021.acl_long.534.pdf
_version_	1770576310147481600

Are missing links predictable? An inferential benchmark for knowledge graph completion

Similar Items