DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM

DBPedia, Freebase, and other knowledge bases have sparse connectivity, where it is a major challenge in predicting linkages on existing entities. Some recommendation systems use complex models to perform link predictions. In the midst of the rapid development of the internet in the world, accompa...

Full description

Saved in:
Bibliographic Details
Main Author: Thamrin Andrew H Siho, Timothy
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/51443
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:51443
spelling id-itb.:514432020-09-28T19:22:22ZDEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM Thamrin Andrew H Siho, Timothy Indonesia Final Project link prediction, knowledge base, Spark, entity encoding, relation scoring INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/51443 DBPedia, Freebase, and other knowledge bases have sparse connectivity, where it is a major challenge in predicting linkages on existing entities. Some recommendation systems use complex models to perform link predictions. In the midst of the rapid development of the internet in the world, accompanied by ever larger data movements, the existing knowledge base is also getting bigger. So, it is also necessary to build a parallel and distributed model so that development can be carried out more quickly and efficiently. Therefore, the final project regarding the construction of a knowledge base link prediction system is carried out. In this study, it was determined how to encode entities and also assess relationships in parallel and distributed by utilizing various libraries on Spark such as Word2Vec, TF-IDF, vector multiplication, and several other processing carried out using a map on Spark. This study uses xLearn as a factorization machine. Based on the research conducted, it was found that the system built successfully carried out all the model building processes in the Spark distributed system. The system can build models more efficiently and generate link predictions with FMR 27 and FMRR 0.56 values. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description DBPedia, Freebase, and other knowledge bases have sparse connectivity, where it is a major challenge in predicting linkages on existing entities. Some recommendation systems use complex models to perform link predictions. In the midst of the rapid development of the internet in the world, accompanied by ever larger data movements, the existing knowledge base is also getting bigger. So, it is also necessary to build a parallel and distributed model so that development can be carried out more quickly and efficiently. Therefore, the final project regarding the construction of a knowledge base link prediction system is carried out. In this study, it was determined how to encode entities and also assess relationships in parallel and distributed by utilizing various libraries on Spark such as Word2Vec, TF-IDF, vector multiplication, and several other processing carried out using a map on Spark. This study uses xLearn as a factorization machine. Based on the research conducted, it was found that the system built successfully carried out all the model building processes in the Spark distributed system. The system can build models more efficiently and generate link predictions with FMR 27 and FMRR 0.56 values.
format Final Project
author Thamrin Andrew H Siho, Timothy
spellingShingle Thamrin Andrew H Siho, Timothy
DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
author_facet Thamrin Andrew H Siho, Timothy
author_sort Thamrin Andrew H Siho, Timothy
title DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
title_short DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
title_full DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
title_fullStr DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
title_full_unstemmed DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
title_sort development of knowledge based links prediction system in spark distributed system
url https://digilib.itb.ac.id/gdl/view/51443
_version_ 1822928746207772672