DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM
Knowledge bases such as DBpedia and Freebase are sparsely connected, which makes predicting links between existing entities a major challenge. Some recommendation systems use complex models to perform link prediction. In the midst of the rapid development of the internet in the world, accompa...
Main Author: | Thamrin Andrew H Siho, Timothy |
---|---|
Format: | Final Project |
Language: | Indonesian |
Online Access: | https://digilib.itb.ac.id/gdl/view/51443 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:51443 |
---|---|
spelling |
id-itb.:51443; 2020-09-28T19:22:22Z; DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM; Thamrin Andrew H Siho, Timothy; Indonesia; Final Project; link prediction, knowledge base, Spark, entity encoding, relation scoring; INSTITUT TEKNOLOGI BANDUNG; https://digilib.itb.ac.id/gdl/view/51443; text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesian |
description |
Knowledge bases such as DBpedia and Freebase are sparsely connected, which makes predicting
links between existing entities a major challenge. Some recommendation systems use complex
models to perform link prediction.
As the internet develops rapidly and data volumes keep growing, existing knowledge bases are
growing as well. A parallel, distributed model is therefore needed so that model building can be
carried out more quickly and efficiently. This final project addresses that need by building a
link prediction system for knowledge bases.
The study determines how to encode entities and score relations in a parallel, distributed
manner using Spark libraries such as Word2Vec and TF-IDF, together with vector multiplication
and other processing steps implemented as map operations on Spark. xLearn is used as the
factorization machine.
The results show that the system successfully carries out the entire model-building process on
the Spark distributed system. The system can build models more efficiently and generates link
predictions with an FMR of 27 and an FMRR of 0.56.
|
format |
Final Project |
author |
Thamrin Andrew H Siho, Timothy |
title |
DEVELOPMENT OF KNOWLEDGE BASED LINKS PREDICTION SYSTEM IN SPARK DISTRIBUTED SYSTEM |
url |
https://digilib.itb.ac.id/gdl/view/51443 |
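
The abstract reports FMR and FMRR without defining them. Assuming they stand for filtered mean rank and filtered mean reciprocal rank, as is common in knowledge base link prediction evaluation, the sketch below shows how such values could be computed from model scores. The function names, entity names, and scores are hypothetical and chosen only for illustration; they are not taken from the thesis or from xLearn.

```python
from typing import Dict, List, Set


def filtered_rank(scores: Dict[str, float], true_tail: str, known_tails: Set[str]) -> int:
    """Rank of the true tail entity among candidates (1 is best).

    "Filtered" means that other candidates already known to be correct
    answers for the same query are skipped, so they cannot push the
    true tail down the ranking.
    """
    target = scores[true_tail]
    better = sum(
        1
        for candidate, score in scores.items()
        if candidate != true_tail
        and candidate not in known_tails
        and score > target
    )
    return better + 1


def mean_rank(ranks: List[int]) -> float:
    """Average rank over all test queries (lower is better)."""
    return sum(ranks) / len(ranks)


def mean_reciprocal_rank(ranks: List[int]) -> float:
    """Average of 1/rank over all test queries (higher is better, at most 1)."""
    return sum(1.0 / r for r in ranks) / len(ranks)


# Hypothetical scores for two test queries of the form (head, relation, ?).
scores_q1 = {"Asia": 0.9, "Java": 0.8, "Indonesia": 0.7, "Europe": 0.1}
scores_q2 = {"Indonesia": 0.9, "Asia": 0.4, "Java": 0.2}

ranks = [
    # "Asia" is also a known correct answer for query 1, so it is filtered out.
    filtered_rank(scores_q1, "Indonesia", {"Indonesia", "Asia"}),
    filtered_rank(scores_q2, "Indonesia", {"Indonesia"}),
]

print(ranks)                        # [2, 1]
print(mean_rank(ranks))             # 1.5
print(mean_reciprocal_rank(ranks))  # 0.75
```

Under this reading, an FMR of 27 would mean that the correct entity sits at position 27 on average after filtering, while an FMRR of 0.56 would indicate that for many queries the correct entity is ranked first or second.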