SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
This final project focuses on utilizing Sequential Pattern Mining (SPM) as a preprocess method for Enhanced Graph Embedding with Side Information or EGES input (Wang et al., 2018). The purpose of this project is to build a graph embedding that has a high transaction certainty by replacing the inp...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/49943 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:49943 |
---|---|
spelling |
id-itb.:499432020-09-21T15:39:07ZSEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM Ayu Chandra Kemala, Shinta Indonesia Final Project graph embedding, sequential pattern mining, recommendation system INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/49943 This final project focuses on utilizing Sequential Pattern Mining (SPM) as a preprocess method for Enhanced Graph Embedding with Side Information or EGES input (Wang et al., 2018). The purpose of this project is to build a graph embedding that has a high transaction certainty by replacing the input that was originally in the form of an item page access history data into an item purchase history data of e-commerce. EGES accepts input in the form of page access history data and side information for each item. Although its performance is better than other graph embedding methods, EGES requires large data to perform well. The use of purchase history data which is less in number will affect the resulting embedding. By using SPM, transaction data is converted into a sequential pattern that has a relatively high occurrence frequency, so the input data is more certain. Embedding results from EGES are processed using logistic regression, then their performance is compared to the results of embedding from EGES without SPM to determine the effect of using sequential patterns. Based on the test results, the most suitable methods for EGES input preprocesses are SPADE (Zaki, 2001) and SPAM (Ayres et al., 2002) with AUC values reaching 0.5757, 5-6% higher than EGES without SPM, and AUPR values reaching 0.7867, 40% higher than EGES without SPM. Based on analysis and experiments, SPM is an appropriate method for preprocessing EGES input, and sequential pattern forms are suitable for use as EGES input. In addition to the SPM method, things that affect the quality of the prediction of the model from EGES are the amount of data, the balance of the number of classes, and the selection of EGES hyperparameters. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
This final project focuses on utilizing Sequential Pattern Mining (SPM) as a preprocess method
for Enhanced Graph Embedding with Side Information or EGES input (Wang et al., 2018).
The purpose of this project is to build a graph embedding that has a high transaction certainty
by replacing the input that was originally in the form of an item page access history data into
an item purchase history data of e-commerce.
EGES accepts input in the form of page access history data and side information for each item.
Although its performance is better than other graph embedding methods, EGES requires large
data to perform well. The use of purchase history data which is less in number will affect the
resulting embedding. By using SPM, transaction data is converted into a sequential pattern that
has a relatively high occurrence frequency, so the input data is more certain.
Embedding results from EGES are processed using logistic regression, then their performance
is compared to the results of embedding from EGES without SPM to determine the effect of
using sequential patterns. Based on the test results, the most suitable methods for EGES input
preprocesses are SPADE (Zaki, 2001) and SPAM (Ayres et al., 2002) with AUC values
reaching 0.5757, 5-6% higher than EGES without SPM, and AUPR values reaching 0.7867,
40% higher than EGES without SPM.
Based on analysis and experiments, SPM is an appropriate method for preprocessing EGES
input, and sequential pattern forms are suitable for use as EGES input. In addition to the SPM
method, things that affect the quality of the prediction of the model from EGES are the amount
of data, the balance of the number of classes, and the selection of EGES hyperparameters. |
format |
Final Project |
author |
Ayu Chandra Kemala, Shinta |
spellingShingle |
Ayu Chandra Kemala, Shinta SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM |
author_facet |
Ayu Chandra Kemala, Shinta |
author_sort |
Ayu Chandra Kemala, Shinta |
title |
SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM |
title_short |
SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM |
title_full |
SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM |
title_fullStr |
SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM |
title_full_unstemmed |
SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM |
title_sort |
sequential pattern-based enhanced graph embedding with side information for recommendation system |
url |
https://digilib.itb.ac.id/gdl/view/49943 |
_version_ |
1822928318865866752 |