SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM

This final project focuses on utilizing Sequential Pattern Mining (SPM) as a preprocess method for Enhanced Graph Embedding with Side Information or EGES input (Wang et al., 2018). The purpose of this project is to build a graph embedding that has a high transaction certainty by replacing the inp...

Full description

Saved in:
Bibliographic Details
Main Author: Ayu Chandra Kemala, Shinta
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/49943
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:49943
spelling id-itb.:499432020-09-21T15:39:07ZSEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM Ayu Chandra Kemala, Shinta Indonesia Final Project graph embedding, sequential pattern mining, recommendation system INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/49943 This final project focuses on utilizing Sequential Pattern Mining (SPM) as a preprocess method for Enhanced Graph Embedding with Side Information or EGES input (Wang et al., 2018). The purpose of this project is to build a graph embedding that has a high transaction certainty by replacing the input that was originally in the form of an item page access history data into an item purchase history data of e-commerce. EGES accepts input in the form of page access history data and side information for each item. Although its performance is better than other graph embedding methods, EGES requires large data to perform well. The use of purchase history data which is less in number will affect the resulting embedding. By using SPM, transaction data is converted into a sequential pattern that has a relatively high occurrence frequency, so the input data is more certain. Embedding results from EGES are processed using logistic regression, then their performance is compared to the results of embedding from EGES without SPM to determine the effect of using sequential patterns. Based on the test results, the most suitable methods for EGES input preprocesses are SPADE (Zaki, 2001) and SPAM (Ayres et al., 2002) with AUC values reaching 0.5757, 5-6% higher than EGES without SPM, and AUPR values reaching 0.7867, 40% higher than EGES without SPM. Based on analysis and experiments, SPM is an appropriate method for preprocessing EGES input, and sequential pattern forms are suitable for use as EGES input. In addition to the SPM method, things that affect the quality of the prediction of the model from EGES are the amount of data, the balance of the number of classes, and the selection of EGES hyperparameters. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description This final project focuses on utilizing Sequential Pattern Mining (SPM) as a preprocess method for Enhanced Graph Embedding with Side Information or EGES input (Wang et al., 2018). The purpose of this project is to build a graph embedding that has a high transaction certainty by replacing the input that was originally in the form of an item page access history data into an item purchase history data of e-commerce. EGES accepts input in the form of page access history data and side information for each item. Although its performance is better than other graph embedding methods, EGES requires large data to perform well. The use of purchase history data which is less in number will affect the resulting embedding. By using SPM, transaction data is converted into a sequential pattern that has a relatively high occurrence frequency, so the input data is more certain. Embedding results from EGES are processed using logistic regression, then their performance is compared to the results of embedding from EGES without SPM to determine the effect of using sequential patterns. Based on the test results, the most suitable methods for EGES input preprocesses are SPADE (Zaki, 2001) and SPAM (Ayres et al., 2002) with AUC values reaching 0.5757, 5-6% higher than EGES without SPM, and AUPR values reaching 0.7867, 40% higher than EGES without SPM. Based on analysis and experiments, SPM is an appropriate method for preprocessing EGES input, and sequential pattern forms are suitable for use as EGES input. In addition to the SPM method, things that affect the quality of the prediction of the model from EGES are the amount of data, the balance of the number of classes, and the selection of EGES hyperparameters.
format Final Project
author Ayu Chandra Kemala, Shinta
spellingShingle Ayu Chandra Kemala, Shinta
SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
author_facet Ayu Chandra Kemala, Shinta
author_sort Ayu Chandra Kemala, Shinta
title SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
title_short SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
title_full SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
title_fullStr SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
title_full_unstemmed SEQUENTIAL PATTERN-BASED ENHANCED GRAPH EMBEDDING WITH SIDE INFORMATION FOR RECOMMENDATION SYSTEM
title_sort sequential pattern-based enhanced graph embedding with side information for recommendation system
url https://digilib.itb.ac.id/gdl/view/49943
_version_ 1822928318865866752