Graph representative learning

Information networks are commonly used in multiple applications since large amount of data exists in information networks. These data and relationship with different types are of great significant to many application such as community recommendation, users' preference prediction, etc. Generally...

Full description

Saved in:
Bibliographic Details
Main Author: Li, Chuyun
Other Authors: Lihui CHEN
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/141308
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-141308
record_format dspace
spelling sg-ntu-dr.10356-1413082023-07-04T16:54:25Z Graph representative learning Li, Chuyun Lihui CHEN School of Electrical and Electronic Engineering ELHCHEN@ntu.edu.sg Engineering::Electrical and electronic engineering Information networks are commonly used in multiple applications since large amount of data exists in information networks. These data and relationship with different types are of great significant to many application such as community recommendation, users' preference prediction, etc. Generally, the analysis and process of large-scale information networks has attracted widespread attention in the industry. Graph is an abstraction for representing relational data from multiple domains, and graph embedding is a good way for analysis of information networks. A good way of graph embeddings may be obtained by representing node in the graph as a vector, and these representations should contain useful structural information (features) of the graph. In this project, we aim to get feature vectors to represent graphs/sub-graphs in information networks so that machine learning approach can be applied for a classification task. We study two different state-of-the-art representation models: Deep-walk and ASPEM. Both of two algorithms are designed to learn representation of nodes in an information network. Deep-walk is an approach that generates a series of random walks, and the deep-walk based embedding uses Skip-gram to process these generated random walks because of the similarity between the distribution of the random walk in the network and that of sequences in vocabulary; ASPEM is more complicated embedding approach compared with Deep-walk because it proposed a concept of aspect. By dividing multiple aspects, ASPEM can reduce some impacts which effect the outcome of representative learning process and the final classification performance. We also make a comparative analysis of the performance of two algorithms on the node classification task for representative learning on four large-scale real-word data sets. The empirical results show that ASPEM outperforms Deep-walk embedding if multiple types of nodes or edges exists in the network. Master of Science (Signal Processing) 2020-06-07T12:40:03Z 2020-06-07T12:40:03Z 2020 Thesis-Master by Coursework https://hdl.handle.net/10356/141308 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Li, Chuyun
Graph representative learning
description Information networks are commonly used in multiple applications since large amount of data exists in information networks. These data and relationship with different types are of great significant to many application such as community recommendation, users' preference prediction, etc. Generally, the analysis and process of large-scale information networks has attracted widespread attention in the industry. Graph is an abstraction for representing relational data from multiple domains, and graph embedding is a good way for analysis of information networks. A good way of graph embeddings may be obtained by representing node in the graph as a vector, and these representations should contain useful structural information (features) of the graph. In this project, we aim to get feature vectors to represent graphs/sub-graphs in information networks so that machine learning approach can be applied for a classification task. We study two different state-of-the-art representation models: Deep-walk and ASPEM. Both of two algorithms are designed to learn representation of nodes in an information network. Deep-walk is an approach that generates a series of random walks, and the deep-walk based embedding uses Skip-gram to process these generated random walks because of the similarity between the distribution of the random walk in the network and that of sequences in vocabulary; ASPEM is more complicated embedding approach compared with Deep-walk because it proposed a concept of aspect. By dividing multiple aspects, ASPEM can reduce some impacts which effect the outcome of representative learning process and the final classification performance. We also make a comparative analysis of the performance of two algorithms on the node classification task for representative learning on four large-scale real-word data sets. The empirical results show that ASPEM outperforms Deep-walk embedding if multiple types of nodes or edges exists in the network.
author2 Lihui CHEN
author_facet Lihui CHEN
Li, Chuyun
format Thesis-Master by Coursework
author Li, Chuyun
author_sort Li, Chuyun
title Graph representative learning
title_short Graph representative learning
title_full Graph representative learning
title_fullStr Graph representative learning
title_full_unstemmed Graph representative learning
title_sort graph representative learning
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/141308
_version_ 1772826156175720448