Graph contrastive learning
Heterogeneous graph is a natural way to model complex relationships and interactions among entities in the real world, such as social networks or user--product relations. Learning good representations for heterogeneous graphs is a crucial step in deploying large-scale graph-based systems in an effic...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/167024 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Heterogeneous graph is a natural way to model complex relationships and interactions among entities in the real world, such as social networks or user--product relations. Learning good representations for heterogeneous graphs is a crucial step in deploying large-scale graph-based systems in an efficient and effective manner. Despite many great breakthroughs in self-supervised learning for Computer Vision and Natural Language Processing applications, similar efforts for graph data often pale in comparison, especially for heterogeneous graphs. Graph augmentation methods are limited and are weaker those for image data, limiting the potential of contrastive learning on graphs. Node dropping or edge perturbations are typically not suitable for heterogeneous graphs as they may result in invalid structures. Motivated by this, we propose to improve HeCo, the current state-of-the-art method for heterogeneous graph representation learning, with intra-view contrastive learning to obtain extra supervision signal. To maintain a graph's structural integrity, only Dropout is used to generate augmented views. To ensure the contrastive learning objective remain challenging, we further apply modified ArcFace loss to encourage more discriminative embeddings. We call our method HeCo-drop. HeCo-drop enhances HeCo consistently on various datasets, with up to +1% improvements in AUC scores. In addition, we analyse the key differences between graph and image/text data, thus outlining the challenges in adapting existing self-supervised methods to graphs. |
---|