Citation network analysis with deep learning
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: | 2019 |
Subjects: | |
Online Access: | http://hdl.handle.net/10356/77382 |
Institution: | Nanyang Technological University |
Summary: | Heterogeneous information networks (HINs) are ubiquitous in real-world applications. At the same time, node embedding has been regarded as a useful tool to mine and learn from networked data. As a result, it is of interest to develop HIN embedding methods. However, the heterogeneity in HINs introduces not only rich information but also potentially incompatible semantics, which poses special challenges to embedding learning in HINs. To preserve the rich yet potentially incompatible information in HIN embedding, we propose to study the problem of comprehensive transcription of heterogeneous information networks. The comprehensive transcription of HINs also provides an easy-to-use approach to unleash the power of HINs, since it requires no additional supervision, expertise, or feature engineering. To cope with the challenges in the comprehensive transcription of HINs, we studied the HEER algorithm, which embeds HINs via edge representations that are further coupled with properly learned heterogeneous metrics. To verify the efficacy of HEER, we conducted experiments on the real-world DBIS dataset with an edge reconstruction task and also ran simulations on several case studies to fully understand the strengths and weaknesses of HEER. In addition, we explored a newly proposed model that enhances HEER with meta-path information to improve performance. Experimental results demonstrate the effectiveness of the newly proposed model and the utility of edge representations and heterogeneous metrics. |
---|
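The idea of scoring typed edges through edge representations and per-type metrics can be illustrated with a small sketch. This is not the project's implementation. It assumes, purely for illustration, that an edge representation is the element-wise product of two node embeddings and that each edge type has its own weight vector acting as its metric. The node names, edge types, embedding values, and function names below are all hypothetical.

```python
import math

def edge_representation(u_vec, v_vec):
    """Assumed form: element-wise (Hadamard) product of two node embeddings."""
    return [a * b for a, b in zip(u_vec, v_vec)]

def typed_score(u_vec, v_vec, metric):
    """Sigmoid of the edge representation weighted by one edge type's metric."""
    g = edge_representation(u_vec, v_vec)
    z = sum(m * x for m, x in zip(metric, g))
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical toy embeddings for three nodes of different types.
embeddings = {
    "author_a": [0.9, 0.1, -0.3],
    "paper_p": [0.8, 0.2, -0.1],
    "venue_v": [-0.5, 0.7, 0.4],
}

# Hypothetical per-edge-type metric vectors (one weight per dimension).
metrics = {
    "writes": [2.0, 0.5, 1.0],
    "published_in": [0.3, 2.0, 1.5],
}

def rank_candidates(source, candidates, edge_type):
    """Rank candidate targets for one source node under one edge type,
    as a toy version of an edge reconstruction query."""
    scored = [
        (c, typed_score(embeddings[source], embeddings[c], metrics[edge_type]))
        for c in candidates
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

if __name__ == "__main__":
    for name, score in rank_candidates("author_a", ["paper_p", "venue_v"], "writes"):
        print(f"{name}: {score:.3f}")
```

Because each edge type has its own metric, the same pair of nodes can score differently under "writes" and "published_in", which is one simple way to keep potentially incompatible semantics from being forced into a single similarity measure.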