Variational graph author topic modeling
While Variational Graph Auto-Encoder (VGAE) has presented promising ability to learn representations for documents, most existing VGAE methods do not model a latent topic structure and therefore lack semantic interpretability. Exploring hidden topics within documents and discovering key words associ...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7271 https://ink.library.smu.edu.sg/context/sis_research/article/8274/viewcontent/3534678.3539310_pv.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8274 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-82742022-09-15T07:31:47Z Variational graph author topic modeling ZHANG, Ce LAUW, Hady Wirawan While Variational Graph Auto-Encoder (VGAE) has presented promising ability to learn representations for documents, most existing VGAE methods do not model a latent topic structure and therefore lack semantic interpretability. Exploring hidden topics within documents and discovering key words associated with each topic allow us to develop a semantic interpretation of the corpus. Moreover, documents are usually associated with authors. For example, news reports have journalists specializing in writing certain type of events, academic papers have authors with expertise in certain research topics, etc. Modeling authorship information could benefit topic modeling, since documents by the same authors tend to reveal similar semantics. This observation also holds for documents published on the same venues. However, most topic models ignore the auxiliary authorship and publication venues. Given above two challenges, we propose a Variational Graph Author Topic Model for documents to integrate both semantic interpretability and authorship and venue modeling into a unified VGAE framework. For authorship and venue modeling, we construct a hierarchical multi-layered document graph with both intra- and cross-layer topic propagation. For semantic interpretability, three word relations (contextual, syntactic, semantic) are modeled and constitute three word sub-layers in the document graph. We further propose three alternatives for variational divergence. Experiments verify the effectiveness of our model on supervised and unsupervised tasks. 2022-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7271 info:doi/10.1145/3534678.3539310 https://ink.library.smu.edu.sg/context/sis_research/article/8274/viewcontent/3534678.3539310_pv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University author topic modeling graph neural networks text mining variational graph auto-encoder Databases and Information Systems Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
author topic modeling graph neural networks text mining variational graph auto-encoder Databases and Information Systems Theory and Algorithms |
spellingShingle |
author topic modeling graph neural networks text mining variational graph auto-encoder Databases and Information Systems Theory and Algorithms ZHANG, Ce LAUW, Hady Wirawan Variational graph author topic modeling |
description |
While Variational Graph Auto-Encoder (VGAE) has presented promising ability to learn representations for documents, most existing VGAE methods do not model a latent topic structure and therefore lack semantic interpretability. Exploring hidden topics within documents and discovering key words associated with each topic allow us to develop a semantic interpretation of the corpus. Moreover, documents are usually associated with authors. For example, news reports have journalists specializing in writing certain type of events, academic papers have authors with expertise in certain research topics, etc. Modeling authorship information could benefit topic modeling, since documents by the same authors tend to reveal similar semantics. This observation also holds for documents published on the same venues. However, most topic models ignore the auxiliary authorship and publication venues. Given above two challenges, we propose a Variational Graph Author Topic Model for documents to integrate both semantic interpretability and authorship and venue modeling into a unified VGAE framework. For authorship and venue modeling, we construct a hierarchical multi-layered document graph with both intra- and cross-layer topic propagation. For semantic interpretability, three word relations (contextual, syntactic, semantic) are modeled and constitute three word sub-layers in the document graph. We further propose three alternatives for variational divergence. Experiments verify the effectiveness of our model on supervised and unsupervised tasks. |
format |
text |
author |
ZHANG, Ce LAUW, Hady Wirawan |
author_facet |
ZHANG, Ce LAUW, Hady Wirawan |
author_sort |
ZHANG, Ce |
title |
Variational graph author topic modeling |
title_short |
Variational graph author topic modeling |
title_full |
Variational graph author topic modeling |
title_fullStr |
Variational graph author topic modeling |
title_full_unstemmed |
Variational graph author topic modeling |
title_sort |
variational graph author topic modeling |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7271 https://ink.library.smu.edu.sg/context/sis_research/article/8274/viewcontent/3534678.3539310_pv.pdf |
_version_ |
1770576295808204800 |