Variational graph author topic modeling

While Variational Graph Auto-Encoder (VGAE) has presented promising ability to learn representations for documents, most existing VGAE methods do not model a latent topic structure and therefore lack semantic interpretability. Exploring hidden topics within documents and discovering key words associ...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Ce, LAUW, Hady Wirawan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7271
https://ink.library.smu.edu.sg/context/sis_research/article/8274/viewcontent/3534678.3539310_pv.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8274
record_format dspace
spelling sg-smu-ink.sis_research-82742022-09-15T07:31:47Z Variational graph author topic modeling ZHANG, Ce LAUW, Hady Wirawan While Variational Graph Auto-Encoder (VGAE) has presented promising ability to learn representations for documents, most existing VGAE methods do not model a latent topic structure and therefore lack semantic interpretability. Exploring hidden topics within documents and discovering key words associated with each topic allow us to develop a semantic interpretation of the corpus. Moreover, documents are usually associated with authors. For example, news reports have journalists specializing in writing certain type of events, academic papers have authors with expertise in certain research topics, etc. Modeling authorship information could benefit topic modeling, since documents by the same authors tend to reveal similar semantics. This observation also holds for documents published on the same venues. However, most topic models ignore the auxiliary authorship and publication venues. Given above two challenges, we propose a Variational Graph Author Topic Model for documents to integrate both semantic interpretability and authorship and venue modeling into a unified VGAE framework. For authorship and venue modeling, we construct a hierarchical multi-layered document graph with both intra- and cross-layer topic propagation. For semantic interpretability, three word relations (contextual, syntactic, semantic) are modeled and constitute three word sub-layers in the document graph. We further propose three alternatives for variational divergence. Experiments verify the effectiveness of our model on supervised and unsupervised tasks. 2022-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7271 info:doi/10.1145/3534678.3539310 https://ink.library.smu.edu.sg/context/sis_research/article/8274/viewcontent/3534678.3539310_pv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University author topic modeling graph neural networks text mining variational graph auto-encoder Databases and Information Systems Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic author topic modeling
graph neural networks
text mining
variational graph auto-encoder
Databases and Information Systems
Theory and Algorithms
spellingShingle author topic modeling
graph neural networks
text mining
variational graph auto-encoder
Databases and Information Systems
Theory and Algorithms
ZHANG, Ce
LAUW, Hady Wirawan
Variational graph author topic modeling
description While Variational Graph Auto-Encoder (VGAE) has presented promising ability to learn representations for documents, most existing VGAE methods do not model a latent topic structure and therefore lack semantic interpretability. Exploring hidden topics within documents and discovering key words associated with each topic allow us to develop a semantic interpretation of the corpus. Moreover, documents are usually associated with authors. For example, news reports have journalists specializing in writing certain type of events, academic papers have authors with expertise in certain research topics, etc. Modeling authorship information could benefit topic modeling, since documents by the same authors tend to reveal similar semantics. This observation also holds for documents published on the same venues. However, most topic models ignore the auxiliary authorship and publication venues. Given above two challenges, we propose a Variational Graph Author Topic Model for documents to integrate both semantic interpretability and authorship and venue modeling into a unified VGAE framework. For authorship and venue modeling, we construct a hierarchical multi-layered document graph with both intra- and cross-layer topic propagation. For semantic interpretability, three word relations (contextual, syntactic, semantic) are modeled and constitute three word sub-layers in the document graph. We further propose three alternatives for variational divergence. Experiments verify the effectiveness of our model on supervised and unsupervised tasks.
format text
author ZHANG, Ce
LAUW, Hady Wirawan
author_facet ZHANG, Ce
LAUW, Hady Wirawan
author_sort ZHANG, Ce
title Variational graph author topic modeling
title_short Variational graph author topic modeling
title_full Variational graph author topic modeling
title_fullStr Variational graph author topic modeling
title_full_unstemmed Variational graph author topic modeling
title_sort variational graph author topic modeling
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7271
https://ink.library.smu.edu.sg/context/sis_research/article/8274/viewcontent/3534678.3539310_pv.pdf
_version_ 1770576295808204800