Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)

Texts are often interconnected in a network structure, e.g., academic papers via citations. On the one hand, though Graph Neural Networks (GNNs) have shown promising ability to derive effective embeddings for networked documents, they do not assume latent topics, resulting in uninterpretahle embeddi...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Ce, LAUW, Hady Wirawan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9840
https://ink.library.smu.edu.sg/context/sis_research/article/10840/viewcontent/icde24.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10840
record_format dspace
spelling sg-smu-ink.sis_research-108402024-12-24T03:28:10Z Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) ZHANG, Ce LAUW, Hady Wirawan Texts are often interconnected in a network structure, e.g., academic papers via citations. On the one hand, though Graph Neural Networks (GNNs) have shown promising ability to derive effective embeddings for networked documents, they do not assume latent topics, resulting in uninterpretahle embeddings. On the other hand, topic models can infer interpretable document representations. However, most topic models focus on plain text and fail to leverage network structure across documents. In this paper, we propose a GNN-based topic model that both captures network connection and derives semantically interpretable text representations. For network modeling, we build our model with Optimal Transport Barycenter. For semantic interpretability, we extend optimal transport with pre-trained word embeddings. 2024-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9840 info:doi/10.1109/ICDE60146.2024.00503 https://ink.library.smu.edu.sg/context/sis_research/article/10840/viewcontent/icde24.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Graph Neural Networks Text Mining Optimal Transport Dirichlet Distribution Document Networks Artificial Intelligence and Robotics Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Graph Neural Networks
Text Mining
Optimal Transport
Dirichlet Distribution
Document Networks
Artificial Intelligence and Robotics
Computer Sciences
spellingShingle Graph Neural Networks
Text Mining
Optimal Transport
Dirichlet Distribution
Document Networks
Artificial Intelligence and Robotics
Computer Sciences
ZHANG, Ce
LAUW, Hady Wirawan
Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
description Texts are often interconnected in a network structure, e.g., academic papers via citations. On the one hand, though Graph Neural Networks (GNNs) have shown promising ability to derive effective embeddings for networked documents, they do not assume latent topics, resulting in uninterpretahle embeddings. On the other hand, topic models can infer interpretable document representations. However, most topic models focus on plain text and fail to leverage network structure across documents. In this paper, we propose a GNN-based topic model that both captures network connection and derives semantically interpretable text representations. For network modeling, we build our model with Optimal Transport Barycenter. For semantic interpretability, we extend optimal transport with pre-trained word embeddings.
format text
author ZHANG, Ce
LAUW, Hady Wirawan
author_facet ZHANG, Ce
LAUW, Hady Wirawan
author_sort ZHANG, Ce
title Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
title_short Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
title_full Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
title_fullStr Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
title_full_unstemmed Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
title_sort topic modeling on document networks with dirichlet optimal transport barycenter (extended abstract)
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9840
https://ink.library.smu.edu.sg/context/sis_research/article/10840/viewcontent/icde24.pdf
_version_ 1820027796371537920