Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract)
Texts are often interconnected in a network structure, e.g., academic papers via citations. On the one hand, though Graph Neural Networks (GNNs) have shown promising ability to derive effective embeddings for networked documents, they do not assume latent topics, resulting in uninterpretahle embeddi...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9840 https://ink.library.smu.edu.sg/context/sis_research/article/10840/viewcontent/icde24.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10840 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-108402024-12-24T03:28:10Z Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) ZHANG, Ce LAUW, Hady Wirawan Texts are often interconnected in a network structure, e.g., academic papers via citations. On the one hand, though Graph Neural Networks (GNNs) have shown promising ability to derive effective embeddings for networked documents, they do not assume latent topics, resulting in uninterpretahle embeddings. On the other hand, topic models can infer interpretable document representations. However, most topic models focus on plain text and fail to leverage network structure across documents. In this paper, we propose a GNN-based topic model that both captures network connection and derives semantically interpretable text representations. For network modeling, we build our model with Optimal Transport Barycenter. For semantic interpretability, we extend optimal transport with pre-trained word embeddings. 2024-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9840 info:doi/10.1109/ICDE60146.2024.00503 https://ink.library.smu.edu.sg/context/sis_research/article/10840/viewcontent/icde24.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Graph Neural Networks Text Mining Optimal Transport Dirichlet Distribution Document Networks Artificial Intelligence and Robotics Computer Sciences |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Graph Neural Networks Text Mining Optimal Transport Dirichlet Distribution Document Networks Artificial Intelligence and Robotics Computer Sciences |
spellingShingle |
Graph Neural Networks Text Mining Optimal Transport Dirichlet Distribution Document Networks Artificial Intelligence and Robotics Computer Sciences ZHANG, Ce LAUW, Hady Wirawan Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) |
description |
Texts are often interconnected in a network structure, e.g., academic papers via citations. On the one hand, though Graph Neural Networks (GNNs) have shown promising ability to derive effective embeddings for networked documents, they do not assume latent topics, resulting in uninterpretahle embeddings. On the other hand, topic models can infer interpretable document representations. However, most topic models focus on plain text and fail to leverage network structure across documents. In this paper, we propose a GNN-based topic model that both captures network connection and derives semantically interpretable text representations. For network modeling, we build our model with Optimal Transport Barycenter. For semantic interpretability, we extend optimal transport with pre-trained word embeddings. |
format |
text |
author |
ZHANG, Ce LAUW, Hady Wirawan |
author_facet |
ZHANG, Ce LAUW, Hady Wirawan |
author_sort |
ZHANG, Ce |
title |
Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) |
title_short |
Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) |
title_full |
Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) |
title_fullStr |
Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) |
title_full_unstemmed |
Topic modeling on document networks with Dirichlet Optimal Transport Barycenter (Extended Abstract) |
title_sort |
topic modeling on document networks with dirichlet optimal transport barycenter (extended abstract) |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/9840 https://ink.library.smu.edu.sg/context/sis_research/article/10840/viewcontent/icde24.pdf |
_version_ |
1820027796371537920 |