Topic modeling on document networks with adjacent-encoder
Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/5124 https://ink.library.smu.edu.sg/context/sis_research/article/6126/viewcontent/aaai20a.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-6126 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-61262021-06-08T05:00:32Z Topic modeling on document networks with adjacent-encoder ZHANG, Ce LAUW, Hady W. Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve both textual content and network structure. On the basis of reconstructing not only the input document but also its adjacent neighbors, we develop two neural encoder architectures. Adjacent-Encoder, or AdjEnc, induces competition among documents for topic propagation, and reconstruction among neighbors for semantic capture. Adjacent-Encoder-X, or AdjEnc-X, extends this to also encode the network structure in addition to document content. We evaluate our models on real-world document networks quantitatively and qualitatively, outperforming comparable baselines comprehensively 2020-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5124 info:doi/10.1609/aaai.v34i04.6152 https://ink.library.smu.edu.sg/context/sis_research/article/6126/viewcontent/aaai20a.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Encoder architecture Low-dimensional representation Network structures Semantic capture Textual content Topic Modeling Artificial intelligence Artificial Intelligence and Robotics Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Encoder architecture Low-dimensional representation Network structures Semantic capture Textual content Topic Modeling Artificial intelligence Artificial Intelligence and Robotics Databases and Information Systems |
spellingShingle |
Encoder architecture Low-dimensional representation Network structures Semantic capture Textual content Topic Modeling Artificial intelligence Artificial Intelligence and Robotics Databases and Information Systems ZHANG, Ce LAUW, Hady W. Topic modeling on document networks with adjacent-encoder |
description |
Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve both textual content and network structure. On the basis of reconstructing not only the input document but also its adjacent neighbors, we develop two neural encoder architectures. Adjacent-Encoder, or AdjEnc, induces competition among documents for topic propagation, and reconstruction among neighbors for semantic capture. Adjacent-Encoder-X, or AdjEnc-X, extends this to also encode the network structure in addition to document content. We evaluate our models on real-world document networks quantitatively and qualitatively, outperforming comparable baselines comprehensively |
format |
text |
author |
ZHANG, Ce LAUW, Hady W. |
author_facet |
ZHANG, Ce LAUW, Hady W. |
author_sort |
ZHANG, Ce |
title |
Topic modeling on document networks with adjacent-encoder |
title_short |
Topic modeling on document networks with adjacent-encoder |
title_full |
Topic modeling on document networks with adjacent-encoder |
title_fullStr |
Topic modeling on document networks with adjacent-encoder |
title_full_unstemmed |
Topic modeling on document networks with adjacent-encoder |
title_sort |
topic modeling on document networks with adjacent-encoder |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/5124 https://ink.library.smu.edu.sg/context/sis_research/article/6126/viewcontent/aaai20a.pdf |
_version_ |
1770575251023855616 |