Topic modeling on document networks with adjacent-encoder

Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Ce, LAUW, Hady W.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5124
https://ink.library.smu.edu.sg/context/sis_research/article/6126/viewcontent/aaai20a.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6126
record_format dspace
spelling sg-smu-ink.sis_research-61262021-06-08T05:00:32Z Topic modeling on document networks with adjacent-encoder ZHANG, Ce LAUW, Hady W. Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve both textual content and network structure. On the basis of reconstructing not only the input document but also its adjacent neighbors, we develop two neural encoder architectures. Adjacent-Encoder, or AdjEnc, induces competition among documents for topic propagation, and reconstruction among neighbors for semantic capture. Adjacent-Encoder-X, or AdjEnc-X, extends this to also encode the network structure in addition to document content. We evaluate our models on real-world document networks quantitatively and qualitatively, outperforming comparable baselines comprehensively 2020-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5124 info:doi/10.1609/aaai.v34i04.6152 https://ink.library.smu.edu.sg/context/sis_research/article/6126/viewcontent/aaai20a.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Encoder architecture Low-dimensional representation Network structures Semantic capture Textual content Topic Modeling Artificial intelligence Artificial Intelligence and Robotics Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Encoder architecture
Low-dimensional representation
Network structures
Semantic capture
Textual content
Topic Modeling
Artificial intelligence
Artificial Intelligence and Robotics
Databases and Information Systems
spellingShingle Encoder architecture
Low-dimensional representation
Network structures
Semantic capture
Textual content
Topic Modeling
Artificial intelligence
Artificial Intelligence and Robotics
Databases and Information Systems
ZHANG, Ce
LAUW, Hady W.
Topic modeling on document networks with adjacent-encoder
description Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve both textual content and network structure. On the basis of reconstructing not only the input document but also its adjacent neighbors, we develop two neural encoder architectures. Adjacent-Encoder, or AdjEnc, induces competition among documents for topic propagation, and reconstruction among neighbors for semantic capture. Adjacent-Encoder-X, or AdjEnc-X, extends this to also encode the network structure in addition to document content. We evaluate our models on real-world document networks quantitatively and qualitatively, outperforming comparable baselines comprehensively
format text
author ZHANG, Ce
LAUW, Hady W.
author_facet ZHANG, Ce
LAUW, Hady W.
author_sort ZHANG, Ce
title Topic modeling on document networks with adjacent-encoder
title_short Topic modeling on document networks with adjacent-encoder
title_full Topic modeling on document networks with adjacent-encoder
title_fullStr Topic modeling on document networks with adjacent-encoder
title_full_unstemmed Topic modeling on document networks with adjacent-encoder
title_sort topic modeling on document networks with adjacent-encoder
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/5124
https://ink.library.smu.edu.sg/context/sis_research/article/6126/viewcontent/aaai20a.pdf
_version_ 1770575251023855616