Semantic Visualization for Spherical Representation
Visualization of high-dimensional data such as text documents is widely applicable. The traditional means is to find an appropriate embedding of the high-dimensional representation in a low-dimensional visualizable space. As topic modeling is a useful form of dimensionality reduction that preserves...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2014
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2250 https://ink.library.smu.edu.sg/context/sis_research/article/3250/viewcontent/kdd14.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3250 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-32502017-12-26T06:00:45Z Semantic Visualization for Spherical Representation LE, Tuan M. V. LAUW, Hady W. Visualization of high-dimensional data such as text documents is widely applicable. The traditional means is to find an appropriate embedding of the high-dimensional representation in a low-dimensional visualizable space. As topic modeling is a useful form of dimensionality reduction that preserves the semantics in documents, recent approaches aim for a visualization that is consistent with both the original word space, as well as the semantic topic space. In this paper, we address the semantic visualization problem. Given a corpus of documents, the objective is to simultaneously learn the topic distributions as well as the visualization coordinates of documents. We propose to develop a semantic visualization model that approximates L2-normalized data directly. The key is to associate each document with three representations: a coordinate in the visualization space, a multinomial distribution in the topic space, and a directional vector in a high-dimensional unit hypersphere in the word space. We join these representations in a unified generative model, and describe its parameter estimation through variational inference. Comprehensive experiments on real-life text datasets show that the proposed method outperforms the existing baselines on objective evaluation metrics for visualization quality and topic interpretability. 2014-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2250 info:doi/10.1145/2623330.2623620 https://ink.library.smu.edu.sg/context/sis_research/article/3250/viewcontent/kdd14.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing LE, Tuan M. V. LAUW, Hady W. Semantic Visualization for Spherical Representation |
description |
Visualization of high-dimensional data such as text documents is widely applicable. The traditional means is to find an appropriate embedding of the high-dimensional representation in a low-dimensional visualizable space. As topic modeling is a useful form of dimensionality reduction that preserves the semantics in documents, recent approaches aim for a visualization that is consistent with both the original word space, as well as the semantic topic space. In this paper, we address the semantic visualization problem. Given a corpus of documents, the objective is to simultaneously learn the topic distributions as well as the visualization coordinates of documents. We propose to develop a semantic visualization model that approximates L2-normalized data directly. The key is to associate each document with three representations: a coordinate in the visualization space, a multinomial distribution in the topic space, and a directional vector in a high-dimensional unit hypersphere in the word space. We join these representations in a unified generative model, and describe its parameter estimation through variational inference. Comprehensive experiments on real-life text datasets show that the proposed method outperforms the existing baselines on objective evaluation metrics for visualization quality and topic interpretability. |
format |
text |
author |
LE, Tuan M. V. LAUW, Hady W. |
author_facet |
LE, Tuan M. V. LAUW, Hady W. |
author_sort |
LE, Tuan M. V. |
title |
Semantic Visualization for Spherical Representation |
title_short |
Semantic Visualization for Spherical Representation |
title_full |
Semantic Visualization for Spherical Representation |
title_fullStr |
Semantic Visualization for Spherical Representation |
title_full_unstemmed |
Semantic Visualization for Spherical Representation |
title_sort |
semantic visualization for spherical representation |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2014 |
url |
https://ink.library.smu.edu.sg/sis_research/2250 https://ink.library.smu.edu.sg/context/sis_research/article/3250/viewcontent/kdd14.pdf |
_version_ |
1770571928368578560 |