Semantic Visualization for Spherical Representation

Visualization of high-dimensional data such as text documents is widely applicable. The traditional means is to find an appropriate embedding of the high-dimensional representation in a low-dimensional visualizable space. As topic modeling is a useful form of dimensionality reduction that preserves...

Full description

Saved in:

Bibliographic Details
Main Authors:	LE, Tuan M. V., LAUW, Hady W.
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2014
Subjects:	dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing
Online Access:	https://ink.library.smu.edu.sg/sis_research/2250 https://ink.library.smu.edu.sg/context/sis_research/article/3250/viewcontent/kdd14.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-3250
record_format	dspace
spelling	sg-smu-ink.sis_research-32502017-12-26T06:00:45Z Semantic Visualization for Spherical Representation LE, Tuan M. V. LAUW, Hady W. Visualization of high-dimensional data such as text documents is widely applicable. The traditional means is to find an appropriate embedding of the high-dimensional representation in a low-dimensional visualizable space. As topic modeling is a useful form of dimensionality reduction that preserves the semantics in documents, recent approaches aim for a visualization that is consistent with both the original word space, as well as the semantic topic space. In this paper, we address the semantic visualization problem. Given a corpus of documents, the objective is to simultaneously learn the topic distributions as well as the visualization coordinates of documents. We propose to develop a semantic visualization model that approximates L2-normalized data directly. The key is to associate each document with three representations: a coordinate in the visualization space, a multinomial distribution in the topic space, and a directional vector in a high-dimensional unit hypersphere in the word space. We join these representations in a unified generative model, and describe its parameter estimation through variational inference. Comprehensive experiments on real-life text datasets show that the proposed method outperforms the existing baselines on objective evaluation metrics for visualization quality and topic interpretability. 2014-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2250 info:doi/10.1145/2623330.2623620 https://ink.library.smu.edu.sg/context/sis_research/article/3250/viewcontent/kdd14.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing
spellingShingle	dimensionality reduction semantic visualization spherical semantic embedding spherical space generative model L2-normalized vector topic model Databases and Information Systems Numerical Analysis and Scientific Computing LE, Tuan M. V. LAUW, Hady W. Semantic Visualization for Spherical Representation
description	Visualization of high-dimensional data such as text documents is widely applicable. The traditional means is to find an appropriate embedding of the high-dimensional representation in a low-dimensional visualizable space. As topic modeling is a useful form of dimensionality reduction that preserves the semantics in documents, recent approaches aim for a visualization that is consistent with both the original word space, as well as the semantic topic space. In this paper, we address the semantic visualization problem. Given a corpus of documents, the objective is to simultaneously learn the topic distributions as well as the visualization coordinates of documents. We propose to develop a semantic visualization model that approximates L2-normalized data directly. The key is to associate each document with three representations: a coordinate in the visualization space, a multinomial distribution in the topic space, and a directional vector in a high-dimensional unit hypersphere in the word space. We join these representations in a unified generative model, and describe its parameter estimation through variational inference. Comprehensive experiments on real-life text datasets show that the proposed method outperforms the existing baselines on objective evaluation metrics for visualization quality and topic interpretability.
format	text
author	LE, Tuan M. V. LAUW, Hady W.
author_facet	LE, Tuan M. V. LAUW, Hady W.
author_sort	LE, Tuan M. V.
title	Semantic Visualization for Spherical Representation
title_short	Semantic Visualization for Spherical Representation
title_full	Semantic Visualization for Spherical Representation
title_fullStr	Semantic Visualization for Spherical Representation
title_full_unstemmed	Semantic Visualization for Spherical Representation
title_sort	semantic visualization for spherical representation
publisher	Institutional Knowledge at Singapore Management University
publishDate	2014
url	https://ink.library.smu.edu.sg/sis_research/2250 https://ink.library.smu.edu.sg/context/sis_research/article/3250/viewcontent/kdd14.pdf
_version_	1770571928368578560

Semantic Visualization for Spherical Representation

Similar Items