Word clouds with latent variable analysis for visual comparison of documents
Word cloud is a visualization form for text that is recognized for its aesthetic, social, and analytical values. Here, we are concerned with deepening its analytical value for visual comparison of documents. To aid comparative analysis of two or more documents, users need to be able to perceive simi...
Main Authors: | LE, Tuan M. V.; LAUW, Hady W. |
---|---|
Format: | text |
Language: | English |
Published: | Institutional Knowledge at Singapore Management University 2016 |
Subjects: | Databases and Information Systems; Numerical Analysis and Scientific Computing |
Online Access: | https://ink.library.smu.edu.sg/sis_research/3357 https://ink.library.smu.edu.sg/context/sis_research/article/4359/viewcontent/WordCloudsLatentVariable.pdf |
Institution: | Singapore Management University |
id |
sg-smu-ink.sis_research-4359 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-4359 2018-03-09T08:47:57Z Word clouds with latent variable analysis for visual comparison of documents LE, Tuan M. V. LAUW, Hady W. Word cloud is a visualization form for text that is recognized for its aesthetic, social, and analytical values. Here, we are concerned with deepening its analytical value for visual comparison of documents. To aid comparative analysis of two or more documents, users need to be able to perceive similarities and differences among documents through their word clouds. However, as we are dealing with text, approaches that treat words independently may impede accurate discernment of similarities among word clouds containing different words of related meanings. We therefore motivate the principle of displaying related words in a coherent manner, and propose to realize it through modeling the latent aspects of words. Our WORD FLOCK solution brings together latent variable analysis for embedding and aspect modeling, and a calibrated layout algorithm within a synchronized word cloud generation framework. We present quantitative and qualitative results on real-life text corpora, showcasing how the word clouds are useful in preserving the information content of documents so as to allow more accurate visual comparison of documents. 2016-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3357 https://ink.library.smu.edu.sg/context/sis_research/article/4359/viewcontent/WordCloudsLatentVariable.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Databases and Information Systems Numerical Analysis and Scientific Computing LE, Tuan M. V. LAUW, Hady W. Word clouds with latent variable analysis for visual comparison of documents |
description |
Word cloud is a visualization form for text that is recognized for its aesthetic, social, and analytical values. Here, we are concerned with deepening its analytical value for visual comparison of documents. To aid comparative analysis of two or more documents, users need to be able to perceive similarities and differences among documents through their word clouds. However, as we are dealing with text, approaches that treat words independently may impede accurate discernment of similarities among word clouds containing different words of related meanings. We therefore motivate the principle of displaying related words in a coherent manner, and propose to realize it through modeling the latent aspects of words. Our WORD FLOCK solution brings together latent variable analysis for embedding and aspect modeling, and a calibrated layout algorithm within a synchronized word cloud generation framework. We present quantitative and qualitative results on real-life text corpora, showcasing how the word clouds are useful in preserving the information content of documents so as to allow more accurate visual comparison of documents. |
format |
text |
author |
LE, Tuan M. V. LAUW, Hady W. |
author_facet |
LE, Tuan M. V. LAUW, Hady W. |
author_sort |
LE, Tuan M. V. |
title |
Word clouds with latent variable analysis for visual comparison of documents |
title_short |
Word clouds with latent variable analysis for visual comparison of documents |
title_full |
Word clouds with latent variable analysis for visual comparison of documents |
title_fullStr |
Word clouds with latent variable analysis for visual comparison of documents |
title_full_unstemmed |
Word clouds with latent variable analysis for visual comparison of documents |
title_sort |
word clouds with latent variable analysis for visual comparison of documents |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2016 |
url |
https://ink.library.smu.edu.sg/sis_research/3357 https://ink.library.smu.edu.sg/context/sis_research/article/4359/viewcontent/WordCloudsLatentVariable.pdf |
_version_ |
1770573121569423360 |