Semi-supervised semantic visualization for networked documents

Semantic interpretability and visual expressivity are important objectives in exploratory analysis of text. On the one hand, while some documents may have explicit categories, we could develop a better understanding of a corpus by studying its finer-grained structures, which may be latent. By inferr...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHANG, Delvin Ce, LAUW, Hady W.
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2021
Subjects:	Dimensionality reduction Generative models Semantic visualization Topic modeling Databases and Information Systems Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/6428 https://ink.library.smu.edu.sg/context/sis_research/article/7431/viewcontent/ecmlpkdd21b.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-7431
record_format	dspace
spelling	sg-smu-ink.sis_research-74312021-12-14T04:52:11Z Semi-supervised semantic visualization for networked documents ZHANG, Delvin Ce LAUW, Hady W. Semantic interpretability and visual expressivity are important objectives in exploratory analysis of text. On the one hand, while some documents may have explicit categories, we could develop a better understanding of a corpus by studying its finer-grained structures, which may be latent. By inferring latent topics and discovering keywords associated with each topic, one obtains a semantic interpretation of the corpus. One the other hand, by visualizing documents, latent topics, and category labels on the same plot, one gains a bird’s eye view of the relationships among documents, topics, and various categories. Semantic visualization is a class of methods that unify both topic modeling and visualization. In this paper, we propose a novel semantic visualization model for networked documents that incorporates partial labels. We introduce coordinate-based label distribution and label-dependent topic distribution to visualize documents, topics, and labels in a semi-supervised way. We further derive three variants for singly-labeled, multi-labeled, and hierarchically-labeled documents. The focus on semi-supervision that employs variants of labeling structures is particularly novel. Experiments verify the efficacy of our model against baselines. 2021-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6428 info:doi/10.1007/978-3-030-86523-8_46 https://ink.library.smu.edu.sg/context/sis_research/article/7431/viewcontent/ecmlpkdd21b.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Dimensionality reduction Generative models Semantic visualization Topic modeling Databases and Information Systems Graphics and Human Computer Interfaces
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Dimensionality reduction Generative models Semantic visualization Topic modeling Databases and Information Systems Graphics and Human Computer Interfaces
spellingShingle	Dimensionality reduction Generative models Semantic visualization Topic modeling Databases and Information Systems Graphics and Human Computer Interfaces ZHANG, Delvin Ce LAUW, Hady W. Semi-supervised semantic visualization for networked documents
description	Semantic interpretability and visual expressivity are important objectives in exploratory analysis of text. On the one hand, while some documents may have explicit categories, we could develop a better understanding of a corpus by studying its finer-grained structures, which may be latent. By inferring latent topics and discovering keywords associated with each topic, one obtains a semantic interpretation of the corpus. One the other hand, by visualizing documents, latent topics, and category labels on the same plot, one gains a bird’s eye view of the relationships among documents, topics, and various categories. Semantic visualization is a class of methods that unify both topic modeling and visualization. In this paper, we propose a novel semantic visualization model for networked documents that incorporates partial labels. We introduce coordinate-based label distribution and label-dependent topic distribution to visualize documents, topics, and labels in a semi-supervised way. We further derive three variants for singly-labeled, multi-labeled, and hierarchically-labeled documents. The focus on semi-supervision that employs variants of labeling structures is particularly novel. Experiments verify the efficacy of our model against baselines.
format	text
author	ZHANG, Delvin Ce LAUW, Hady W.
author_facet	ZHANG, Delvin Ce LAUW, Hady W.
author_sort	ZHANG, Delvin Ce
title	Semi-supervised semantic visualization for networked documents
title_short	Semi-supervised semantic visualization for networked documents
title_full	Semi-supervised semantic visualization for networked documents
title_fullStr	Semi-supervised semantic visualization for networked documents
title_full_unstemmed	Semi-supervised semantic visualization for networked documents
title_sort	semi-supervised semantic visualization for networked documents
publisher	Institutional Knowledge at Singapore Management University
publishDate	2021
url	https://ink.library.smu.edu.sg/sis_research/6428 https://ink.library.smu.edu.sg/context/sis_research/article/7431/viewcontent/ecmlpkdd21b.pdf
_version_	1770575958746595328

Semi-supervised semantic visualization for networked documents

Similar Items