A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text

The main challenge of a search engine is to find information that are relevant and appropriate. However, this can become difficult when queries are issued using ambiguous words. Rijsbergen first hypothesized a clustering approach for web pages wherein closely associated pages are treated as a semant...

Full description

Saved in:

Bibliographic Details
Main Authors:	WAN, Kong-Wah, TAN, Ah-hwee, LIM, Joo-Hwee, CHIA, Liang-Tien
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2012
Subjects:	Hierarchical Dirichlet Process Non-parametric models Image clustering Sense disambiguation Databases and Information Systems Numerical Analysis and Scientific Computing
Online Access:	https://ink.library.smu.edu.sg/sis_research/5204 https://ink.library.smu.edu.sg/context/sis_research/article/6207/viewcontent/Wan2012_Article_ANon_parametricVisual_senseMod.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-6207
record_format	dspace
spelling	sg-smu-ink.sis_research-62072020-07-23T18:41:16Z A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text WAN, Kong-Wah TAN, Ah-hwee LIM, Joo-Hwee CHIA, Liang-Tien The main challenge of a search engine is to find information that are relevant and appropriate. However, this can become difficult when queries are issued using ambiguous words. Rijsbergen first hypothesized a clustering approach for web pages wherein closely associated pages are treated as a semantic group with the same relevance to the query (Rijsbergen 1979). In this paper, we extend Rijsbergen’s cluster hypothesis to multimedia content such as images. Given a user query, the polysemy in the return image set is related to the many possible meanings of the query. We develop a method to cluster the polysemous images into their semantic categories. The resulting clusters can be seen as the visual senses of the query, which collectively embody the visual interpretations of the query. At the heart of our method is a non-parametric Bayesian approach that exploits the complementary text and visual information of images for semantic clustering. Latent structures of polysemous images are mined using the Hierarchical Dirichlet Process (HDP). HDP is a non-parametric Bayesian model that represents images using a mixture of components. The main advantage of our model is that the number of mixture components is not fixed a priori, but is determined during the posterior inference process. This allows our model to grow with the level of polysemy (and visual diversity) of images. The same set of components is used to model all images, with only the mixture weights varying amongst images. Evaluation results on a large collection of web images show the efficacy of our approach. 2012-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5204 info:doi/10.1007/s11042-010-0615-y https://ink.library.smu.edu.sg/context/sis_research/article/6207/viewcontent/Wan2012_Article_ANon_parametricVisual_senseMod.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Hierarchical Dirichlet Process Non-parametric models Image clustering Sense disambiguation Databases and Information Systems Numerical Analysis and Scientific Computing
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Hierarchical Dirichlet Process Non-parametric models Image clustering Sense disambiguation Databases and Information Systems Numerical Analysis and Scientific Computing
spellingShingle	Hierarchical Dirichlet Process Non-parametric models Image clustering Sense disambiguation Databases and Information Systems Numerical Analysis and Scientific Computing WAN, Kong-Wah TAN, Ah-hwee LIM, Joo-Hwee CHIA, Liang-Tien A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text
description	The main challenge of a search engine is to find information that are relevant and appropriate. However, this can become difficult when queries are issued using ambiguous words. Rijsbergen first hypothesized a clustering approach for web pages wherein closely associated pages are treated as a semantic group with the same relevance to the query (Rijsbergen 1979). In this paper, we extend Rijsbergen’s cluster hypothesis to multimedia content such as images. Given a user query, the polysemy in the return image set is related to the many possible meanings of the query. We develop a method to cluster the polysemous images into their semantic categories. The resulting clusters can be seen as the visual senses of the query, which collectively embody the visual interpretations of the query. At the heart of our method is a non-parametric Bayesian approach that exploits the complementary text and visual information of images for semantic clustering. Latent structures of polysemous images are mined using the Hierarchical Dirichlet Process (HDP). HDP is a non-parametric Bayesian model that represents images using a mixture of components. The main advantage of our model is that the number of mixture components is not fixed a priori, but is determined during the posterior inference process. This allows our model to grow with the level of polysemy (and visual diversity) of images. The same set of components is used to model all images, with only the mixture weights varying amongst images. Evaluation results on a large collection of web images show the efficacy of our approach.
format	text
author	WAN, Kong-Wah TAN, Ah-hwee LIM, Joo-Hwee CHIA, Liang-Tien
author_facet	WAN, Kong-Wah TAN, Ah-hwee LIM, Joo-Hwee CHIA, Liang-Tien
author_sort	WAN, Kong-Wah
title	A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text
title_short	A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text
title_full	A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text
title_fullStr	A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text
title_full_unstemmed	A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text
title_sort	non-parametric visual-sense model of images: extending the cluster hypothesis beyond text
publisher	Institutional Knowledge at Singapore Management University
publishDate	2012
url	https://ink.library.smu.edu.sg/sis_research/5204 https://ink.library.smu.edu.sg/context/sis_research/article/6207/viewcontent/Wan2012_Article_ANon_parametricVisual_senseMod.pdf
_version_	1770575331600629760

A non-parametric visual-sense model of images: Extending the cluster hypothesis beyond text

Similar Items