Scene recognition by semantic visual words
Main Authors: | Farahzadeh, Elahe; Cham, Tat Jen; Sluzek, Andrzej |
---|---|
Other Authors: | School of Computer Engineering, Nanyang Technological University; Centre for Computational Intelligence; Centre for Multimedia and Network Technology |
Format: | Journal Article |
Language: | English |
Published: | 2015 |
Subjects: | Scene recognition; Semantic vocabulary; Visual words |
Online Access: | https://hdl.handle.net/10356/80983 http://hdl.handle.net/10220/38975 |
ISSN: | 1863-1703 |
DOI: | 10.1007/s11760-014-0687-7 |
Institution: | Nanyang Technological University |
Citation: | Farahzadeh, E., Cham, T. J., & Sluzek, A. (2015). Scene recognition by semantic visual words. Signal, Image and Video Processing, 9(8), 1935-1944. |
Rights: | © 2014 Springer-Verlag London. This is the author-created (accepted) version of a work that has been peer reviewed and accepted for publication in Signal, Image and Video Processing, Springer-Verlag London. It incorporates the referees' comments, but changes resulting from the publishing process, such as copyediting and structural formatting, may not be reflected in this document. The published version is available at http://dx.doi.org/10.1007/s11760-014-0687-7. |
Description:

In this paper, we propose a novel approach to introducing semantic relations into the bag-of-words framework. We use latent semantic models, namely latent semantic analysis (LSA) and probabilistic latent semantic analysis (pLSA), to define semantically rich features and embed the visual features into a semantic space. In the LSA technique, the semantic features are derived from a low-rank approximation of the word–image occurrence matrix obtained by singular value decomposition. Similarly, in the pLSA approach, the topic-specific word distributions can be taken as the dimensions of a concept space. In the proposed space, the distances between words represent semantic distances, which are used to construct a discriminative and semantically meaningful vocabulary. Since position information is known to significantly improve scene recognition accuracy, we also incorporate position information into the proposed semantic vocabulary frameworks. We have tested our approach on the 15-Scene and MIT Indoor 67 datasets and achieved very promising results.
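As a rough illustration of the LSA variant described in the abstract, the following Python sketch embeds visual words into a semantic space via a truncated SVD of a word–image occurrence matrix, uses distances in that space to group words into a smaller semantic vocabulary, and pools per-image counts accordingly. This is not the authors' implementation; the matrix sizes, the rank k, the vocabulary size, and the use of plain k-means are illustrative assumptions.

```python
# Minimal sketch (not the paper's code): LSA-style semantic embedding of
# visual words and construction of a semantic vocabulary. All sizes and
# parameter values below are illustrative assumptions.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)

# Toy word-image occurrence matrix: rows = initial visual words,
# columns = training images, entries = occurrence counts.
num_words, num_images = 1000, 200
occurrence = rng.poisson(lam=0.3, size=(num_words, num_images)).astype(float)

# LSA step: rank-k approximation of the occurrence matrix via SVD.
# Each visual word is embedded as a k-dimensional semantic vector.
k = 50
U, S, Vt = np.linalg.svd(occurrence, full_matrices=False)
word_embeddings = U[:, :k] * S[:k]          # shape (num_words, k)

# Distances in this space act as semantic distances between visual words;
# clustering merges semantically close words into a smaller vocabulary
# (plain k-means here as a stand-in for the paper's vocabulary construction).
vocab_size = 200
semantic_word_labels = KMeans(n_clusters=vocab_size, n_init=10,
                              random_state=0).fit_predict(word_embeddings)

def semantic_histogram(word_counts, labels, vocab_size):
    """Pool an image's original word counts into the semantic vocabulary."""
    hist = np.zeros(vocab_size)
    np.add.at(hist, labels, word_counts)
    return hist / max(hist.sum(), 1.0)

example_hist = semantic_histogram(occurrence[:, 0], semantic_word_labels, vocab_size)
print(example_hist.shape)  # (200,)
```

A pLSA variant would replace the SVD step with topic-specific word distributions P(word | topic) as the coordinates of each word, and the position-augmented version described in the abstract would additionally encode where in the image each word occurs.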