Discriminative bag-of-visual phrase learning for landmark recognition

Bag-of-visual phrase (BoP) has been proposed and developed for landmark recognition recently. However, existing BoP methods for landmark recognition have two major shortcomings: (i) they try to construct a universal phrase vocabulary for all object categories, which lacks specific descriptive capabi...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Tao, Yap, Kim-Hui, Zhang, Dajiang
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Online Access:https://hdl.handle.net/10356/98521
http://hdl.handle.net/10220/13371
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-98521
record_format dspace
spelling sg-ntu-dr.10356-985212020-03-07T13:24:48Z Discriminative bag-of-visual phrase learning for landmark recognition Chen, Tao Yap, Kim-Hui Zhang, Dajiang School of Electrical and Electronic Engineering IEEE International Conference on Acoustics, Speech and Signal Processing (2012 : Kyoto, Japan) Bag-of-visual phrase (BoP) has been proposed and developed for landmark recognition recently. However, existing BoP methods for landmark recognition have two major shortcomings: (i) they try to construct a universal phrase vocabulary for all object categories, which lacks specific descriptive capabilities for a particular category, and (ii) they often adopt simple criterion such as the frequency information to mine the visual phrases, which may cause the selected phrases to be less discriminative or representative for recognition. In view of this, this paper proposes a new discriminative BoP approach for landmark recognition. First, the candidate visual phrases defined as adjacent pairwise words are selected for each category. A phrase-level similarity measure at the latent space is proposed to evaluate the semantic similarity between pairwise phrases. This is then integrated with the phrase frequency information to shortlist the discriminative phrases for each category through a proposed phrase ranking algorithm. Finally, the BoP and bag-of-words (BoW) histograms are combined through a pyramid matching method for recognition. Experimental results on two different datasets demonstrate that the proposed method is effective in landmark recognition. 2013-09-06T08:20:47Z 2019-12-06T19:56:27Z 2013-09-06T08:20:47Z 2019-12-06T19:56:27Z 2012 2012 Conference Paper Chen, T., Yap, K. H., & Zhang, D. (2012). Discriminative bag-of-visual phrase learning for landmark recognition. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.893-896. https://hdl.handle.net/10356/98521 http://hdl.handle.net/10220/13371 10.1109/ICASSP.2012.6288028 en © 2012 IEEE.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
description Bag-of-visual phrase (BoP) has been proposed and developed for landmark recognition recently. However, existing BoP methods for landmark recognition have two major shortcomings: (i) they try to construct a universal phrase vocabulary for all object categories, which lacks specific descriptive capabilities for a particular category, and (ii) they often adopt simple criterion such as the frequency information to mine the visual phrases, which may cause the selected phrases to be less discriminative or representative for recognition. In view of this, this paper proposes a new discriminative BoP approach for landmark recognition. First, the candidate visual phrases defined as adjacent pairwise words are selected for each category. A phrase-level similarity measure at the latent space is proposed to evaluate the semantic similarity between pairwise phrases. This is then integrated with the phrase frequency information to shortlist the discriminative phrases for each category through a proposed phrase ranking algorithm. Finally, the BoP and bag-of-words (BoW) histograms are combined through a pyramid matching method for recognition. Experimental results on two different datasets demonstrate that the proposed method is effective in landmark recognition.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Chen, Tao
Yap, Kim-Hui
Zhang, Dajiang
format Conference or Workshop Item
author Chen, Tao
Yap, Kim-Hui
Zhang, Dajiang
spellingShingle Chen, Tao
Yap, Kim-Hui
Zhang, Dajiang
Discriminative bag-of-visual phrase learning for landmark recognition
author_sort Chen, Tao
title Discriminative bag-of-visual phrase learning for landmark recognition
title_short Discriminative bag-of-visual phrase learning for landmark recognition
title_full Discriminative bag-of-visual phrase learning for landmark recognition
title_fullStr Discriminative bag-of-visual phrase learning for landmark recognition
title_full_unstemmed Discriminative bag-of-visual phrase learning for landmark recognition
title_sort discriminative bag-of-visual phrase learning for landmark recognition
publishDate 2013
url https://hdl.handle.net/10356/98521
http://hdl.handle.net/10220/13371
_version_ 1681037018995359744