Representations of keypoint-based semantic concept detection: A comprehensive study
Based on the local keypoints extracted as salient image patches, an image can be described as a "bag-of-visual-words (BoW)" and this representation has appeared promising for object and scene classification. The performance of BoW features in semantic concept detection for large-scale mult...
Main Authors: | JIANG, Yu-Gang; YANG, Jun; NGO, Chong-wah |
---|---|
Format: | text |
Language: | English |
Published: | Institutional Knowledge at Singapore Management University, 2010 |
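Subjects: | Bag-of-visual-words; representation choice; semantic concept detection; Graphics and Human Computer Interfaces |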
Online Access: | https://ink.library.smu.edu.sg/sis_research/6339 https://ink.library.smu.edu.sg/context/sis_research/article/7342/viewcontent/Representations_of_Keypoint_Based_Semantic_Concept_Detection__A_Comprehensive_Study.pdf |
Institution: | Singapore Management University |
id |
sg-smu-ink.sis_research-7342 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-7342 2021-11-23T04:30:17Z Representations of keypoint-based semantic concept detection: A comprehensive study JIANG, Yu-Gang YANG, Jun NGO, Chong-wah Based on the local keypoints extracted as salient image patches, an image can be described as a "bag-of-visual-words" (BoW), and this representation has appeared promising for object and scene classification. The performance of BoW features in semantic concept detection for large-scale multimedia databases is subject to various representation choices. In this paper, we conduct a comprehensive study on the representation choices of BoW, including vocabulary size, weighting scheme, stop word removal, feature selection, spatial information, and visual bi-gram. We offer practical insights into how to optimize the performance of BoW by making appropriate representation choices. For the weighting scheme, we elaborate a soft-weighting method to assess the significance of a visual word to an image. We experimentally show that soft-weighting outperforms other popular weighting schemes such as TF-IDF by a large margin. Our extensive experiments on TRECVID data sets also indicate that the BoW feature alone, with appropriate representation choices, already produces highly competitive concept detection performance. Based on our empirical findings, we further apply our method to detect a large set of 374 semantic concepts. The detectors, as well as the features and detection scores on several recent benchmark data sets, are released to the multimedia community. 
2010-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6339 info:doi/10.1109/TMM.2009.2036235 https://ink.library.smu.edu.sg/context/sis_research/article/7342/viewcontent/Representations_of_Keypoint_Based_Semantic_Concept_Detection__A_Comprehensive_Study.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Bag-of-visual-words representation choice semantic concept detection Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Bag-of-visual-words; representation choice; semantic concept detection; Graphics and Human Computer Interfaces |
description |
Based on the local keypoints extracted as salient image patches, an image can be described as a "bag-of-visual-words" (BoW), and this representation has appeared promising for object and scene classification. The performance of BoW features in semantic concept detection for large-scale multimedia databases is subject to various representation choices. In this paper, we conduct a comprehensive study on the representation choices of BoW, including vocabulary size, weighting scheme, stop word removal, feature selection, spatial information, and visual bi-gram. We offer practical insights into how to optimize the performance of BoW by making appropriate representation choices. For the weighting scheme, we elaborate a soft-weighting method to assess the significance of a visual word to an image. We experimentally show that soft-weighting outperforms other popular weighting schemes such as TF-IDF by a large margin. Our extensive experiments on TRECVID data sets also indicate that the BoW feature alone, with appropriate representation choices, already produces highly competitive concept detection performance. Based on our empirical findings, we further apply our method to detect a large set of 374 semantic concepts. The detectors, as well as the features and detection scores on several recent benchmark data sets, are released to the multimedia community. |
format |
text |
author |
JIANG, Yu-Gang; YANG, Jun; NGO, Chong-wah |
author_sort |
JIANG, Yu-Gang |
title |
Representations of keypoint-based semantic concept detection: A comprehensive study |
title_sort |
representations of keypoint-based semantic concept detection: a comprehensive study |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2010 |
url |
https://ink.library.smu.edu.sg/sis_research/6339 https://ink.library.smu.edu.sg/context/sis_research/article/7342/viewcontent/Representations_of_Keypoint_Based_Semantic_Concept_Detection__A_Comprehensive_Study.pdf |
_version_ |
1770575937703772160 |
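
The description above contrasts soft-weighting with conventional term-frequency style weighting of visual words, but this record does not give the formula. The following is a minimal sketch of the general idea only, not the authors' exact method: `hard_bow` lets each keypoint vote for its single nearest visual word, while `soft_bow` spreads the vote over the `top_n` nearest words. The function names, the toy 2-D "descriptors", the halving of the weight per rank, and the similarity `1 / (1 + distance)` are all assumptions made for illustration.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hard_bow(descriptors, vocabulary):
    """Histogram where each keypoint votes only for its nearest visual word."""
    hist = [0.0] * len(vocabulary)
    for d in descriptors:
        nearest = min(
            range(len(vocabulary)),
            key=lambda k: squared_distance(d, vocabulary[k]),
        )
        hist[nearest] += 1.0
    return hist


def soft_bow(descriptors, vocabulary, top_n=2):
    """Histogram where each keypoint votes for its top_n nearest words.

    Assumption for illustration: the vote is a similarity score halved
    for every step down the nearest-neighbour ranking.
    """
    hist = [0.0] * len(vocabulary)
    for d in descriptors:
        ranked = sorted(
            range(len(vocabulary)),
            key=lambda k: squared_distance(d, vocabulary[k]),
        )
        for rank, k in enumerate(ranked[:top_n]):
            distance = math.sqrt(squared_distance(d, vocabulary[k]))
            similarity = 1.0 / (1.0 + distance)  # hypothetical similarity
            hist[k] += similarity / (2 ** rank)
    return hist


# Toy data: three visual words and three keypoint descriptors in 2-D.
vocabulary = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 0.0), (0.6, 0.0)]

hard = hard_bow(descriptors, vocabulary)  # [1.0, 2.0, 0.0]
soft = soft_bow(descriptors, vocabulary, top_n=2)
```

In the toy data the third descriptor lies between the first two words; the hard histogram gives all of its vote to one word, whereas the soft histogram shares it between both, which is the kind of quantization ambiguity a soft assignment is meant to reduce.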