Evaluating bag-of-visual-words representations in scene classification

Based on keypoints extracted as salient image patches, an image can be described as a “bag of visual words” and this representation has been used in scene classification. The choice of dimension, selection, and weighting of visual words in this representation is crucial to the classification perform...

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Jun, JIANG, Yu-Gang, HAUPTMANN, Alexander G., NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6478
https://ink.library.smu.edu.sg/context/sis_research/article/7481/viewcontent/1290082.1290111.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7481
record_format dspace
spelling sg-smu-ink.sis_research-74812022-01-10T05:37:26Z Evaluating bag-of-visual-words representations in scene classification YANG, Jun JIANG, Yu-Gang HAUPTMANN, Alexander G. NGO, Chong-wah Based on keypoints extracted as salient image patches, an image can be described as a “bag of visual words” and this representation has been used in scene classification. The choice of dimension, selection, and weighting of visual words in this representation is crucial to the classification performance but has not been thoroughly studied in previous work. Given the analogy between this representation and the bag-of-words representation of text documents, we apply techniques used in text categorization, including term weighting, stop word removal, feature selection, to generate image representations that differ in the dimension, selection, and weighting of visual words. The impact of these representation choices to scene classification is studied through extensive experiments on the TRECVID and PASCAL collection. This study provides an empirical basis for designing visual-word representations that are likely to produce superior classification performance. 2007-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6478 info:doi/10.1145/1290082.1290111 https://ink.library.smu.edu.sg/context/sis_research/article/7481/viewcontent/1290082.1290111.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Bag-of-visual-words Keypoint Local interest point Scene classification Data Storage Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Bag-of-visual-words
Keypoint
Local interest point
Scene classification
Data Storage Systems
Graphics and Human Computer Interfaces
spellingShingle Bag-of-visual-words
Keypoint
Local interest point
Scene classification
Data Storage Systems
Graphics and Human Computer Interfaces
YANG, Jun
JIANG, Yu-Gang
HAUPTMANN, Alexander G.
NGO, Chong-wah
Evaluating bag-of-visual-words representations in scene classification
description Based on keypoints extracted as salient image patches, an image can be described as a “bag of visual words” and this representation has been used in scene classification. The choice of dimension, selection, and weighting of visual words in this representation is crucial to the classification performance but has not been thoroughly studied in previous work. Given the analogy between this representation and the bag-of-words representation of text documents, we apply techniques used in text categorization, including term weighting, stop word removal, feature selection, to generate image representations that differ in the dimension, selection, and weighting of visual words. The impact of these representation choices to scene classification is studied through extensive experiments on the TRECVID and PASCAL collection. This study provides an empirical basis for designing visual-word representations that are likely to produce superior classification performance.
format text
author YANG, Jun
JIANG, Yu-Gang
HAUPTMANN, Alexander G.
NGO, Chong-wah
author_facet YANG, Jun
JIANG, Yu-Gang
HAUPTMANN, Alexander G.
NGO, Chong-wah
author_sort YANG, Jun
title Evaluating bag-of-visual-words representations in scene classification
title_short Evaluating bag-of-visual-words representations in scene classification
title_full Evaluating bag-of-visual-words representations in scene classification
title_fullStr Evaluating bag-of-visual-words representations in scene classification
title_full_unstemmed Evaluating bag-of-visual-words representations in scene classification
title_sort evaluating bag-of-visual-words representations in scene classification
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/6478
https://ink.library.smu.edu.sg/context/sis_research/article/7481/viewcontent/1290082.1290111.pdf
_version_ 1770575972490280960