Image search by graph-based label propagation with image representation from DNN

Our objective is to estimate the relevance of an image to a query for image search purposes. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of bridging the gap between semantic textual queries as well as users’ search intents and...

Full description

Saved in:
Bibliographic Details
Main Authors: PAN, Yingwei, TING, Yao, YANG, Kuiyuan, LI, Houqiang, NGO, Chong-wah, WANG, Jingdong, MEI, Tao
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2013
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6459
https://ink.library.smu.edu.sg/context/sis_research/article/7462/viewcontent/2502081.2508128.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7462
record_format dspace
spelling sg-smu-ink.sis_research-74622022-01-10T06:09:32Z Image search by graph-based label propagation with image representation from DNN PAN, Yingwei TING, Yao YANG, Kuiyuan LI, Houqiang NGO, Chong-wah WANG, Jingdong MEI, Tao Our objective is to estimate the relevance of an image to a query for image search purposes. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of bridging the gap between semantic textual queries as well as users’ search intents and image visual content. Image search engines therefore primarily rely on static and textual features. Visual features are mainly used to identify potentially useful recurrent patterns or relevant training examples for complementing search by image reranking. Second, image rankers are trained on query-image pairs labeled by human experts, making the annotation intellectually expensive and timeconsuming. Furthermore, the labels may be subjective when the queries are ambiguous, resulting in difficulty in predicting the search intention. We demonstrate that the aforementioned two problems can be mitigated by exploring the use of click-through data, which can be viewed as the footprints of user searching behavior, as an effective means of understanding query. The correspondences between an image and a query are determined by whether the image was searched and clicked by users under the query in a commercial image search engine. We therefore hypothesize that the image click counts in response to a query are as their relevance indications. For each new image, our proposed graph-based label propagation algorithm employs neighborhood graph search to find the nearest neighbors on an image similarity graph built up with visual representations from deep neural networks and further aggregates their clicked queries/click counts to get the labels of the new image. We conduct experiments on MSR-Bing Grand Challenge and the results show consistent performance gain over various baselines. In addition, the proposed approach is very efficient, completing annotation of each query-image pair within just 15 milliseconds on a regular PC. 2013-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6459 info:doi/10.1145/2502081.2508128 https://ink.library.smu.edu.sg/context/sis_research/article/7462/viewcontent/2502081.2508128.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Click-through data Deep neural networks Image search Neighborhood graph search Databases and Information Systems Data Storage Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Click-through data
Deep neural networks
Image search
Neighborhood graph search
Databases and Information Systems
Data Storage Systems
Graphics and Human Computer Interfaces
spellingShingle Click-through data
Deep neural networks
Image search
Neighborhood graph search
Databases and Information Systems
Data Storage Systems
Graphics and Human Computer Interfaces
PAN, Yingwei
TING, Yao
YANG, Kuiyuan
LI, Houqiang
NGO, Chong-wah
WANG, Jingdong
MEI, Tao
Image search by graph-based label propagation with image representation from DNN
description Our objective is to estimate the relevance of an image to a query for image search purposes. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of bridging the gap between semantic textual queries as well as users’ search intents and image visual content. Image search engines therefore primarily rely on static and textual features. Visual features are mainly used to identify potentially useful recurrent patterns or relevant training examples for complementing search by image reranking. Second, image rankers are trained on query-image pairs labeled by human experts, making the annotation intellectually expensive and timeconsuming. Furthermore, the labels may be subjective when the queries are ambiguous, resulting in difficulty in predicting the search intention. We demonstrate that the aforementioned two problems can be mitigated by exploring the use of click-through data, which can be viewed as the footprints of user searching behavior, as an effective means of understanding query. The correspondences between an image and a query are determined by whether the image was searched and clicked by users under the query in a commercial image search engine. We therefore hypothesize that the image click counts in response to a query are as their relevance indications. For each new image, our proposed graph-based label propagation algorithm employs neighborhood graph search to find the nearest neighbors on an image similarity graph built up with visual representations from deep neural networks and further aggregates their clicked queries/click counts to get the labels of the new image. We conduct experiments on MSR-Bing Grand Challenge and the results show consistent performance gain over various baselines. In addition, the proposed approach is very efficient, completing annotation of each query-image pair within just 15 milliseconds on a regular PC.
format text
author PAN, Yingwei
TING, Yao
YANG, Kuiyuan
LI, Houqiang
NGO, Chong-wah
WANG, Jingdong
MEI, Tao
author_facet PAN, Yingwei
TING, Yao
YANG, Kuiyuan
LI, Houqiang
NGO, Chong-wah
WANG, Jingdong
MEI, Tao
author_sort PAN, Yingwei
title Image search by graph-based label propagation with image representation from DNN
title_short Image search by graph-based label propagation with image representation from DNN
title_full Image search by graph-based label propagation with image representation from DNN
title_fullStr Image search by graph-based label propagation with image representation from DNN
title_full_unstemmed Image search by graph-based label propagation with image representation from DNN
title_sort image search by graph-based label propagation with image representation from dnn
publisher Institutional Knowledge at Singapore Management University
publishDate 2013
url https://ink.library.smu.edu.sg/sis_research/6459
https://ink.library.smu.edu.sg/context/sis_research/article/7462/viewcontent/2502081.2508128.pdf
_version_ 1770575964438265856