Visual typo correction by collocative optimization: A case study on merchandize images

Near-duplicate retrieval (NDR) in merchandize images is of great importance to a lot of online applications on e-Commerce websites. In those applications where the requirement of response time is critical, however, the conventional techniques developed for a general purpose NDR are limited, because...

全面介紹

Saved in:
書目詳細資料
Main Authors: WEI, Xiao-Yong, YANG, Zhen-Qun, NGO, Chong-wah, ZHANG, Wei
格式: text
語言:English
出版: Institutional Knowledge at Singapore Management University 2014
主題:
在線閱讀:https://ink.library.smu.edu.sg/sis_research/6353
https://ink.library.smu.edu.sg/context/sis_research/article/7356/viewcontent/tip14.pdf
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Singapore Management University
語言: English
id sg-smu-ink.sis_research-7356
record_format dspace
spelling sg-smu-ink.sis_research-73562021-11-23T03:58:38Z Visual typo correction by collocative optimization: A case study on merchandize images WEI, Xiao-Yong YANG, Zhen-Qun NGO, Chong-wah ZHANG, Wei Near-duplicate retrieval (NDR) in merchandize images is of great importance to a lot of online applications on e-Commerce websites. In those applications where the requirement of response time is critical, however, the conventional techniques developed for a general purpose NDR are limited, because expensive post-processing like spatial verification or hashing is usually employed to compromise the quantization errors among the visual words used for the images. In this paper, we argue that most of the errors are introduced because of the quantization process where the visual words are considered individually, which has ignored the contextual relations among words. We propose a "spelling or phrase correction" like process for NDR, which extends the concept of collocations to visual domain for modeling the contextual relations. Binary quadratic programming is used to enforce the contextual consistency of words selected for an image, so that the errors (typos) are eliminated and the quality of the quantization process is improved. The experimental results show that the proposed method can improve the efficiency of NDR by reducing vocabulary size by 1000% times, and under the scenario of merchandize image NDR, the expensive local interest point feature used in conventional approaches can be replaced by color-moment feature, which reduces the time cost by 9202% while maintaining comparable performance to the state-of-the-art methods. 2014-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6353 info:doi/10.1109/TIP.2013.2293427 https://ink.library.smu.edu.sg/context/sis_research/article/7356/viewcontent/tip14.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Near-duplicate retrieval visual word quantization binary quadratic programming Graphics and Human Computer Interfaces Programming Languages and Compilers
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Near-duplicate retrieval
visual word quantization
binary quadratic programming
Graphics and Human Computer Interfaces
Programming Languages and Compilers
spellingShingle Near-duplicate retrieval
visual word quantization
binary quadratic programming
Graphics and Human Computer Interfaces
Programming Languages and Compilers
WEI, Xiao-Yong
YANG, Zhen-Qun
NGO, Chong-wah
ZHANG, Wei
Visual typo correction by collocative optimization: A case study on merchandize images
description Near-duplicate retrieval (NDR) in merchandize images is of great importance to a lot of online applications on e-Commerce websites. In those applications where the requirement of response time is critical, however, the conventional techniques developed for a general purpose NDR are limited, because expensive post-processing like spatial verification or hashing is usually employed to compromise the quantization errors among the visual words used for the images. In this paper, we argue that most of the errors are introduced because of the quantization process where the visual words are considered individually, which has ignored the contextual relations among words. We propose a "spelling or phrase correction" like process for NDR, which extends the concept of collocations to visual domain for modeling the contextual relations. Binary quadratic programming is used to enforce the contextual consistency of words selected for an image, so that the errors (typos) are eliminated and the quality of the quantization process is improved. The experimental results show that the proposed method can improve the efficiency of NDR by reducing vocabulary size by 1000% times, and under the scenario of merchandize image NDR, the expensive local interest point feature used in conventional approaches can be replaced by color-moment feature, which reduces the time cost by 9202% while maintaining comparable performance to the state-of-the-art methods.
format text
author WEI, Xiao-Yong
YANG, Zhen-Qun
NGO, Chong-wah
ZHANG, Wei
author_facet WEI, Xiao-Yong
YANG, Zhen-Qun
NGO, Chong-wah
ZHANG, Wei
author_sort WEI, Xiao-Yong
title Visual typo correction by collocative optimization: A case study on merchandize images
title_short Visual typo correction by collocative optimization: A case study on merchandize images
title_full Visual typo correction by collocative optimization: A case study on merchandize images
title_fullStr Visual typo correction by collocative optimization: A case study on merchandize images
title_full_unstemmed Visual typo correction by collocative optimization: A case study on merchandize images
title_sort visual typo correction by collocative optimization: a case study on merchandize images
publisher Institutional Knowledge at Singapore Management University
publishDate 2014
url https://ink.library.smu.edu.sg/sis_research/6353
https://ink.library.smu.edu.sg/context/sis_research/article/7356/viewcontent/tip14.pdf
_version_ 1770575940248666112