Online multi-modal distance metric learning with application to image retrieval

Distance metric learning (DML) is an important technique to improve similarity search in content-based image retrieval. Despite being studied extensively, most existing DML approaches typically adopt a single-modal learning framework that learns the distance metric on either a single feature type or...

Full description

Saved in:
Bibliographic Details
Main Authors: WU, Pengcheng, HOI, Steven C. H., ZHAO, Peilin, MIAO, Chunyan, LIU, Zhi-Yong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2924
https://ink.library.smu.edu.sg/context/sis_research/article/3924/viewcontent/OMDML_TKDE_afv.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3924
record_format dspace
spelling sg-smu-ink.sis_research-39242020-04-01T02:47:29Z Online multi-modal distance metric learning with application to image retrieval WU, Pengcheng HOI, Steven C. H. ZHAO, Peilin MIAO, Chunyan LIU, Zhi-Yong Distance metric learning (DML) is an important technique to improve similarity search in content-based image retrieval. Despite being studied extensively, most existing DML approaches typically adopt a single-modal learning framework that learns the distance metric on either a single feature type or a combined feature space where multiple types of features are simply concatenated. Such single-modal DML methods suffer from some critical limitations: (i) some type of features may significantly dominate the others in the DML task due to diverse feature representations; and (ii) learning a distance metric on the combined high-dimensional feature space can be extremely time-consuming using the naive feature concatenation approach. To address these limitations, in this paper, we investigate a novel scheme of online multi-modal distance metric learning (OMDML), which explores a unified two-level online learning scheme: (i) it learns to optimize a distance metric on each individual feature space; and (ii) then it learns to find the optimal combination of diverse types of features. To further reduce the expensive cost of DML on high-dimensional feature space, we propose a low-rank OMDML algorithm which not only significantly reduces the computational cost but also retains highly competing or even better learning accuracy. We conduct extensive experiments to evaluate the performance of the proposed algorithms for multi-modal image retrieval, in which encouraging results validate the effectiveness of the proposed technique. 2016-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2924 info:doi/10.1109/TKDE.2015.2477296 https://ink.library.smu.edu.sg/context/sis_research/article/3924/viewcontent/OMDML_TKDE_afv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University content-based image retrieval multi-modal retrieval distance metric learning online learning Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic content-based image retrieval
multi-modal retrieval
distance metric learning
online learning
Databases and Information Systems
spellingShingle content-based image retrieval
multi-modal retrieval
distance metric learning
online learning
Databases and Information Systems
WU, Pengcheng
HOI, Steven C. H.
ZHAO, Peilin
MIAO, Chunyan
LIU, Zhi-Yong
Online multi-modal distance metric learning with application to image retrieval
description Distance metric learning (DML) is an important technique to improve similarity search in content-based image retrieval. Despite being studied extensively, most existing DML approaches typically adopt a single-modal learning framework that learns the distance metric on either a single feature type or a combined feature space where multiple types of features are simply concatenated. Such single-modal DML methods suffer from some critical limitations: (i) some type of features may significantly dominate the others in the DML task due to diverse feature representations; and (ii) learning a distance metric on the combined high-dimensional feature space can be extremely time-consuming using the naive feature concatenation approach. To address these limitations, in this paper, we investigate a novel scheme of online multi-modal distance metric learning (OMDML), which explores a unified two-level online learning scheme: (i) it learns to optimize a distance metric on each individual feature space; and (ii) then it learns to find the optimal combination of diverse types of features. To further reduce the expensive cost of DML on high-dimensional feature space, we propose a low-rank OMDML algorithm which not only significantly reduces the computational cost but also retains highly competing or even better learning accuracy. We conduct extensive experiments to evaluate the performance of the proposed algorithms for multi-modal image retrieval, in which encouraging results validate the effectiveness of the proposed technique.
format text
author WU, Pengcheng
HOI, Steven C. H.
ZHAO, Peilin
MIAO, Chunyan
LIU, Zhi-Yong
author_facet WU, Pengcheng
HOI, Steven C. H.
ZHAO, Peilin
MIAO, Chunyan
LIU, Zhi-Yong
author_sort WU, Pengcheng
title Online multi-modal distance metric learning with application to image retrieval
title_short Online multi-modal distance metric learning with application to image retrieval
title_full Online multi-modal distance metric learning with application to image retrieval
title_fullStr Online multi-modal distance metric learning with application to image retrieval
title_full_unstemmed Online multi-modal distance metric learning with application to image retrieval
title_sort online multi-modal distance metric learning with application to image retrieval
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/2924
https://ink.library.smu.edu.sg/context/sis_research/article/3924/viewcontent/OMDML_TKDE_afv.pdf
_version_ 1770572739319431168