Interpretable multimodal retrieval for fashion products

Deep learning methods have been successfully applied to fashion retrieval. However, the latent meaning of learned feature vectors hinders the explanation of retrieval results and integration of user feedback. Fortunately, there are many online shopping websites organizing fashion items into hierarch...

Full description

Saved in:
Bibliographic Details
Main Authors: LIAO, Lizi, HE, Xiangnan, ZHAO, Bo, NGO, Chong-Wah, CHUA, Tat-Seng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7677
https://ink.library.smu.edu.sg/context/sis_research/article/8680/viewcontent/Interpretable_Multimodal_Retrieval_for_Fashion_Products.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8680
record_format dspace
spelling sg-smu-ink.sis_research-86802023-01-10T03:37:22Z Interpretable multimodal retrieval for fashion products LIAO, Lizi HE, Xiangnan ZHAO, Bo NGO, Chong-Wah CHUA, Tat-Seng Deep learning methods have been successfully applied to fashion retrieval. However, the latent meaning of learned feature vectors hinders the explanation of retrieval results and integration of user feedback. Fortunately, there are many online shopping websites organizing fashion items into hierarchical structures based on product taxonomy and domain knowledge. Such structures help to reveal how human perceive the relatedness among fashion products. Nevertheless, incorporating structural knowledge for deep learning remains a challenging problem. This paper presents techniques for organizing and utilizing the fashion hierarchies in deep learning to facilitate the reasoning of search results and user intent. The novelty of our work originates from the development of an EI (Exclusive & Independent) tree that can cooperate with deep models for end-to-end multimodal learning. EI tree organizes the fashion concepts into multiple semantic levels and augments the tree structure with exclusive as well as independent constraints. It describes the different relationships among sibling concepts and guides the end-to-end learning of multi-level fashion semantics. From EI tree, we learn an explicit hierarchical similarity function to characterize the semantic similarities among fashion products. It facilitates the interpretable retrieval scheme that can integrate the concept-level feedback. Experiment results on two large fashion datasets show that the proposed approach can characterize the semantic similarities among fashion items accurately and capture user's search intent precisely, leading to more accurate search results as compared to the state-of-the-art methods. 2018-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7677 info:doi/10.1145/3240508.3240646 https://ink.library.smu.edu.sg/context/sis_research/article/8680/viewcontent/Interpretable_Multimodal_Retrieval_for_Fashion_Products.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Attribute manipulation EI tree Multimodal fashion retrieval Artificial Intelligence and Robotics Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Attribute manipulation
EI tree
Multimodal fashion retrieval
Artificial Intelligence and Robotics
Databases and Information Systems
spellingShingle Attribute manipulation
EI tree
Multimodal fashion retrieval
Artificial Intelligence and Robotics
Databases and Information Systems
LIAO, Lizi
HE, Xiangnan
ZHAO, Bo
NGO, Chong-Wah
CHUA, Tat-Seng
Interpretable multimodal retrieval for fashion products
description Deep learning methods have been successfully applied to fashion retrieval. However, the latent meaning of learned feature vectors hinders the explanation of retrieval results and integration of user feedback. Fortunately, there are many online shopping websites organizing fashion items into hierarchical structures based on product taxonomy and domain knowledge. Such structures help to reveal how human perceive the relatedness among fashion products. Nevertheless, incorporating structural knowledge for deep learning remains a challenging problem. This paper presents techniques for organizing and utilizing the fashion hierarchies in deep learning to facilitate the reasoning of search results and user intent. The novelty of our work originates from the development of an EI (Exclusive & Independent) tree that can cooperate with deep models for end-to-end multimodal learning. EI tree organizes the fashion concepts into multiple semantic levels and augments the tree structure with exclusive as well as independent constraints. It describes the different relationships among sibling concepts and guides the end-to-end learning of multi-level fashion semantics. From EI tree, we learn an explicit hierarchical similarity function to characterize the semantic similarities among fashion products. It facilitates the interpretable retrieval scheme that can integrate the concept-level feedback. Experiment results on two large fashion datasets show that the proposed approach can characterize the semantic similarities among fashion items accurately and capture user's search intent precisely, leading to more accurate search results as compared to the state-of-the-art methods.
format text
author LIAO, Lizi
HE, Xiangnan
ZHAO, Bo
NGO, Chong-Wah
CHUA, Tat-Seng
author_facet LIAO, Lizi
HE, Xiangnan
ZHAO, Bo
NGO, Chong-Wah
CHUA, Tat-Seng
author_sort LIAO, Lizi
title Interpretable multimodal retrieval for fashion products
title_short Interpretable multimodal retrieval for fashion products
title_full Interpretable multimodal retrieval for fashion products
title_fullStr Interpretable multimodal retrieval for fashion products
title_full_unstemmed Interpretable multimodal retrieval for fashion products
title_sort interpretable multimodal retrieval for fashion products
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/7677
https://ink.library.smu.edu.sg/context/sis_research/article/8680/viewcontent/Interpretable_Multimodal_Retrieval_for_Fashion_Products.pdf
_version_ 1770576412427681792