Cross-lingual adaptation for recipe retrieval with mixup

Cross-modal recipe retrieval has attracted research attention in recent years, thanks to the availability of large-scale paired data for training. Nevertheless, obtaining adequate recipe-image pairs covering the majority of cuisines for supervised learning is difficult if not impossible. By transfer...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHU, Bin, NGO, Chong-Wah, CHEN, Jingjing, CHAN, Wing-Kwong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7502
https://ink.library.smu.edu.sg/context/sis_research/article/8505/viewcontent/3512527.3531375.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8505
record_format dspace
spelling sg-smu-ink.sis_research-85052023-08-24T08:54:13Z Cross-lingual adaptation for recipe retrieval with mixup ZHU, Bin NGO, Chong-Wah CHEN, Jingjing CHAN, Wing-Kwong Cross-modal recipe retrieval has attracted research attention in recent years, thanks to the availability of large-scale paired data for training. Nevertheless, obtaining adequate recipe-image pairs covering the majority of cuisines for supervised learning is difficult if not impossible. By transferring knowledge learnt from a data-rich cuisine to a data-scarce cuisine, domain adaptation sheds light on this practical problem. Nevertheless, existing works assume recipes in source and target domains are mostly originated from the same cuisine and written in the same language. This paper studies unsupervised domain adaptation for image-to-recipe retrieval, where recipes in source and target domains are in different languages. Moreover, only recipes are available for training in the target domain. A novel recipe mixup method is proposed to learn transferable embedding features between the two domains. Specifically, recipe mixup produces mixed recipes to form an intermediate domain by discretely exchanging the section(s) between source and target recipes. To bridge the domain gap, recipe mixup loss is proposed to enforce the intermediate domain to locate in the shortest geodesic path between source and target domains in the recipe embedding space. By using Recipe 1M dataset as source domain (English) and Vireo-FoodTransfer dataset as target domain (Chinese), empirical experiments verify the effectiveness of recipe mixup for cross-lingual adaptation in the context of image-to-recipe retrieval. 2022-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7502 info:doi/10.1145/3512527.3531375 https://ink.library.smu.edu.sg/context/sis_research/article/8505/viewcontent/3512527.3531375.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University recipe retrieval mixup cross-lingual domain adaptation Artificial Intelligence and Robotics Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic recipe retrieval
mixup
cross-lingual
domain adaptation
Artificial Intelligence and Robotics
Graphics and Human Computer Interfaces
spellingShingle recipe retrieval
mixup
cross-lingual
domain adaptation
Artificial Intelligence and Robotics
Graphics and Human Computer Interfaces
ZHU, Bin
NGO, Chong-Wah
CHEN, Jingjing
CHAN, Wing-Kwong
Cross-lingual adaptation for recipe retrieval with mixup
description Cross-modal recipe retrieval has attracted research attention in recent years, thanks to the availability of large-scale paired data for training. Nevertheless, obtaining adequate recipe-image pairs covering the majority of cuisines for supervised learning is difficult if not impossible. By transferring knowledge learnt from a data-rich cuisine to a data-scarce cuisine, domain adaptation sheds light on this practical problem. Nevertheless, existing works assume recipes in source and target domains are mostly originated from the same cuisine and written in the same language. This paper studies unsupervised domain adaptation for image-to-recipe retrieval, where recipes in source and target domains are in different languages. Moreover, only recipes are available for training in the target domain. A novel recipe mixup method is proposed to learn transferable embedding features between the two domains. Specifically, recipe mixup produces mixed recipes to form an intermediate domain by discretely exchanging the section(s) between source and target recipes. To bridge the domain gap, recipe mixup loss is proposed to enforce the intermediate domain to locate in the shortest geodesic path between source and target domains in the recipe embedding space. By using Recipe 1M dataset as source domain (English) and Vireo-FoodTransfer dataset as target domain (Chinese), empirical experiments verify the effectiveness of recipe mixup for cross-lingual adaptation in the context of image-to-recipe retrieval.
format text
author ZHU, Bin
NGO, Chong-Wah
CHEN, Jingjing
CHAN, Wing-Kwong
author_facet ZHU, Bin
NGO, Chong-Wah
CHEN, Jingjing
CHAN, Wing-Kwong
author_sort ZHU, Bin
title Cross-lingual adaptation for recipe retrieval with mixup
title_short Cross-lingual adaptation for recipe retrieval with mixup
title_full Cross-lingual adaptation for recipe retrieval with mixup
title_fullStr Cross-lingual adaptation for recipe retrieval with mixup
title_full_unstemmed Cross-lingual adaptation for recipe retrieval with mixup
title_sort cross-lingual adaptation for recipe retrieval with mixup
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7502
https://ink.library.smu.edu.sg/context/sis_research/article/8505/viewcontent/3512527.3531375.pdf
_version_ 1779156953505726464