Structure-aware generation network for recipe generation from images
Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/150963 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-150963 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1509632021-06-08T06:38:27Z Structure-aware generation network for recipe generation from images Wang, Hao Lin, Guosheng Hoi, Steven C. H. Miao, Chunyan School of Computer Science and Engineering 2020 European Conference on Computer Vision (ECCV’20) Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly (LILY) Engineering::Computer science and engineering Structure Learning Text Generation Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task. However, compared with image captioning datasets, the target recipes are long-length paragraphs and do not have annotations on structure information. To address the above limitations, we propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the inferred tree structures with the recipe generation procedure. Our proposed model can produce high-quality and coherent recipes, and achieve the state-of-the-art performance on the benchmark Recipe1M dataset. AI Singapore Accepted version This research is supported, in part, by the National Research Foundation (NRF), Singapore under its AI Singapore Programme (AISG Award No: AISG-GC-2019-003) and under its NRF Investigatorship Programme (NRFI Award No. NRF-NRFI05-2019-0002). Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not reflect the views of National Research Foundation, Singapore. This research is also supported, in part, by the Singapore Ministry of Health under its National Innovation Challenge on Active and Confident Ageing (NIC Project No. MOH/NIC/COG04/2017 and MOH/NIC/HAIG03/2017), and the MOE Tier-1 research grants: RG28/18 (S) and RG22/19 (S). 2021-06-08T06:27:34Z 2021-06-08T06:27:34Z 2020 Conference Paper Wang, H., Lin, G., Hoi, S. C. H. & Miao, C. (2020). Structure-aware generation network for recipe generation from images. 2020 European Conference on Computer Vision (ECCV’20), 12372 LNCS, 359-374. https://dx.doi.org/10.1007/978-3-030-58583-9_22 9783030585822 https://hdl.handle.net/10356/150963 10.1007/978-3-030-58583-9_22 2-s2.0-85097372602 12372 LNCS 359 374 en © 2020 Springer Nature Switzerland AG. This is a post-peer-review, pre-copyedit version of an article published in 2020 European Conference on Computer Vision (ECCV’20). The final authenticated version is available online at: https://doi.org/10.1007/978-3-030-58583-9_22 application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Structure Learning Text Generation |
spellingShingle |
Engineering::Computer science and engineering Structure Learning Text Generation Wang, Hao Lin, Guosheng Hoi, Steven C. H. Miao, Chunyan Structure-aware generation network for recipe generation from images |
description |
Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task. However, compared with image captioning datasets, the target recipes are long-length paragraphs and do not have annotations on structure information. To address the above limitations, we propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the inferred tree structures with the recipe generation procedure. Our proposed model can produce high-quality and coherent recipes, and achieve the state-of-the-art performance on the benchmark Recipe1M dataset. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Wang, Hao Lin, Guosheng Hoi, Steven C. H. Miao, Chunyan |
format |
Conference or Workshop Item |
author |
Wang, Hao Lin, Guosheng Hoi, Steven C. H. Miao, Chunyan |
author_sort |
Wang, Hao |
title |
Structure-aware generation network for recipe generation from images |
title_short |
Structure-aware generation network for recipe generation from images |
title_full |
Structure-aware generation network for recipe generation from images |
title_fullStr |
Structure-aware generation network for recipe generation from images |
title_full_unstemmed |
Structure-aware generation network for recipe generation from images |
title_sort |
structure-aware generation network for recipe generation from images |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/150963 |
_version_ |
1702431304502476800 |