Structure-aware generation network for recipe generation from images

Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task...

Full description

Saved in:
Bibliographic Details
Main Authors: Wang, Hao, Lin, Guosheng, Hoi, Steven C. H., Miao, Chunyan
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2021
Subjects:
Online Access:https://hdl.handle.net/10356/150963
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-150963
record_format dspace
spelling sg-ntu-dr.10356-1509632021-06-08T06:38:27Z Structure-aware generation network for recipe generation from images Wang, Hao Lin, Guosheng Hoi, Steven C. H. Miao, Chunyan School of Computer Science and Engineering 2020 European Conference on Computer Vision (ECCV’20) Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly (LILY) Engineering::Computer science and engineering Structure Learning Text Generation Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task. However, compared with image captioning datasets, the target recipes are long-length paragraphs and do not have annotations on structure information. To address the above limitations, we propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the inferred tree structures with the recipe generation procedure. Our proposed model can produce high-quality and coherent recipes, and achieve the state-of-the-art performance on the benchmark Recipe1M dataset. AI Singapore Accepted version This research is supported, in part, by the National Research Foundation (NRF), Singapore under its AI Singapore Programme (AISG Award No: AISG-GC-2019-003) and under its NRF Investigatorship Programme (NRFI Award No. NRF-NRFI05-2019-0002). Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not reflect the views of National Research Foundation, Singapore. This research is also supported, in part, by the Singapore Ministry of Health under its National Innovation Challenge on Active and Confident Ageing (NIC Project No. MOH/NIC/COG04/2017 and MOH/NIC/HAIG03/2017), and the MOE Tier-1 research grants: RG28/18 (S) and RG22/19 (S). 2021-06-08T06:27:34Z 2021-06-08T06:27:34Z 2020 Conference Paper Wang, H., Lin, G., Hoi, S. C. H. & Miao, C. (2020). Structure-aware generation network for recipe generation from images. 2020 European Conference on Computer Vision (ECCV’20), 12372 LNCS, 359-374. https://dx.doi.org/10.1007/978-3-030-58583-9_22 9783030585822 https://hdl.handle.net/10356/150963 10.1007/978-3-030-58583-9_22 2-s2.0-85097372602 12372 LNCS 359 374 en © 2020 Springer Nature Switzerland AG. This is a post-peer-review, pre-copyedit version of an article published in 2020 European Conference on Computer Vision (ECCV’20). The final authenticated version is available online at: https://doi.org/10.1007/978-3-030-58583-9_22 application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Structure Learning
Text Generation
spellingShingle Engineering::Computer science and engineering
Structure Learning
Text Generation
Wang, Hao
Lin, Guosheng
Hoi, Steven C. H.
Miao, Chunyan
Structure-aware generation network for recipe generation from images
description Sharing food has become very popular with the development of social media. For many real-world applications, people are keen to know the underlying recipes of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task. However, compared with image captioning datasets, the target recipes are long-length paragraphs and do not have annotations on structure information. To address the above limitations, we propose a novel framework of Structure-aware Generation Network (SGN) to tackle the food recipe generation task. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the inferred tree structures with the recipe generation procedure. Our proposed model can produce high-quality and coherent recipes, and achieve the state-of-the-art performance on the benchmark Recipe1M dataset.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Wang, Hao
Lin, Guosheng
Hoi, Steven C. H.
Miao, Chunyan
format Conference or Workshop Item
author Wang, Hao
Lin, Guosheng
Hoi, Steven C. H.
Miao, Chunyan
author_sort Wang, Hao
title Structure-aware generation network for recipe generation from images
title_short Structure-aware generation network for recipe generation from images
title_full Structure-aware generation network for recipe generation from images
title_fullStr Structure-aware generation network for recipe generation from images
title_full_unstemmed Structure-aware generation network for recipe generation from images
title_sort structure-aware generation network for recipe generation from images
publishDate 2021
url https://hdl.handle.net/10356/150963
_version_ 1702431304502476800