Cookgan: Causality based text-to-image synthesis
This paper addresses the problem of text-to-image synthesis from a new perspective, i.e., the cause-and-effect chain in image generation. Causality is a common phenomenon in cooking. The dish appearance changes depending on the cooking actions and ingredients. The challenge of synthesis is that a ge...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6484 https://ink.library.smu.edu.sg/context/sis_research/article/7487/viewcontent/Zhu_CookGAN_Causality_Based_Text_to_Image_Synthesis_CVPR_2020_paper.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7487 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-74872022-01-10T05:31:46Z Cookgan: Causality based text-to-image synthesis ZHU, Bin NGO, Chong-wah This paper addresses the problem of text-to-image synthesis from a new perspective, i.e., the cause-and-effect chain in image generation. Causality is a common phenomenon in cooking. The dish appearance changes depending on the cooking actions and ingredients. The challenge of synthesis is that a generated image should depict the visual result of action-on-object. This paper presents a new network architecture, CookGAN, that mimics visual effect in causality chain, preserves fine-grained details and progressively upsamples image. Particularly, a cooking simulator sub-network is proposed to incrementally make changes to food images based on the interaction between ingredients and cooking methods over a series of steps. Experiments on Recipe1M verify that CookGAN manages to generate food images with reasonably impressive inception score. Furthermore, the images are semantically interpretable and manipulable. 2020-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6484 info:doi/10.1109/CVPR42600.2020.00556 https://ink.library.smu.edu.sg/context/sis_research/article/7487/viewcontent/Zhu_CookGAN_Causality_Based_Text_to_Image_Synthesis_CVPR_2020_paper.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Computer Sciences Graphics and Human Computer Interfaces |
spellingShingle |
Computer Sciences Graphics and Human Computer Interfaces ZHU, Bin NGO, Chong-wah Cookgan: Causality based text-to-image synthesis |
description |
This paper addresses the problem of text-to-image synthesis from a new perspective, i.e., the cause-and-effect chain in image generation. Causality is a common phenomenon in cooking. The dish appearance changes depending on the cooking actions and ingredients. The challenge of synthesis is that a generated image should depict the visual result of action-on-object. This paper presents a new network architecture, CookGAN, that mimics visual effect in causality chain, preserves fine-grained details and progressively upsamples image. Particularly, a cooking simulator sub-network is proposed to incrementally make changes to food images based on the interaction between ingredients and cooking methods over a series of steps. Experiments on Recipe1M verify that CookGAN manages to generate food images with reasonably impressive inception score. Furthermore, the images are semantically interpretable and manipulable. |
format |
text |
author |
ZHU, Bin NGO, Chong-wah |
author_facet |
ZHU, Bin NGO, Chong-wah |
author_sort |
ZHU, Bin |
title |
Cookgan: Causality based text-to-image synthesis |
title_short |
Cookgan: Causality based text-to-image synthesis |
title_full |
Cookgan: Causality based text-to-image synthesis |
title_fullStr |
Cookgan: Causality based text-to-image synthesis |
title_full_unstemmed |
Cookgan: Causality based text-to-image synthesis |
title_sort |
cookgan: causality based text-to-image synthesis |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/6484 https://ink.library.smu.edu.sg/context/sis_research/article/7487/viewcontent/Zhu_CookGAN_Causality_Based_Text_to_Image_Synthesis_CVPR_2020_paper.pdf |
_version_ |
1770575973725503488 |