Layout generation as intermediate action sequence prediction
Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generat...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8088 https://ink.library.smu.edu.sg/context/sis_research/article/9091/viewcontent/AAAI_layout_pvoa.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9091 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-90912023-09-07T07:28:29Z Layout generation as intermediate action sequence prediction YANG, Huiting HUANG, Danqing LIN, Chin-Yew HE, Shengfeng Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt at predicting the absolute value of bounding box for each element, where such target representation has hidden the information of higher-order design operations like repetition (e.g. copy the size of the previously generated element). In this paper, we introduce a novel action schema to encode these operations for better modeling the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs the intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performances on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID adapted in this task, and observe that previous works use different settings to train the feature extractor for obtaining real/generated data distribution, which leads to inconsistent conclusions. We conduct an in-depth analysis on this metric and settle for a more robust and reliable evaluation setting. 2023-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8088 info:doi/10.1609/aaai.v37i9.26277 https://ink.library.smu.edu.sg/context/sis_research/article/9091/viewcontent/AAAI_layout_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep Generative Models & Autoencoders Computational Photography Image & Video Synthesis Deep Neural Network Algorithms valuation and Analysis (Machine Learning) Artificial Intelligence and Robotics Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Deep Generative Models & Autoencoders Computational Photography Image & Video Synthesis Deep Neural Network Algorithms valuation and Analysis (Machine Learning) Artificial Intelligence and Robotics Theory and Algorithms |
spellingShingle |
Deep Generative Models & Autoencoders Computational Photography Image & Video Synthesis Deep Neural Network Algorithms valuation and Analysis (Machine Learning) Artificial Intelligence and Robotics Theory and Algorithms YANG, Huiting HUANG, Danqing LIN, Chin-Yew HE, Shengfeng Layout generation as intermediate action sequence prediction |
description |
Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt at predicting the absolute value of bounding box for each element, where such target representation has hidden the information of higher-order design operations like repetition (e.g. copy the size of the previously generated element). In this paper, we introduce a novel action schema to encode these operations for better modeling the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs the intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performances on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID adapted in this task, and observe that previous works use different settings to train the feature extractor for obtaining real/generated data distribution, which leads to inconsistent conclusions. We conduct an in-depth analysis on this metric and settle for a more robust and reliable evaluation setting. |
format |
text |
author |
YANG, Huiting HUANG, Danqing LIN, Chin-Yew HE, Shengfeng |
author_facet |
YANG, Huiting HUANG, Danqing LIN, Chin-Yew HE, Shengfeng |
author_sort |
YANG, Huiting |
title |
Layout generation as intermediate action sequence prediction |
title_short |
Layout generation as intermediate action sequence prediction |
title_full |
Layout generation as intermediate action sequence prediction |
title_fullStr |
Layout generation as intermediate action sequence prediction |
title_full_unstemmed |
Layout generation as intermediate action sequence prediction |
title_sort |
layout generation as intermediate action sequence prediction |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8088 https://ink.library.smu.edu.sg/context/sis_research/article/9091/viewcontent/AAAI_layout_pvoa.pdf |
_version_ |
1779157150253187072 |