Layout generation as intermediate action sequence prediction

Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generat...

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Huiting, HUANG, Danqing, LIN, Chin-Yew, HE, Shengfeng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8088
https://ink.library.smu.edu.sg/context/sis_research/article/9091/viewcontent/AAAI_layout_pvoa.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9091
record_format dspace
spelling sg-smu-ink.sis_research-90912023-09-07T07:28:29Z Layout generation as intermediate action sequence prediction YANG, Huiting HUANG, Danqing LIN, Chin-Yew HE, Shengfeng Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt at predicting the absolute value of bounding box for each element, where such target representation has hidden the information of higher-order design operations like repetition (e.g. copy the size of the previously generated element). In this paper, we introduce a novel action schema to encode these operations for better modeling the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs the intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performances on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID adapted in this task, and observe that previous works use different settings to train the feature extractor for obtaining real/generated data distribution, which leads to inconsistent conclusions. We conduct an in-depth analysis on this metric and settle for a more robust and reliable evaluation setting. 2023-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8088 info:doi/10.1609/aaai.v37i9.26277 https://ink.library.smu.edu.sg/context/sis_research/article/9091/viewcontent/AAAI_layout_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep Generative Models & Autoencoders Computational Photography Image & Video Synthesis Deep Neural Network Algorithms valuation and Analysis (Machine Learning) Artificial Intelligence and Robotics Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Deep Generative Models & Autoencoders
Computational Photography
Image & Video Synthesis
Deep Neural Network Algorithms
valuation and Analysis (Machine Learning)
Artificial Intelligence and Robotics
Theory and Algorithms
spellingShingle Deep Generative Models & Autoencoders
Computational Photography
Image & Video Synthesis
Deep Neural Network Algorithms
valuation and Analysis (Machine Learning)
Artificial Intelligence and Robotics
Theory and Algorithms
YANG, Huiting
HUANG, Danqing
LIN, Chin-Yew
HE, Shengfeng
Layout generation as intermediate action sequence prediction
description Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt at predicting the absolute value of bounding box for each element, where such target representation has hidden the information of higher-order design operations like repetition (e.g. copy the size of the previously generated element). In this paper, we introduce a novel action schema to encode these operations for better modeling the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs the intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performances on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID adapted in this task, and observe that previous works use different settings to train the feature extractor for obtaining real/generated data distribution, which leads to inconsistent conclusions. We conduct an in-depth analysis on this metric and settle for a more robust and reliable evaluation setting.
format text
author YANG, Huiting
HUANG, Danqing
LIN, Chin-Yew
HE, Shengfeng
author_facet YANG, Huiting
HUANG, Danqing
LIN, Chin-Yew
HE, Shengfeng
author_sort YANG, Huiting
title Layout generation as intermediate action sequence prediction
title_short Layout generation as intermediate action sequence prediction
title_full Layout generation as intermediate action sequence prediction
title_fullStr Layout generation as intermediate action sequence prediction
title_full_unstemmed Layout generation as intermediate action sequence prediction
title_sort layout generation as intermediate action sequence prediction
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8088
https://ink.library.smu.edu.sg/context/sis_research/article/9091/viewcontent/AAAI_layout_pvoa.pdf
_version_ 1779157150253187072