Masked diffusion transformer is a strong image synthesizer
Despite its success in image synthesis, we observe that diffusion probabilistic models (DPMs) often lack contextual reasoning ability to learn the relations among object parts in an image, leading to a slow learning process. To solve this issue, we propose a Masked Diffusion Transformer (MDT) that i...
Saved in:
Main Authors: | GAO, Shanghua, ZHOU, Pan, CHENG, Ming-Ming, YAN, Shuicheng |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9024 https://ink.library.smu.edu.sg/context/sis_research/article/10027/viewcontent/2023_ICCV_MDT.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Towards understanding why mask reconstruction pretraining helps in downstream tasks
by: PAN, Jiachun, et al.
Published: (2023) -
EditAnything: Empowering unparalleled flexibility in image editing and generation
by: GAO, Shanghua, et al.
Published: (2023) -
Computer-driven synthesizer (musical instrument digital interface)
by: Encarnacion, Memory Joy R., et al.
Published: (1994) -
Multimedia event detection: Strong by integration
by: ZHANG, Hao, et al.
Published: (2015) -
Few-shot learner parameterization by diffusion time-steps
by: YUE, Zhongqi, et al.
Published: (2024)