LaVie: high-quality video generation with cascaded latent diffusion models
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task to simultaneously (a) accomplish the synthesis of visually realistic and temporally coherent videos while (b) pr...
Saved in:
Main Authors: | , , , , , , , , , , , , , , , , , , , |
---|---|
其他作者: | |
格式: | Article |
語言: | English |
出版: |
2025
|
主題: | |
在線閱讀: | https://hdl.handle.net/10356/183061 |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|